MGEmasker: Python project description


MGE Masker

This package finds mobile genetic elements (MGEs) based on the annotations in a rich sequence file (GenBank or EMBL). It has 3 subcommands:

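  1. find_mges searches a rich sequence file for features annotated with text that suggests an MGE-associated element
  2. mask_mges masks regions of a pseudogenome alignment using the regions in a GFF file produced by the find_mges command
  3. default_matches shows the default regex patterns used when searching for MGEs. These can be overridden by supplying a similarly formatted file with the -m argument of the find_mges command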

Default pattern matches

.*\b[tT]ranspos
.*\b[pP]hage
.*\b[rR]epeat
.*\b[rR]eptitive
.*\b[iI]nsertion sequence
.*\bIS
.*\b[tT]n
.*\b[iI]ntegr
.*\b[Cc]onjug
.*\b[Pp]lasmid
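
As a rough illustration of how patterns like these behave, the sketch below tests a few annotation strings against the list above using Python's standard `re` module. The function name `looks_like_mge` and the example annotations are hypothetical, and the use of `re.match` is an assumption; the package's own matching code may differ.

```python
import re

# Patterns copied verbatim from the default list above.
DEFAULT_PATTERNS = [
    r".*\b[tT]ranspos",
    r".*\b[pP]hage",
    r".*\b[rR]epeat",
    r".*\b[rR]eptitive",
    r".*\b[iI]nsertion sequence",
    r".*\bIS",
    r".*\b[tT]n",
    r".*\b[iI]ntegr",
    r".*\b[Cc]onjug",
    r".*\b[Pp]lasmid",
]
COMPILED = [re.compile(pattern) for pattern in DEFAULT_PATTERNS]


def looks_like_mge(annotation: str) -> bool:
    """Return True if any pattern matches the annotation text.

    Hypothetical helper: assumes matching from the start of the string
    with re.match, which the leading '.*' in each pattern makes
    equivalent to finding the keyword anywhere on the line.
    """
    return any(pattern.match(annotation) for pattern in COMPILED)


# Made-up annotation strings for illustration only.
for text in ["putative transposase", "IS256 family protein",
             "phage tail protein", "DNA gyrase subunit A"]:
    print(f"{text!r}: {looks_like_mge(text)}")
```

Because each keyword is preceded by `\b`, it must start at a word boundary, so an annotation such as "bacteriophage" would not be matched by the `[pP]hage` pattern in this sketch.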

Installation

Python 3 only

pip3 install MGEmasker

Usage

usage: mge_masker [-h] {find_mges,mask_mges,default_matches} ...

A module to find MGEs in a rich sequence file and mask regions corresponding to the MGEs in a pseudogenome alignment.

The find_mges command searches a gbk or embl file for features that have MGE-associated annotations.
It writes a GFF file containing the positions of the matched features.

The mask_mges command takes a GFF file produced using the find_mges command and masks those regions in all sequences of a pseudogenome alignment based on the reference sequence used to find MGEs.

positional arguments:
  {find_mges,mask_mges,default_matches}
                        The following commands are available. Type mge_masker
                        <COMMAND> -h for more help on a specific commands
    find_mges           Search a rich sequence file for features annotated
                        with text that suggests a MGE-associated element
    mask_mges           Mask regions from a pseudogenome alignment with the
                        regions in a GFF file produced using the find_mges
                        command
    default_matches     Show the default regex patterns used when searching
                        for MGEs

optional arguments:
  -h, --help            show this help message and exit

find_mges usage

usage: mge_masker find_mges [-h] -g GENOME_FILE_PATH [-f {genbank,embl}]
                            [-i MERGE_INTERVAL] [-m MGE_FILE_PATH]

optional arguments:
  -h, --help            show this help message and exit
  -g GENOME_FILE_PATH, --genome_file_path GENOME_FILE_PATH
                        path to a genome file
  -f {genbank,embl}, --file_format {genbank,embl}
                        genome file format
  -i MERGE_INTERVAL, --merge_interval MERGE_INTERVAL
                        The maximum distance between MGEs when performing the
                        merging step (Default 1000bp)
  -m MGE_FILE_PATH, --mge_file_path MGE_FILE_PATH
                        path to a file containing regex MGE annotations
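
The merging step controlled by -i can be pictured with the following minimal sketch. It assumes that two regions are merged when the gap between the end of one and the start of the next is at most the merge interval; the function name `merge_regions` and the coordinates are hypothetical, and the package may define the distance differently.

```python
from typing import List, Tuple

Region = Tuple[int, int]  # (start, end) positions on the reference


def merge_regions(regions: List[Region], merge_interval: int = 1000) -> List[Region]:
    """Merge regions whose gap is no larger than merge_interval.

    Assumption: the gap is measured from the end of the previous
    region to the start of the next one.
    """
    merged: List[Region] = []
    for start, end in sorted(regions):
        if merged and start - merged[-1][1] <= merge_interval:
            # Close enough to the previous region: extend it.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# The first two regions are 700 bp apart and are merged; the third is
# 3500 bp away and stays separate.
print(merge_regions([(100, 200), (900, 1500), (5000, 5200)]))
# → [(100, 1500), (5000, 5200)]
```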

mask_mges usage

usage: mge_masker mask_mges [-h] -f FASTA_PATH -g GFF_FILE_PATH
                            [-m MASKING_CHARACTER]

optional arguments:
  -h, --help            show this help message and exit
  -f FASTA_PATH, --fasta_path FASTA_PATH
                        path to either a single fasta reference or a
                        pseudogenome alignment file
  -g GFF_FILE_PATH, --gff_file_path GFF_FILE_PATH
                        path to a gff file containing MGE regions to be masked
  -m MASKING_CHARACTER, --masking_character MASKING_CHARACTER
                        character used to mask (default: N)
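
To make the masking step concrete, here is a minimal sketch that replaces the given regions with the masking character in every sequence of an alignment. It assumes GFF-style 1-based, inclusive coordinates on the reference and that all aligned sequences share those coordinates; the function names and the toy alignment are hypothetical and do not reflect the package's internals.

```python
from typing import Dict, List, Tuple


def mask_sequence(seq: str, regions: List[Tuple[int, int]],
                  masking_character: str = "N") -> str:
    """Replace each 1-based, inclusive (start, end) region with the mask."""
    chars = list(seq)
    for start, end in regions:
        first = max(start - 1, 0)      # convert to a 0-based index
        last = min(end, len(chars))    # clamp to the sequence length
        for i in range(first, last):
            chars[i] = masking_character
    return "".join(chars)


def mask_alignment(alignment: Dict[str, str], regions: List[Tuple[int, int]],
                   masking_character: str = "N") -> Dict[str, str]:
    """Apply the same regions to every sequence in the alignment."""
    return {name: mask_sequence(seq, regions, masking_character)
            for name, seq in alignment.items()}


# Toy pseudogenome alignment; positions 3 to 5 are masked in both sequences.
alignment = {"reference": "ACGTACGTAC", "sample_1": "ACGTTCGTAC"}
masked = mask_alignment(alignment, [(3, 5)])
print(masked["reference"])  # → ACNNNCGTAC
```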
