用系统发育最大简约法预测谱系特异性序列元素的得失。

mapGL的Python项目详细描述


MAPGL

H2>基于系统进化最大简约性的基因组序列元件的遗传增益和丢失预测

将基因组区域标记为直系,在查询物种中获得,或在 目标物种,基于最近 共同祖先(mrca)。链接的路线文件用于映射 查询目标和一个或多个外部组物种。直接从 对目标的查询标记为Orthologs,并且 目标物种在输出中给出。非映射特征被指定为 基于最大简约算法预测存在或损益的存在性 或者没有参加核磁共振检查。

基于bnmapper.py,作者:ogert denas(james taylor lab):

依赖关系

纽比 赛隆 六

用法

mapGL.py [-h] [-o FILE] [-t FLOAT] [-g GAP] [-v {info,debug,silent}] [-k] input tree qname tname alignments [alignments ...]

必需参数

ArgumentDescription
inputInput regions to process. Should be in standard bed format. Only the first four bed fields will be used.
treePhylogenetic tree describing relationships of query and target species to outgroups. Must be in standard Newick format. Branch lengths are optional, and will be ignored.
qnameName of the query species. Regions from this species will be mapped to target species coordinates.
tnameName of the target species. Regions from the query species will be mapped to coordinates from this species.
alignmentsAlignment files (.chain or .pkl): One for the target species and one per outgroup species. Files should be named according to the convention: qname.tname[...].chain.gz, where qname is the query species name and tname is the name of the target/outgroup species. Names used for qname and tname must match names used in the phylogenetic tree.

选项

OptionDescription
-h, --helpShow help message and exit.
-o FILE, --output FILEOutput file. (default: stdout)
-t FLOAT, --threshold FLOATMapping threshold i.e., (elem * threshold) <= mapped_elem (default: 0.0)
-g GAP, --gap GAPIgnore elements with an insertion/deletion of this or bigger size. (default: -1)
-v {info,debug,silent}, --verbose {info,debug,silent}Verbosity level (default: info)
-d, --drop_splitFollow the bnMapper convention of silently dropping elements that span multiple chains, rather than the liftOver mapping convention for split alignments: keep elements that span multiple chains and report the longest aligned segment. This is not recommended, as it may lead to spurious gain/loss predictions for orthologous elements that happen to be split across chains due to chromosomal rearrangements, etc... (default: False)
-i {BED,narrowPeak}, --in_format {BED,narrowPeak}Input file format. (default: BED)

输出

预测以制表符分隔的格式报告,前四列遵循BED4约定。在“状态”列中报告预测的进化历史(即正交、查询中的增益或目标中的损失)。最后三列包含映射(正交)元素在目标坐标中的映射位置。

ColumnDescription
chromChromosome on which the query element is located.
startStart position on query chromosome.
endEnd position on query chromosome.
nameElement name or ID.
peakPeak location (narrowPeak input) or element midpoint (BED input)
statusPredicted phylogenetic history: ortholog, gain_qname, or loss_tname
mapped chromFor mapped (ortholog) elements, the chromosome on which the mapped element is located, in target coordinates.
mapped startFor mapped (ortholog) elements, the start position on the target chromosome on which the mapped element is located.
mapped endFor mapped (ortholog) elements, the end position on the target chromosome on which the mapped element is located.
mapped_peakFor mapped (ortholog) elements, the mapped peak position (narrowPeak input) or mapped element midpoint (BED input).

版权所有2018,Adam Diehl(adadiehl@umich.edu),密歇根大学博伊尔实验室

欢迎加入QQ群-->: 979659372 Python中文网_新手群

推荐PyPI第三方库


热门话题
java快速确定列表是否包含数组中至少一项的方法   java在任何HandlerMapping之前修改HttpServletRequest?   java这个SerialVersionID是通用版本标识符吗?   有没有Java库可以帮助绘制细胞自动机图形?   java如何使用microsoft graph api获取自定义扩展属性   java通过环境变量在Spring Boot中设置日志记录级别   java子类不允许world运行   如何检查Java源代码中是否没有对包的引用   java如何使用数组按字母顺序排列字符串。排序()?   数据结构在Java中,我如何访问在树状图中放置元素的顺序?   java为什么调用系统是不好的做法。gc()?   JavaTomcat删除临时目录和工作目录解决了一个类路径问题,但为什么呢?