简单的python包,用于生成和缓存任意深度的随机和染色体保持。
holdouts-generator的Python项目详细描述
简单的python包,用于生成和缓存任意深度的随机和染色体保持。
如何安装此软件包?
像往常一样,只需使用pip:
pip install holdouts_generator
生成随机保持
假设要生成3层holdouts,分别以0.3、0.2和0.1作为测试大小,5、3和2作为数量:
fromholdouts_generatorimportholdouts_generator,random_holdoutsdataset=pd.read_csv("path/to/my/dataset.csv")generator=holdouts_generator(dataset,holdouts=random_holdouts([0.3,0.2,0.1],[5,3,2]),cache=False,# Set this parameter to True to enable automatic cachingmemory_cache=False,# Set this parameter to True to enable automatic in memory caching, usefull when you would reload the objects multiple timescache_dir=".holdouts"# This is the default cache directory)for(training,testing),inner_holdoutsingenerator():for(inner_train,inner_test),small_holdoutsininner_holdouts():for(small_train,small_test),_insmall_holdouts():#do what you need :)
产生染色体保持率
假设你想产生两层持留物,两层外层的17号和18号染色体,三层内层的17/18号、20号和21号染色体:
fromholdouts_generatorimportholdouts_generator,chromosomal_holdoutsdataset=pd.read_csv("path/to/my/genomic_dataset.csv")generator=holdouts_generator(dataset,holdouts=chromosomal_holdouts([([17],[([18],None),([20],None),([21],None)])([18],[([17],None),([20],None),([21],None)])]),cache=False,# Set this parameter to True to enable automatic cachingmemory_cache=False,# Set this parameter to True to enable automatic in memory caching, usefull when you would reload the objects multiple timescache_dir=".holdouts"# This is the default cache directory)for(training,testing),inner_holdoutsingenerator():for(inner_train,inner_test),_ininner_holdouts():#do what you need :)
清除保持缓存
只需运行方法清除缓存:
fromholdouts_generatorimportclear_cacheclear_cache(cache_dir=".holdouts"# This is the default cache directory)
清除保持内存缓存
只需运行方法清除内存缓存:
fromholdouts_generatorimportclear_memory_cacheclear_memory_cache(cache_dir=".holdouts"# This is the default cache directory)