Python exp-runner包_程序模块 - PyPI

数据分析与机器学习实验框架

exp-runner的Python项目详细描述

实验转轮（实验转轮）

exp runner是一个简单且可扩展的框架，用于python中的数据分析和机器学习实验。

结构
框架包括以下步骤：
数据加载
数据转换
模型培训和测试
绩效评估
结果保存

主要功能
generability：支持模型和方法的变量，它可以用于许多任务（如预处理，降维，分类，回归、聚类、统计检验等）
flexability：可以轻松跳过和/或包括步骤
动态加载：在运行时自动导入模块-不需要额外的行
安装
pip install exp-runner
使用量
假设您的项目具有以下结构：
MyAwesomeProject/ main.py my_custom_module.py data/ data_00.npy data_01.npy ... data_NN.npy protocols/ experiment_config.json results/
给我一个密码！

您只需要在JSON配置文件中描述您的framework：

实验配置json
{"Setup":{"description":"You can add detailed description of the experiment","random_seed":42},"Dataset":{"class":"my_custom_module.MyAwesomeDataLoader","args":{"path_to_data":"data/*.npy"}},"Transforms":[{"class":"sklearn.decomposition.PCA","args":{"n_components":3,"whiten":true}}],"Model":{"class":"sklearn.cluster.KMeans","args":{"n_clusters":3,"n_jobs":-1,"verbose":0}},"Metric":{"class":"my_custom_module.SklearnMetricWrapper","args":{"metric":"normalized_mutual_info_score"}},"Saver":{"class":"my_custom_module.CSVReport","args":{"path_to_output":"results/evaluation_results.csv","sep":";"}}}
下面是前面提到的类（单击）：
我的自定义模块.py
importosimportglobimportnumpyasnpimportsklearn.metricsfromexp_runnerimportDataset,Metric,SaverfromcollectionsimportdefaultdictfromtypingimportAny,Dict,List,Union,NoReturn,Iterable,Callablefromsklearn.model_selectionimportStratifiedShuffleSplitclassMyAwesomeDataLoader(Dataset):def__init__(self,path_to_data:str,test_size:float=0.1,training:bool=True):super(MyAwesomeDataLoader,self).__init__()self._samples=dict()self._labels=dict()self._splits=defaultdict(dict)paths_to_data=glob.glob(path_to_data)forpathinpaths_to_data:fname=os.path.basename(path)data=np.load(path)X=data[:,:-1]y=data[:,-1]indices_train,indices_test=next(StratifiedShuffleSplit(test_size=test_size).split(X,y))self._samples[fname]=Xself._labels[fname]=yself._splits[fname]['train']=indices_trainself._splits[fname]['test']=indices_testself._indices=list(self._samples.keys())self._training=trainingdef__getitem__(self,index:int)->Dict[str,Dict[str,Union[str,np.ndarray]]]:ifnot(0<=index<len(self._indices)):raiseIndexErrorfname=self._indices[index]item={'X':self._samples[fname][self._splits[fname]['train']ifself.trainingelseself._splits[fname]['test']],'y':self._labels[fname][self._splits[fname]['train']ifself.trainingelseself._splits[fname]['test']]}item['desc']='it is possible to add description for each data sample'return{'filename':fname,'item':item}def__len__(self)->int:returnlen(self._indices)@propertydeftraining(self):returnself._trainingclassSklearnMetricWrapper(Metric):def__init__(self,metric:str):super(SklearnMetricWrapper,self).__init__()metric=getattr(sklearn.metrics,metric)self._metric:Callable[[Iterable[Union[float,int]],Iterable[Union[float,int]]],float]=metricdef__call__(self,y_true:Iterable[Union[float,int]],y_pred:Iterable[Union[float,int]])->float:returnself._metric(y_true,y_pred)classCSVReport(Saver):def__init__(self,path_to_output:str,sep:str=';',append:bool=True):super(CSVReport,self).__init__()self.path_to_output=path_to_outputself.sep=sepself.mode='a+'ifappendelse'w+'defsave(self,report:List[Dict[str,Any]])->NoReturn:withopen(self.path_to_output,self.mode)ascsv:forentryinreport:line=self.sep.join([entry['filename'],entry['desc'],entry['perf']])+'\n'csv.write(line)
最后，要在终端中运行实验类型：
cd /path/to/MyAwesomeProject python main.py --config protocols/experiment_config.json
标签：
to
path
import
self
机器
框架
data
def
fname
runner
欢迎加入QQ群-->： 979659372
推荐PyPI第三方库
more.chameleon
morepath的变色龙模板集成
dynamodbencrpytionsdk
你是说要安装dynamodb加密sdk吗？
omikuji
python绑定到omikuji，一种有效的分区标签树实现及其在极端多标签分类中的变化
openslides-gui
用于管理openslides服务器的gui前端
txMongoModel
异步MongoDB的数据模型包装器
flake8-variables-names
flake8扩展，有助于创建更可读的变量名
z3c.formjsdemo
“z3c.formjs”的一组演示应用程序``
folder-syncer
一个简单的文件夹同步器
scrapy-googleauth
用于scrapy的google auth下载中间件
jokes
用python编写的实验语言
aiida-gaussian-datatypes
作为一级公民管理高斯数据类型（基集和伪势）的aiida数据插件
analysis
python程序的源代码分析
fir
时间序列分析的有限脉冲响应包。
c2.search.customdescription
这个包为plone提供了类似google或yahoo的搜索结果视图。
logicmin
逻辑最小化

导航栏
项目描述
版本历史
下载文件
项目链接
首页
标签
许可证: BSD许可证（BSD 3条款）
作者信息:: 暂无
维护者
slipnits
最新PyPI项目
italian_vip_says
UFx
vofs
fake_item_generator
NerEva
django-monologue
fio_product_attribute_strict
climailsystem
pyshape
tbb-devel
npy-append-arra
anthill.tal.macrorenderer
odoo11-addon-stock-a
uuuu
contextil
fyl_nester
appomatic_renderable
teacher
chuletas
slackbot_ce
最新Python常见问题
无法使用Django restfram生成PDF
无法使用Django Rest框架发送压缩的gzip数据
无法使用Django rest框架进行身份验证(请求用户=匿名用户）
无法使用Django、Python和JavaScrip触发onclick函数
无法使用Django.views.generic.View保存表单
无法使用Django（python 2.7，OS X 10.11.1）
无法使用Django/mongoengine连接到MongoDB（身份验证失败）
无法使用Django\u mssql\u后端迁移到外部hos
无法使用Django&Python3.4连接到MySql
无法使用Django+nginx上载媒体文件
无法使用Django1.6导入名称模式
无法使用Django1.7和mongodb登录管理站点
无法使用Djangoadmin创建项目，进程使用了错误的路径，因为我事先安装了错误的Python
无法使用Djangockedi验证CBV中的字段
无法使用Djangocketditor上载图像（错误400）

exp-runner 0.1.0b2

exp-runner的Python项目详细描述

实验转轮（实验转轮）

结构 框架包括以下步骤：数据加载数据转换模型培训和测试 绩效评估结果保存

安装

使用量

我的自定义模块.py

推荐PyPI第三方库

more.chameleon

dynamodbencrpytionsdk

omikuji

openslides-gui

txMongoModel

flake8-variables-names

z3c.formjsdemo

folder-syncer

scrapy-googleauth

jokes

aiida-gaussian-datatypes

analysis

fir

c2.search.customdescription

logicmin

导 航 栏

项目 链接

标 签

维护者

最新PyPI项目

最新Python常见问题

结构
框架包括以下步骤：
数据加载
数据转换
模型培训和测试
绩效评估
结果保存

导航栏

项目链接

标签