在Python中打包Arules

2024-09-28 23:16:08 发布

您现在位置:Python中文网/ 问答频道 /正文

我在Python中使用arules。我执行下面的代码来生成所有关联。我想知道如何将arules的输出转换成Python中的某些数据结构。类型'rpy2的输出。robobjects.methods.RS4'。 --下面是代码---

from rpy2.robjects import pandas2ri
pandas2ri.activate()
import collections
from collections import OrderedDict
import pandas as pd
import numpy as np
from rpy2.robjects.vectors import ListVector
from rpy2.robjects.packages import importr
arules = importr("arules")
od = OrderedDict()
od["supp"] = 0.0005
od["conf"] = 0.7
od["target"] = 'rules'
df = pd.DataFrame (
    [
        ['1','1', '1'],
        ['1', '0','0'],
        ['1', '1', '1'],
        ['1', '0', '0'],
        ['1', '1', '1'],
        ['1', '0', '1'],
        ['1', '1', '1'],
        ['0', '0', '1'],
        ['0', '1', '1'],
        ['1', '0', '1'],
    ],
    columns=list ('ABC')) 
result = ListVector(od)
df['A'] = df['A'].astype('category')
df['B'] = df['B'].astype('category')
df['C'] = df['C'].astype('category')
my_rules = arules.apriori(df, parameter=result)
print("herererererere")
print(type(my_rules))
print("rules")

Tags: 代码fromimportdfcollectionsrulesordereddictprint
1条回答
网友
1楼 · 发布于 2024-09-28 23:16:08

下面是一个极简主义者如何做到这一点的例子:

# prepare the data as a dataframe with boolean values
import pandas as pd

df = pd.DataFrame (
    [
        [True,True, True],
        [True, False,False],
        [True, True, True],
        [True, False, False],
        [True, True, True],
        [True, False, True],
        [True, True, True],
        [False, False, True],
        [False, True, True],
        [True, False, True],
    ],
    columns=list ('ABC')) 

# set up rpy2
from rpy2.robjects import pandas2ri
pandas2ri.activate()
import rpy2.robjects as ro
from rpy2.robjects.packages import importr
arules = importr("arules")

# run apriori
itsets = arules.apriori(df, 
   parameter = ro.ListVector({"supp": 0.1, "target": "frequent itemsets"}))

# get itemsets as a dataframe
print(arules.DATAFRAME(itsets))

# get quality as a dataframe
print(itsets.slots["quality"])

# get itemsets as a matrix
itemset_as_matrix = ro.r('function(x) as(items(x), "matrix")')
itemset_as_matrix(itsets)

相关问题 更多 >