我对机器学习和孤独症很陌生。我正在学习各种ml概念,请原谅我的无知
我正在从事一个项目,其中我需要根据上一个历史数据中的销售代表电话预测未来一个季度的销售代表电话。我在此提供一个样本数据框架供您参考,并请提供建议
QTR4的代表呼叫预测应基于客户号码的代表呼叫&;过去三个季度可用的产品标识
df = pd.DataFrame({"CUSTOMER_NUMBER": ["CUST1", "CUST1", "CUST1", "CUST1", "CUST1", "CUST1", "CUST1", "CUST1", "CUST1", "CUST2", "CUST2", "CUST2", "CUST2", "CUST2", "CUST2", "CUST2", "CUST3", "CUST3", "CUST3", "CUST4", "CUST4", "CUST4"],
"PRODUCT": ["PRODUCT1", "PRODUCT2", "PRODUCT3", "PRODUCT1", "PRODUCT2", "PRODUCT3", "PRODUCT1", "PRODUCT2", "PRODUCT3", "PRODUCT1", "PRODUCT2", "PRODUCT3", "PRODUCT1", "PRODUCT2", "PRODUCT3", "PRODUCT3", "PRODUCT3", "PRODUCT3", "PRODUCT3", "PRODUCT1", "PRODUCT1", "PRODUCT2"],
"REP_VISITS": ["3", "3", "3", "3", "3", "3", "4", "4", "4", "3", "2", "2", "4", "6", "8", "5", "3", "1", "3", "2", "0", "3"],
"QTR": ["QTR1", "QTR1", "QTR1", "QTR2", "QTR2", "QTR2", "QTR3", "QTR3", "QTR3", "QTR1", "QTR1", "QTR1", "QTR2", "QTR2", "QTR2", "QTR3", "QTR1", "QTR2", "QTR3", "QTR1", "QTR2", "QTR3"],
"START_DATE": ["2020-01-01", "2020-01-01", "2020-01-01", "2020-04-01", "2020-04-01", "2020-04-01", "2020-07-01", "2020-07-01", "2020-07-01", "2020-01-01", "2020-01-01", "2020-01-01", "2020-04-01", "2020-04-01", "2020-04-01","2020-07-01", "2020-01-01", "2020-04-01", "2020-07-01", "2020-01-01", "2020-04-01", "2020-07-01"],
"END_DATE": ["2020-03-31", "2020-03-31", "2020-03-31", "2020-06-30", "2020-06-30", "2020-06-30", "2020-09-30", "2020-09-30", "2020-09-30", "2020-03-31", "2020-03-31", "2020-03-31", "2020-06-30", "2020-06-30", "2020-06-30", "2020-09-30", "2020-03-31", "2020-06-30", "2020-09-30", "2020-03-31", "2020-06-30", "2020-09-30"]})
数据框如下所示:
我需要找出QTR4的预测代表电话
CUST1|PRODUCT1||QTR4|
CUST1|PRODUCT2||QTR4|
CUST1|PRODUCT3||QTR4|
CUST2|PRODUCT1||QTR4|
CUST2|PRODUCT2||QTR4|
CUST2|PRODUCT3||QTR4|
CUST3|PRODUCT3||QTR4|
CUST4|PRODUCT1||QTR4|
CUST4|PRODUCT2||QTR4|
请指导我如何为客户/产品创建具有适当预测的培训数据集,以便我可以使用测试数据进行预测/评估
我认为您可以尝试使用客户编号和产品id作为特征,并使用逻辑回归或决策树来训练一个简单的分类器。您可以尝试对不同的客户编号和产品ID使用1-hot编码。如果您尝试这种方法,REP_访问可以是标签,功能可以是cust1、cust2、cust3、product1、product2等。 scikitlearn有这些算法的实现,它们易于使用。希望这有助于:
相关问题 更多 >
编程相关推荐