Python错误日志记录以检查重复的行和列

from datetime import datetime import logging logging.basicConfig( level=logging.INFO, format="%(asctime)s [%(threadName)-12.12s] [%(levelname)-5.5s] %(message)s", handlers=[logging.StreamHandler()]) os.chdir(r'M:\Loans') col_map = {'Loan #' : 'LoanNo', 'Last Name' : 'LastName', 'Purchase Price' : 'PurchasePrice', 'Loan Amt' : 'LoanAmt', 'Property Address' : 'PropertyAddress', 'City' : 'City', 'State' : 'State', 'Zip Code' : 'ZipCode', 'Interest Rate' : 'InterestRate', 'UPBCurrent' : 'UPBCurrent', 'NextDueDateAtPurchase' : 'NextDueDateAtPurchase', 'CurrentAdvanceRate': 'CurrentAdvanceRate', 'Comments' : 'Comments', 'CurrentAdvanceAmount': 'CurrentAdvanceAmount', 'SecondRoundCurrentAdvanceRate' : 'SecRoundCurrentAdvRate', 'SecondRoundCurrentAvanceAmount' : 'SecRoundCurrentAdvAmount', } for f in os.listdir(): logging.info('Reading in file {}'.format(f)) df=pd.read_excel('M:\Loans\Loan Blotter XYZ OLD.xlsx') df['UPBCurrent'] = None df['NextDueDateAtPurchase'] = None df = df[col_map.keys()] df.drop_duplicates(inplace=True) df.columns = [col_map[col] for col in df.columns] df['Channel'] = 'Whole Loans' df['DateCreated'] = datetime.today().date() df.to_excel(r'M:\Err Log.xlsx', index=False)

1条回答

网友

1楼 · 发布于 2024-10-03 17:21:45

要检查是否不会覆盖现有COL，请执行以下操作：

null_cols = ['UPBCurrent', 'UPBCurrent']
for null_col in null_cols:
    if null_col in df.columns:
        logging.error("{} will be overwritten.".format(null_col))
    else:
        logging.info("Adding null column {}.".format(null_col))
        df[null_col] = None

要检查拖放副本是否有效，请执行以下操作：

try:
    df.drop_duplicates(inplace=True)
except:
    logging.error("Failed to drop duplicate rows.")

相关问题更多 >

编程相关推荐

热门问题

热门文章