从列表中长期删除字符串中的单引号

ID TableID Number Decimal Letter Random 0 A 1 1.8 NULL A9B34 1 A 4 2.4 NULL C8J91 2 B 3 5.6 x NULL 3 B 8 4.8 y NULL

def createMaster(path): global master master = [] for file in os.listdir(path): if file.endswith('.csv'): with open(path + file) as inFile: csvFile = csv.reader(inFile) col = next(csvFile) # gets the first line of the file, aka the column headers master.extend(col) # adds the column headers from each file to the master list masterTemp = OrderedDict.fromkeys(master) # gets rid of duplicates while maintaining order masterFinal = list(masterTemp.keys()) # turns from OrderedDict to list return masterFinal

def createInsert(inPath, outPath): for file in os.listdir(inpath): if file.endswith('.csv'): with open(inPath + file) as inFile: with open(outPath + 'table_name' + '.sql', 'a') as outFile: csvFile = csv.reader(inFile) col = next(csvFile) # gets the first row of column headers for row in csvFile: tempMaster = [] # creates a tempMaster list insert = 'INSERT INTO ' + 'table_name' + ' (' + ','.join(master)+ ') VALUES ' # SQL syntax crap for x in master: try: i = col.index(x) # looks for the value in the column list r = row[i] # gets the row value at the same index as the found column tempMaster.append(r) # appends the row value to a temporary list except ValueError: tempMaster.append('NULL') # if the value is not found in the column list it just appends the string to the row master list values = map((lambda x: "'" + x.strip() + "'"), tempMaster) # converts tempMaster from a list to a string printOut = insert + ' (' + ','.join(values) + '):') outFile.write(printOut + '\n') # writes the insert statement to the file

def findBetween(s, first, last): try: start = s.index(first) + len(first) end = s.index(last, start) return s[start:end] except ValueError: print('ERROR: findBetween function failure.') def removeNull(aList): tempList = [] for x in aList: if x == 'NULL': norm = findBetween(x, "'", "'") tempList.append(norm) else: tempList.append(x) return tempList

2条回答

网友

1楼 · 编辑于 2024-06-25 06:35:01

我检查了你的要求，我发现你的目录里有多个CSV。这些csv有动态列。我的方法是创建所有列的静态列表

staticColumnList = ["ID","TableID","Number","Decimal","Letter","Random"]

现在，在读取文件时，获取头行并为相应列的元组创建一个临时列表，例如

[(ID, column no in csv), (TableID, 'A' - File Name), (Number, column no in csv) etc...]

如果csv中没有列，那么将x放在("Letter", x)中。现在对每一行创建一个循环，并指定或拾取值，例如这：在

wholeDataList = []
rowList = []
for column in staticColumnList:
    if int of type(column[1]):
      rowList.append("'"+str(rowCSV[column[1]])+"'")
    elif 'X' == column[1]:
      rowList.append('null')
    else:
      rowList.append("'"+column[1]+"'")


wholeDataList.append("("+",".join(rowList)+")")

最后你准备好了陈述，比如这：在

^{pr2}$

网友

2楼 · 编辑于 2024-06-25 06:35:01

而不是

values = map((lambda x: "'" + x.strip() + "'"), tempMaster)

把这个放进去

^{pr2}$

编辑

谢谢你接受/支持我的简单技巧，但我不确定这是最佳的。在一个更全局的范围内，您可以避免这种map/lambda的东西（除非我遗漏了什么）。

                for row in csvFile:
                    values = [] # creates the final list
                    insert = 'INSERT INTO ' + 'table_name' + ' (' + ','.join(master)+ ') VALUES ' # SQL syntax crap
                    for x in master:
                        try:
                            i = col.index(x) # looks for the value in the column list
                            r = row[i] # gets the row value at the same index as the found column
                            value.append("'"+r.strip()+"'") # appends the row value to the final list
                        except ValueError:
                            value.append('NULL') # if the value is not found in the column list it just appends the string to the row master list

{cd1>你就可以节省内存了。在

编辑

相关问题更多 >

编程相关推荐

热门问题

热门文章