写输出到CSV文件[以正确的格式]

我意识到这个问题已被问及一百万次，并有大量的文件。但是，我无法以正确的格式输出结果。写输出到CSV文件[以正确的格式]

下面的代码获得通过从：Replacing empty csv column values with a zero

# Save below script as RepEmptyCells.py 
# Add #!/usr/bin/python to script 
# Make executable by chmod +x prior to running the script on desired .csv file 

# Below code will look through your .csv file and replace empty spaces with 0s 
# This can be particularly useful for genetic distance matrices 

import csv 
import sys 

reader = csv.reader(open(sys.argv[1], "rb")) 
for row in reader: 
    for i, x in enumerate(row): 
       if len(x)< 1: 
         x = row[i] = 0 
    print(','.join(int(x) for x in row))

目前，以获得正确输出的.csv文件[即在正确的格式]可以在bash运行以下命令：

#After making the script executable   
./RepEmptyCells.py input.csv > output.csv # this produces the correct output

我试着使用csv.writer函数来产生正确格式化output.csv文件（类似于./RepEmptyCells.py input.csv > output.csv）没有多少运气。

我想了解如何将这最后一部分添加到代码来自动执行该过程，而无需在bash中执行此操作。

我曾尝试：

f = open(output2.csv, 'w') 

import csv 
import sys 

reader = csv.reader(open(sys.argv[1], "rb")) 
for row in reader: 
    for i, x in enumerate(row): 
       if len(x)< 1: 
         x = row[i] = 0 
    f.write(','.join(int(x) for x in row)) 

f.close()

当从这个代码和前一个原始文件看，它们看起来是一样的。

但是，当我用excel或iNumbers打开它们时，后者（即output2.csv）只显示一行数据。

重要的是，output.csv和output2.csv都可以在excel中打开。

来源

2015-11-12 Novice

2个选择：

只是做一个f.write('\n')您当前f.write后声明。

使用csv.writer。你提到它，但它不在你的代码中。

writer = csv.writer(f) 
... 
writer.writerow([int(x) for x in row]) # Note difference in parameter format

来源

2015-11-12 02:00:09

感谢。那样做了！所以你只需要添加新行（'/ n'）！ 1）的作品。 2）仍然没有，但没关系。 – Novice

请注意，我很惊讶1）的工作，因为在Unix上'\ n'会转换为LF，而我非常确定Excel只会在CRLF结束时接受csv文件。实际上，这是CSV格式的一个特性，单个LF表示单元格内的换行符。这就是为什么你打开Python 2的'rb'和Python 3的'newline =''的原因，因为csv编写器处理这个特定的方面，并且会被Python的默认换行抽象打扰。 – Cilyan

一种谦虚命题

#!/usr/bin/env python 
# -*- coding: utf-8 -*- 

import csv 
import sys 

# Use with statement to properly close files 
# Use newline='' which is the right option for Python 3.x 
with open(sys.argv[1], 'r', newline='') as fin, open(sys.argv[2], 'w', newline='') as fout: 
    reader = csv.reader(fin) 
    # You may need to redefine the dialect for some version of Excel that 
    # split cells on semicolons (for _Comma_ Separated Values, yes...) 
    writer = csv.writer(fout, dialect="excel") 
    for row in reader: 
     # Write as reading, let the OS do the caching alone 
     # Process the data as it comes in a generator, checking all cells 
     # in a row. If cell is empty, the or will return "0" 
     # Keep strings all the time: if it's not an int it would fail 
     # Converting to int will force the writer to convert it back to str 
     # anwway, and Excel doesn't make any difference when loading. 
     writer.writerow(cell or "0" for cell in row)

样品in.csv

1,2,3,,4,5,6, 
7,,8,,9,,10

输出out.csv

1,2,3,0,4,5,6,0 
7,0,8,0,9,0,10

来源

2015-11-12 02:16:49 Cilyan

import csv 
import sys 

with open(sys.argv[1], 'rb') as f: 
    reader = csv.reader(f) 
    for row in reader: 
     print row.replace(' ', '0')

我不明白你需要使用shell和重定向。一个CSV作家就是：

with open('output.csv', 'wb') as f: 
    writer = csv.writer(f) 
    writer.writerows(rows)

来源

2015-11-12 02:20:29

写输出到CSV文件[以正确的格式]

回答

相关问题