2015-10-08

Keep only the lines in a text file that start with specific characters

I have large text files with lines structured like this:

MTXT file Version 1 
    file 001 good stuff 
    file 002 sdfdsfsf 
    file 003 cool stuff 
    file 004 fjgfhjhgj 
base64 
    file 005 more cool stuff 
    file 006 dgfdgfdgfcf 
    file 007 dfgdgffdg 
    - 
009 fsf002dsdfds 
010 dsfsfd003dsfs 
011 sdf005sd001fs 
001 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc! 
002 asfdasdsa 
003 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc! 
004 vld2004sfsfd005sfds 
005 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc! 

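For further processing I want to get rid of every line that does not start with 001, 003, or 005. Because the data itself often contains similar numbers, the search must match 001, 003, or 005 only at the very beginning of a line.

So the output I want looks like this: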

001 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc! 
003 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc! 
005 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc! 

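Since I have to do this on various machines, a simple Windows command would be best (for example, something like "Delete certain lines in a txt file via a batch file"). But I could also live with a Python script.

Answer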

3

Use findstr /b to look for any of the specified space-separated numbers at the beginning of each line:

findstr /b "001 003 005" yourfile.txt 
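
Since the question also allows a Python script, here is a minimal sketch of the same prefix filter. The function name keep_lines, the PREFIXES constant, the UTF-8 encoding, and the command-line usage are assumptions made for illustration; they do not come from the original post. Like findstr /b, it checks only the start of each line, so indented lines such as "    file 001 good stuff" are dropped.

```python
import sys

# Prefixes taken from the question; adjust as needed.
PREFIXES = ("001", "003", "005")


def keep_lines(lines, prefixes=PREFIXES):
    """Yield only the lines that begin with one of the given prefixes."""
    prefixes = tuple(prefixes)  # str.startswith accepts a tuple of options
    for line in lines:
        if line.startswith(prefixes):
            yield line


if __name__ == "__main__" and len(sys.argv) == 3:
    # Hypothetical usage: python keep_lines.py input.txt output.txt
    # The encoding is an assumption; change it to match the real files.
    with open(sys.argv[1], encoding="utf-8") as src, \
         open(sys.argv[2], "w", encoding="utf-8") as dst:
        # Lines are streamed one at a time, so large files are not
        # loaded into memory all at once.
        dst.writelines(keep_lines(src))
```

To require a space after the number (so that a line starting with "0015" would not match), the prefixes could be written as "001 ", "003 ", and "005 " instead.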