0
开始我有大的文本文件与此结构线:只保留在文本文件与特定字符
MTXT file Version 1
file 001 good stuff
file 002 sdfdsfsf
file 003 cool stuff
file 004 fjgfhjhgj
base64
file 005 more cool stuff
file 006 dgfdgfdgfcf
file 007 dfgdgffdg
-
009 fsf002dsdfds
010 dsfsfd003dsfs
011 sdf005sd001fs
001 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc!
002 asfdasdsa
003 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc!
004 vld2004sfsfd005sfds
005 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc!
作进一步处理我想摆脱不与001,003和005开头的所有行。但是,由于数据集通常包含相似的数字,因此搜索标准必须在行首开始包含001,003,005的位置。
所以我想输出是这样的:
001 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc!
003 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc!
005 IMPORTANT STUFF with numbers than can also contain 001, 002, 005 etc!
正如我必须这样做,在各种机器简单的Windows操作系统命令将是最好的(例如像这样Delete certain lines in a txt file via a batch file)。 但我也可以用python脚本生活。