2017-07-26 42 views
1

Partition a string based on a search term in Python?

Given a string:

x = 'foo test1 test1 foo test2 foo' 

I want to split the string on foo, so that I get something along the lines of:

['foo', 'test1 test1 foo', 'test2 foo'] (preferred) 

       or 

[['foo'], ['test1', 'test1', 'foo'], ['test2', 'foo']] (not preferred, but workable) 

I tried itertools.groupby:

In [1209]: [list(v) for _, v in itertools.groupby(x.split(), lambda k: k != 'foo')] 
Out[1209]: [['foo'], ['test1', 'test1'], ['foo'], ['test2'], ['foo']] 

But it doesn't quite give me what I'm looking for. I know I could use a loop, like this:

In [1210]: l = [[]] 
     ...: for v in x.split(): 
     ...:     l[-1].append(v) 
     ...:     if v == 'foo': 
     ...:         l.append([]) 
     ...: 

In [1211]: l 
Out[1211]: [['foo'], ['test1', 'test1', 'foo'], ['test2', 'foo'], []] 

But it isn't very efficient, and it leaves an empty list at the end. Is there a simpler way?

I want to keep the delimiter.

+1

Possible duplicate of [Split a string by a delimiter in Python](https://stackoverflow.com/questions/3475251/split-a-string-by-a-delimiter-in-python) –

+1

What happens if the string doesn't end with 'foo'? – holdenweb

+0

@holdenweb It always will. –

Answers

3

Perhaps not the prettiest approach, but it is concise and straightforward:

[part + 'foo' for part in x.split('foo')][:-1] 

Output:

['foo', ' test1 test1 foo', ' test2 foo'] 
+0

Nice. I didn't think of this. –
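
As a hedged variation on the one-liner above: it relies on the string ending with the delimiter, which the asker confirmed in the comments. The sketch below drops that assumption by keeping any non-empty tail. The function name `split_keep_delim` is hypothetical and not part of the original answer.

```python
def split_keep_delim(s, delim):
    """Split s after each occurrence of delim, keeping delim on each piece."""
    parts = s.split(delim)
    # Every piece except the last was followed by delim in the original string.
    pieces = [part + delim for part in parts[:-1]]
    # The last piece is whatever follows the final delim; keep it only if non-empty.
    if parts[-1]:
        pieces.append(parts[-1])
    return pieces


x = 'foo test1 test1 foo test2 foo'
print(split_keep_delim(x, 'foo'))
```

For strings that do end with the delimiter, this should give the same pieces as the one-liner, leading spaces included.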

3

You can use str.partition for your case:

def find_foo(x): 
    result = [] 
    while x: 
        before, _, x = x.partition("foo") 
        result.append(before + "foo") 
    return result 

>>> find_foo('foo test1 test1 foo test2 foo') 
['foo', ' test1 test1 foo', ' test2 foo'] 
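
The pieces returned by the str.partition loop carry leading spaces, whereas the preferred output in the question has none. A minimal sketch of one way to trim them, assuming surrounding whitespace is never significant (the name `partition_keep` and the `word` parameter are illustrative, not from the original answer):

```python
def partition_keep(x, word="foo"):
    """Cut x after each occurrence of word and strip whitespace from each piece."""
    result = []
    while x:
        before, sep, x = x.partition(word)
        # sep is empty once word no longer occurs; `before` is then the tail.
        piece = (before + sep).strip()
        if piece:  # skip pieces that were whitespace only
            result.append(piece)
    return result


print(partition_keep('foo test1 test1 foo test2 foo'))
```

Because the separator returned by `partition` is re-attached instead of a hard-coded `"foo"`, a tail without the search word is kept as-is rather than having the word appended to it.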
0

Try this:

x = 'foo test1 test1 foo test2 foo' 

word = 'foo' 
out = [] 
while word in x: 
    pos = x.index(word) 
    l = len(word) 
    out.append(x[:int(pos)+l]) 
    x = x[int(pos)+l:] 

print(out) 

Output:

['foo', ' test1 test1 foo', ' test2 foo'] 
1

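Have you thought about scanning through the string and passing a start position to the search? That will usually be faster than slicing the string as you go. This might work for you: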

x = 'foo test1 test1 foo test2 foo' 

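def findall(target, s): 
    # Assumes s ends with target, as the asker stated; if target is missing 
    # from the tail, s.find returns -1 and the loop does not terminate. 
    lt = len(target) 
    ls = len(s) 
    pos = 0 
    result = [] 
    while pos < ls: 
        fpos = s.find(target, pos) + lt 
        result.append(s[pos:fpos]) 
        pos = fpos 
    return result 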

print(findall("foo", x)) 
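
One caveat with the start-position approach: `str.find` returns -1 when there is no further match, so input that does not end with the target needs an explicit check. The following is a sketch of a guarded variant, not the answerer's code; keeping the unmatched tail as a final piece is an assumption about the desired behavior.

```python
def findall(target, s):
    """Return the pieces of s, each ending just after an occurrence of target."""
    if not target:
        raise ValueError("target must be a non-empty string")
    result = []
    pos = 0
    while pos < len(s):
        idx = s.find(target, pos)
        if idx == -1:
            # No further match: keep the remainder and stop.
            result.append(s[pos:])
            break
        end = idx + len(target)
        result.append(s[pos:end])
        pos = end
    return result


print(findall("foo", 'foo test1 test1 foo test2 foo'))
```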
1

You can use a regex with a positive lookbehind, (?<=...), after importing re:

In [515]: string = 'foo test1 test1 foo test2 foo' 

In [516]: re.split(r'(?<=foo)\s', string) 
Out[516]: ['foo', 'test1 test1 foo', 'test2 foo'] 

And, for the nested-list form:

In [517]: [x.split() for x in re.split(r'(?<=foo)\s', string)] 
Out[517]: [['foo'], ['test1', 'test1', 'foo'], ['test2', 'foo']] 
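
To use the lookbehind idea with an arbitrary search word, the pattern could be built with `re.escape`, as in the rough sketch below. The helper name `split_after` is hypothetical. Note that Python's `re` module only accepts fixed-width lookbehinds, which an escaped literal word satisfies, and that the lookbehind also matches when the word is the end of a longer token such as `barfoo`.

```python
import re


def split_after(word, s):
    """Split s at whitespace that immediately follows word, keeping word."""
    # re.escape lets word contain regex metacharacters such as '.' or '+'.
    pattern = r'(?<=' + re.escape(word) + r')\s+'
    return re.split(pattern, s)


string = 'foo test1 test1 foo test2 foo'
print(split_after('foo', string))
print([piece.split() for piece in split_after('foo', string)])
```

Using `\s+` instead of `\s` is a small design choice: it treats a run of spaces or newlines after the word as a single split point.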