Split a string with unknown number of spaces as separator in Python
s = ' 1234 Q-24 2010-11-29 563 abc a6G47er15 '
ss = s.split()
print(ss) # ['1234', 'Q-24', '2010-11-29', '563', 'abc', 'a6G47er15']
If you have single spaces amid your data (like an address in one field), here's a solution for when the delimiter has two or more spaces:
with open("textfile.txt") as f:
content = f.readlines()
for line in content:
# Get all variable-length spaces down to two. Then use two spaces as the delimiter.
while line.replace(" ", " ") != line:
line = line.replace(" ", " ")
# The strip is optional here.
data = line.strip().split(" ")
print(data)
We can also use regex's split method here too.
import re
sample = ' 1234 Q-24 2010-11-29 563 abc a6G47er15 '
word_list = re.split("\s+", sample.strip())
print(word_list) #['1234', 'Q-24', '2010-11-29', '563', 'abc', 'a6G47er15']
I hope this might help someone
If you don't pass any arguments to str.split()
, it will treat runs of whitespace as a single separator:
>>> ' 1234 Q-24 2010-11-29 563 abc a6G47er15'.split()
['1234', 'Q-24', '2010-11-29', '563', 'abc', 'a6G47er15']