Run Length Encoding in Python with List Comprehension

前端未结

关注

 2  1371

I have a more basic Run Length Encoding question compared to many of the questions about this topic that have already been answered. Essentially, I\'m trying to take the str

相关标签:

2条回答

天命终不由人

2020-12-19 13:08

You can use itertools.groupby():

from itertools import groupby

grouped = [list(g) for k, g in groupby(string)]

This will produce your per-letter groups as a list of lists.

You can turn that into a RLE in one step:

rle = ''.join(['{}{}'.format(k, sum(1 for _ in g)) for k, g in groupby(string)])

Each k is the letter being grouped, each g an iterator producing N times the same letter; the sum(1 for _ in g) expression counts those in the most efficient way possible.

Demo:

>>> from itertools import groupby
>>> string = 'aabccccaaa'
>>> [list(g) for k, g in groupby(string)]
[['a', 'a'], ['b'], ['c', 'c', 'c', 'c'], ['a', 'a', 'a']]
>>> ''.join(['{}{}'.format(k, sum(1 for _ in g)) for k, g in groupby(string)])
'a2b1c4a3'

0 讨论(0)

孤独总比滥情好

2020-12-19 13:32

Consider using the more_itertools.run_length tool.

Demo

import more_itertools as mit


iterable = "aabccccaaa"
list(mit.run_length.encode(iterable))
# [('a', 2), ('b', 1), ('c', 4), ('a', 3)]

Code

"".join(f"{x[0]}{x[1]}" for x in mit.run_length.encode(iterable))  # python 3.6
# 'a2b1c4a3'

"".join(x[0] + str(x[1]) for x in mit.run_length.encode(iterable))
# 'a2b1c4a3'

Alternative itertools/functional style:

"".join(map(str, it.chain.from_iterable(x for x in mit.run_length.encode(iterable))))
# 'a2b1c4a3'

Note: more_itertools is a third-party library that installable via pip install more_itertools.

0 讨论(0)