发表新帖

发表新帖

How do I remove duplicate characters and keep the unique one only in Perl?

前端未结

关注

 11  759

隐瞒了意图╮ 2020-12-05 16:08

How do I remove duplicate characters and keep the unique one only. For example, my input is:

EFUAHUU
UUUEUUUUH
UJUJHHACDEFUCU

Expected out

11条回答

心在旅途 (楼主)

2020-12-05 16:49
From the shell, this works:
```
sed -e 's/$// ; s/./&\n/g' test.txt | uniq | sed -e :a -e '$!N; s/\n//; ta ; s//\n/g'
```
In words: mark every linebreak with a string, then put every character on a line of its own, then use uniq to remove duplicate lines, then strip out all the linebreaks, then put back linebreaks instead of the markers.

I found the -e :a -e '$!N; s/\n//; ta part in a forum post and I don't understand the seperate -e :a part, or the $!N part, so if anyone can explain those, I'd be grateful.

Hmm, that one does only consecutive duplicates; to eliminate all duplicates you could do this:
```
cat test.txt | while read line ; do echo $line | sed -e 's/./&\n/g' | sort | uniq | sed -e :a -e '$!N; s/\n//; ta' ; done
```
That puts the characters in each line in alphabetical order though.
0 讨论(0)

查看其它11个回答
发布评论:

提交评论
- 加载中...

热议问题