storing long strings (DNA sequence) in R

后端未结

关注

 2  1094

刺人心 2020-12-21 02:11

I have written a function that finds the indices of subsequences in a long DNA sequence. It works when my longer DNA sequence is < about 4000 characters. However, when I

2条回答

一整个雨季 (楼主)

2020-12-21 02:51
Rather than write your own function, why not use the function words.pos in package seqinr. It seems to work even for strings up to a million base pairs.

For example,
```
library(seqinr)
data(ec999)
myseq <- paste(ec999[[1]], collapse="")
myseq <- paste(rep(myseq,100), collapse="")
words.pos("atat", myseq)
```
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...