Sebastian Deorowicz (@sdeorowicz) 's Twitter Profile
Sebastian Deorowicz

@sdeorowicz

Data compression. Algorithms for genome sequencing compresion and analysis.

ID: 1167138749710557184

linkhttps://refresh-bio.github.io/ calendar_today29-08-2019 18:15:59

125 Tweet

360 Takipçi

31 Takip Edilen

Sebastian Deorowicz (@sdeorowicz) 's Twitter Profile Photo

Clustering large datasets can be challenging. Fortunately, even slow methods can sprint for sparse similarity matrices. Clusty offers s-, c-link, uclust, set-cover, cd-hit, leiden. The paper shows an application for 15M+ sequences. github.com/refresh-bio/cl… biorxiv.org/content/10.110…