
Wei Shen 沈 伟
@shenwei356
Associate professor of Bioinformatics at Chongqing Medical University, China. Lab: mbio.info Personal: shenwei.me shenwei356.bsky.social
ID: 2202777386
http://shenwei.me/ 19-11-2013 09:11:46
1,1K Tweet
1,1K Followers
1,1K Following



I did a project on making an optimized implementation of the S+ tree. The result is 40x speedup over plain binary search! It builds on Algorithmica's post on S-trees and the famous paper "Array layouts for comparison based searching" by Paul Khuong ([email protected]). 🧵 curiouscoding.nl/posts/static-s…


I'm glad to announce that the simd-minimizers library is out! Ragnar {Groot Koerkamp} 🦋 and I have been optimizing the computation of minimizers down to the smallest detail. The result is an order of magnitude faster than existing methods ; processing an entire human genome takes only 4s!






Thrilled that our work on this problem with Karel Břinda, Zamin Iqbal, and others is out in Nature Methods today! We used phylogenetic compression (described in the thread) to compress every microbe ever sequenced onto a flash drive so that it can be searched with a laptop!

A decade ago, we had thousands of bacterial genomes. Now, we have millions. How to scale computational methods? Our paper in Nature Methods answers this: use evolutionary history to guide compression and search. …From terabytes to tens of GBs… w/@Baym Zamin Iqbal et al. 🧵1/

