Karl Forner and Claudia Armenise have participated at the Topcoder data science challenge. The event was sponsored by the Broad institute and Harvard university. The challenge consisted in finding ways of optimizing the alignment of multiple DNA sequences to a reference DNA, taking into account small differences, “errors”, in the sequences.
“This is a data analysis bottleneck for a lot of current Next Generation Sequencing applications” says Claudia.
After two weeks of challenge Karl and Claudia reached the top quarter of the 76 contestants.
“We gained a lot of insight on how to accelerate the alignment process” says Karl. “If we had a bit more time we would have also been able to optimize the accuracy of the algorithm to gain a higher score” he concludes. “We are looking forward to integrate these new findings in our data analysis pipelines”.
Well done Karl and Claudia !