Analysis Of Large-Scale Population Genetic Data Using Efficient Algorithms And Data Structures