hasanatkazmi/insight-data-engr

Insight data engineering challenge

Details

Details about the problem can be found at:
https://github.com/InsightDataScience/cc-example
The Python script (insight.py) has comments detailing the asymptotic bounds of each function. Specifically, blist's sortedlist is used, which provides O(log n) insertion and access times for a sorted list.
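As an illustration of why a sorted container helps, here is a minimal sketch of a running median kept over a sorted sequence. It is not the code from insight.py: the names `RunningMedian`, `unique_word_count`, and `running_medians` are hypothetical, and the assumption that the sorted list backs a running median of unique words per line is mine. To stay dependency-free it uses the standard library's `bisect` on a plain list, where the position lookup is O(log n) but the insert itself is O(n) because elements are shifted; blist's sortedlist is what brings the insert down to O(log n).

```python
import bisect


class RunningMedian:
    """Keeps values in sorted order so the median can be read by index."""

    def __init__(self):
        self._values = []  # always sorted ascending

    def add(self, value):
        # Binary search for the slot is O(log n); the list insert is O(n).
        # A blist sortedlist would make the whole step O(log n).
        bisect.insort(self._values, value)

    def median(self):
        n = len(self._values)
        if n == 0:
            raise ValueError("no values added yet")
        mid = n // 2
        if n % 2:
            return float(self._values[mid])
        return (self._values[mid - 1] + self._values[mid]) / 2.0


def unique_word_count(line):
    """Number of distinct whitespace-separated tokens in a line."""
    return len(set(line.split()))


def running_medians(lines):
    """Median of unique-word counts after each successive line."""
    tracker = RunningMedian()
    medians = []
    for line in lines:
        tracker.add(unique_word_count(line))
        medians.append(tracker.median())
    return medians
```

For example, `running_medians(["a b c", "a a b", "x y z w q"])` sees unique-word counts of 3, 2, and 5 and returns `[3.0, 2.5, 3.0]`.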

Benchmark

Machine: 16 GB memory, 8 cores, SSD
Total input files: 4929
Total combined lines in the files: 2325714
Total time taken: 3m53.597s
Total size: 134 M
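To put the figures above in perspective, the following sketch derives throughput from them. The helper names `parse_elapsed` and `throughput` are hypothetical, and the elapsed-time string is assumed to be in the `XmY.YYYs` form printed by the shell's `time` command.

```python
import re


def parse_elapsed(text):
    """Convert a string such as '3m53.597s' into seconds."""
    match = re.fullmatch(r"(?:(\d+)m)?([\d.]+)s", text)
    if match is None:
        raise ValueError("unrecognized elapsed time: %r" % text)
    minutes = int(match.group(1) or 0)
    return minutes * 60 + float(match.group(2))


def throughput(count, elapsed_seconds):
    """Items processed per second."""
    return count / elapsed_seconds


seconds = parse_elapsed("3m53.597s")          # 233.597 seconds
lines_per_second = throughput(2325714, seconds)
files_per_second = throughput(4929, seconds)
print("%.0f lines/s, %.1f files/s" % (lines_per_second, files_per_second))
```

With the numbers reported here this works out to roughly 10,000 lines and about 21 files per second.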

