I have the following commands:
time grep -F -f 'in2.txt' test.fastq
time zgrep -F -f 'in2.txt' test.fastq.gz
There are about 30 search terms, and the files are ~5 GB each. However, I notice that on one computer, an Amazon instance I spun up, the search takes 3-5x as long to finish. What is affecting the speed? Should I spin up an ECS instance that has more memory or a faster CPU?
An Amazon ecs could be running on any physical hardware, right? You might not have any guarantee of what it's really using, regardless of what it reports... but anyway zgrep searches compressed files, grep doesn't, so they're very different. – Xen2050 – 2018-03-13T04:20:36.230
Xen2050, you're right about grep and zgrep being distinct in performance profile. Most notably, you should find that if you are I/O constrained, but not CPU constrained, operating on well-compressed files should help by reducing the time required to pull data from media. – Slartibartfast – 2018-03-18T16:12:40.203
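One way to check which case applies is to run the decompression and the search as separate stages, so each can be timed on its own. The following is only a minimal sketch: the sample file, the pattern file, and the variable names are made up for illustration and stand in for test.fastq and in2.txt, and setting LC_ALL=C is an assumption that often (not always) speeds up fixed-string searches.

```shell
#!/bin/sh
# Hypothetical tiny stand-ins for test.fastq and in2.txt (not the real data).
tmp=$(mktemp -d)
printf 'ACGTACGT\nTTTTGGGG\nCCCCAAAA\n' > "$tmp/sample.fastq"
printf 'ACGT\nGGGG\n' > "$tmp/patterns.txt"
gzip -c "$tmp/sample.fastq" > "$tmp/sample.fastq.gz"

# Stage 1: fixed-string search on the uncompressed file (count matching lines).
plain_count=$(LC_ALL=C grep -c -F -f "$tmp/patterns.txt" "$tmp/sample.fastq")

# Stage 2: decompress and search as two explicit pipeline stages, which is
# roughly the work zgrep does, but with each stage visible separately.
gz_count=$(gzip -dc "$tmp/sample.fastq.gz" | LC_ALL=C grep -c -F -f "$tmp/patterns.txt")

echo "plain=$plain_count gz=$gz_count"
rm -rf "$tmp"
```

On the real files, prefixing each stage with time and comparing the results should be informative: if "real" is much larger than "user" plus "sys", the command spent most of its time waiting, typically on disk or network storage, and a faster CPU is unlikely to help. If "user" accounts for most of "real", the run is CPU bound, and for the .gz file that cost may come from decompression rather than from the search itself.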