Thursday, October 29, 2009

First/Second Entromics Tests

Test variables:
2000 sequences
401 sequence size
All exact copies (name and sequences)
>gene1_hg19_chr7_87127948_87347564_gene_1_A_rs1002205
AddBuffer - true
Nicol n e n 10 90 (Max matrix: 500)
Nisa 1 1e36 -1e36
eH 20 500 100 10
----------------------------------------------------------------------------------------
10/27/09 - Scheduled 2000 sequences in PittGrid and Locally starting at 11:20pm.

10/28/09 morning -
PittGrid results - Checked PittGrid and saw that most of the jobs were off the queue, but when I checked the working directory I saw only the Nicol and Nicol Profile jobs output. No Nisa or eH. Emailed Senthil about this and checked the CMD and DAG files. Both looked good, and the input file looked good as well.
Local results - For some reason AddBuffer failed after 509 files. Everything computed (eH and eS) for the 509 but that was it. Check the AddBuffer application and made modifications (changed the char sizes, allocated more space for each sequence, etc.) was not able to fix the problem. Did some research and found out that because I was opening each file in AddBuffer (509), plus adding in stdout, stdin, stderr (509 + 3 = 512). Which made more sense. Apparently for a given process in Windows, there is a limit of 512 files it can open (even if I close the file in the application). This can be changed using _setmaxstdio(2048);

10/28/09 night - After making changes to AddBuffer. I tested both the PittGrid and Local Entromics with 2000 files. Started at 8:33pm.

10/29/09 morning - Checked results:
PittGrid results - same as yesterday morning, email Senthil again
Local results - All files finished!
eH application was incredible faster, ending at 11:14 pm.
Nisa finished at 6:50am.
The final file size = 18.9 Gb

No comments:

Post a Comment