Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Memory Issue - Running 'peaks' tool #157

Open
raflynn opened this issue Aug 15, 2017 · 0 comments
Open

Memory Issue - Running 'peaks' tool #157

raflynn opened this issue Aug 15, 2017 · 0 comments

Comments

@raflynn
Copy link

raflynn commented Aug 15, 2017

Hi all,
For some input BED files the peaks tool works nicely. For others (all formatted the same way) I get the following error. I am running this with 16 CPUs and 60GB of available, which seemed like it would be sufficient for the possible memory. Additionally, it takes ~17-20 hours to get all the way to the point where it throws this error. Any thoughts or help would be appreciated.

Executing the following command: iCount peaks gencode.v21.annotation.segment.gtf input.bed iCountPeaks.bed --scores iCountPeaks_scores.tsv
Input parameters for function 'run' in iCount.analysis.peaks
    annotation: gencode.v21.annotation.segment.gtf
    sites: input.bed
    peaks: iCountPeaks.bed
    scores: iCountPeaks_scores.tsv
    features: None
    group_by: gene_id
    merge_features: False
    half_window: 3
    fdr: 0.05
    perms: 100
    rnd_seed: 42
    report_progress: False
Loading annotation file...
60155 out of 2581788 annotation records will be used (2521633 skipped).
Loading cross-links file...
Calculating intersection between annotation and cross-link file...
Processing intersections...

Traceback (most recent call last):
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/cli.py", line 436, in main
    result_object = func(**args)
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/analysis/peaks.py", line 515, in run
    processed = _process_group(hits, group_size, half_window, perms)
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/analysis/peaks.py", line 371, in _process_group
    random_ = get_avg_rnd_distrib(group_size, sum_scores, half_window, perms=perms)
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/analysis/peaks.py", line 269, in get_avg_rnd_distrib
    rnd_ps = numpy.zeros((perms, total_hits + 1))
MemoryError

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "iCount", line 11, in <module>
    load_entry_point('iCount', 'console_scripts', 'iCount')()
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/cli.py", line 444, in main
    exception_message = exception.args[0]
IndexError: tuple index out of range
Traceback (most recent call last):
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/cli.py", line 436, in main
    result_object = func(**args)
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/analysis/peaks.py", line 515, in run
    processed = _process_group(hits, group_size, half_window, perms)
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/analysis/peaks.py", line 371, in _process_group
    random_ = get_avg_rnd_distrib(group_size, sum_scores, half_window, perms=perms)
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/analysis/peaks.py", line 269, in get_avg_rnd_distrib
    rnd_ps = numpy.zeros((perms, total_hits + 1))
MemoryError

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "iCount", line 11, in <module>
    load_entry_point('iCount', 'console_scripts', 'iCount')()
  File "/share/PI/bertozzi/users/raflynn/tools/iCount/iCount/cli.py", line 444, in main
    exception_message = exception.args[0]
IndexError: tuple index out of range
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant