Hello,
I'm quite excited to use uORF-Tools. It seems to run fine at the beginning but I'm getting the following error after about 8% of jobs are done:
`[Wed May 15 09:22:47 2019]
rule sizeFactors:
input: uORFs/longest_protein_coding_transcripts.gtf, maplink/RIBO-ISRIB-1.bam, maplink/RIBO-ISRIB-2.bam, maplink/RIBO-ISRIB-3.bam, maplink/RIBO-vehicle-1.bam, maplink/RIBO-vehicle-2.bam, maplink/RIBO-vehicle-3.bam
output: uORFs/sfactors_lprot.csv
jobid: 21
Activating conda environment: /Users/mo/project/.snakemake/conda/54de1460
Loading required package: methods
Loading required package: stats4
Loading required package: BiocGenerics
Loading required package: parallel
Attaching package: ‘BiocGenerics’
The following objects are masked from ‘package:parallel’:
clusterApply, clusterApplyLB, clusterCall, clusterEvalQ,
clusterExport, clusterMap, parApply, parCapply, parLapply,
parLapplyLB, parRapply, parSapply, parSapplyLB
The following objects are masked from ‘package:stats’:
The following objects are masked from ‘package:base’:
anyDuplicated, append, as.data.frame, basename, cbind, colMeans,
colnames, colSums, dirname, do.call, duplicated, eval, evalq,
Filter, Find, get, grep, grepl, intersect, is.unsorted, lapply,
lengths, Map, mapply, match, mget, order, paste, pmax, pmax.int,
pmin, pmin.int, Position, rank, rbind, Reduce, rowMeans, rownames,
rowSums, sapply, setdiff, sort, table, tapply, union, unique,
unsplit, which, which.max, which.min
Loading required package: S4Vectors
Attaching package: ‘S4Vectors’
The following object is masked from ‘package:base’:
Loading required package: IRanges
Loading required package: GenomeInfoDb
Loading required package: SummarizedExperiment
Loading required package: Biobase
Welcome to Bioconductor
Vignettes contain introductory material; view with
'browseVignettes()'. To cite Bioconductor, see
'citation("Biobase")', and for packages 'citation("pkgname")'.
Loading required package: DelayedArray
Loading required package: matrixStats
Attaching package: ‘matrixStats’
The following objects are masked from ‘package:Biobase’:
Loading required package: BiocParallel
Attaching package: ‘DelayedArray’
The following objects are masked from ‘package:matrixStats’:
colMaxs, colMins, colRanges, rowMaxs, rowMins, rowRanges
The following objects are masked from ‘package:base’:
Loading required package: Biostrings
Loading required package: XVector
Attaching package: ‘Biostrings’
The following object is masked from ‘package:DelayedArray’:
The following object is masked from ‘package:base’:
Loading required package: Rsamtools
Attaching package: ‘plyr’
The following object is masked from ‘package:XVector’:
The following object is masked from ‘package:matrixStats’:
The following object is masked from ‘package:IRanges’:
The following object is masked from ‘package:S4Vectors’:
Error in while (grepl("^#", line)) { : argument is of length zero
Calls: import.gff ... import -> import -> import -> .local -> .sniffGFFVersion
Execution halted
[Wed May 15 09:23:03 2019]
Error in rule sizeFactors:
jobid: 21
output: uORFs/sfactors_lprot.csv
conda-env: /Users/mo/project/.snakemake/conda/54de1460
shell:
mkdir -p uORFs; uORF-Tools/scripts/generate_size_factors.R -t uORF-Tools/samples.tsv -b maplink/ -a uORFs/longest_protein_coding_transcripts.gtf -s uORFs/sfactors_lprot.csv;
`
I can't exactly seem to figure out why exactly the argument is of length zero in the grep function. What am I missing? I'm working with Mus musculus GRC38.92 Ensembl genome and corresponding gtf file.
Morgane