Comments (19)
I would use as the input format what is spit out by https://github.com/nf-core/fetchngs so it generally has the same columns but different headers for it. I would drop the format column. If needed, that can be figured out from the filenames.
from taxprofiler.
Sorry yup - eager is TSV 😅
from taxprofiler.
Sarek was TSV too, we're now csv
from taxprofiler.
Don't abanon me!
Back on topic:
accept: fastq, fq, fasta, fna. fa + all with .gz
from taxprofiler.
from taxprofiler.
OK actually I agree, that's what I actually based this off of. Do you have an example of a fetchngs sheet?
from taxprofiler.
I ran it once and the only samplesheet I got was filled with millions of columns which I didn't like
from taxprofiler.
Nevermind, I saw this:
--nf_core_pipeline [string] Name of supported nf-core pipeline e.g. 'rnaseq'. A samplesheet for direct use with the pipeline will be created with
the appropriate columns.
so we can customise it I guess
from taxprofiler.
Yeah, it adds a lot of columns but we can pick the ones we need. I do think it's nice, though, if the pipeline keeps all input columns. This makes it easier for users to add any kind of meta information that they would like. The minimal information, in my opinion, is:
sample,fastq_1,fastq_2
from taxprofiler.
@jfy133 I think this is csv and not tsv
from taxprofiler.
CSV seems to be the standard in nf-core pipelines. In Python it's quite easy to allow both but that's harder in nextflow I think.
from taxprofiler.
not at all, you have the splitCsv
operator: https://www.nextflow.io/docs/latest/operator.html#splitcsv
from taxprofiler.
Yes, but it cannot "sniff" if it's CSV or TSV by itself, so you either need to hard code it, look at the file extension, or let the user determine it.
from taxprofiler.
Oh I see what you mean, then yes you're right.
And as you said, csv is the standard in DSL2 nf-core pipelines.
from taxprofiler.
@maxibor and I decided to go for an explicit .fasta
column as this means fastq_1
and fastq_2
can be taken directly from fetchNGS
from taxprofiler.
should change platform to specific machine, as we need 2/4 colour chemsity info
from taxprofiler.
should change platform to specific machine, as we need 2/4 colour chemsity info
Can you provide some more context, please, why this is needed?
from taxprofiler.
@maxibor did you add a check that if you can't supply FASTA and FASTQ in the same line?
from taxprofiler.
I think this is set for now, can reopen if more issues crop up
from taxprofiler.
Related Issues (20)
- Empty files were also published from the module samtools/fastq
- kaiju2table not reporting taxon names
- Generate samplesheet for nf-core/mag HOT 4
- Logo does not match the logo in tube HOT 1
- Move/copy DB troubleshooting content from Usage/Tutorial to Usage/ or new Usage/Database-troubleshooting HOT 1
- Failed to produce kaiju_combined_reports.txt HOT 4
- Current UNTAR scheme inefficent and can cause overwriting for database sheet input HOT 1
- KrakenUniq read extraction HOT 4
- Database parameter validation for bracken doesn't work currently
- taxprofiler krona output files missing HOT 1
- Recommended procedure for profiling nanopore data HOT 1
- The run_accession and sample_name should be unique.
- Add example in documentation for multiple runs of the same sample
- No logo in multiqc report HOT 1
- Add support Metabuli.
- Merged tables not produced if not run with --run_profile_standardisation HOT 1
- Add support for the sylph profiler
- Update nonpareil to allow gzip input HOT 3
- Add a long read classifier: lemur
- Add CAT/BAT/RAT for long reads
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from taxprofiler.