Comments (7)
This might be related to #6.
from universalmotif.
Also, in case this is relevant to a fix, the custom alphabet spec is defined here.
from universalmotif.
Okay. So there is no intend on running run_meme with a custom alphabet?
I tried writing the sequences to a fasta file and running the whole thing through a system command with the intention of using read_meme to then read the results, however meme-chip (version 5.4.1) doesn't produce an "END ALPHABET" line in the meme output, so I end up having to add it myself in order to read it. So in short - I can neither run meme with a custom alphaber, nor read one after it's ran if I don't myself edit the meme file. Is this desired?
(a snippet from the meme output file)
********************************************************************************
ALPHABET "MethDNA" DNA-LIKE
********************************************************************************
A "Adenine" 660000 ~ T "Thymine" 000066
C "Cytosine" 666600 ~ G "Guanine" 006600
M "mC:Guanine" 006666 ~ m "Methylcytosine" 660066
N = ACGMTm
********************************************************************************
********************************************************************************
TRAINING SET
********************************************************************************
PRIMARY SEQUENCES= output/seqs-centered
CONTROL SEQUENCES= --none--
Sequence name Weight Length Sequence name Weight Length
------------- ------ ------ ------------- ------ ------
1 1.0000 11 2 1.0000 11
3 1.0000 11 4 1.0000 11
5 1.0000 11 6 1.0000 11
7 1.0000 11 8 1.0000 11
9 1.0000 11
********************************************************************************
Still using the custom alphabet from the example I mentioned
from universalmotif.
So there is no intend on running run_meme with a custom alphabet?
The intent is to support custom alphabets. Not fully supporting them was a stopgap in that PR to at least broaden the support to DNA/etc.-like alphabets while waiting on a full custom alphabet support, which Ben later implemented. Clearly there's still some kinks to work out.
The END ALPHABET
chunk does exist in some types of MEME suite output file (see: dreme.txt for an example), but the fact that it is not always included is frustrating. I'm working on a quick fix that could solve this.
I appreciate that this is frustrating, thanks for your patience.
from universalmotif.
Thank you! I will continue with the above mentioned fix until there is an official fix :)
from universalmotif.
Fix is filed for PR, I also added a better error message to indicate it's failing at parsing the custom alphabet in case this happens with other output types in the future with a hint to file an issue here. How each MEME tool writes the meme.txt
file varies with different parameters or just is different between tools, so it's hard to write a perfect parser in one go. Think we'll just have to keep catching these edge cases unless there's a documented spec somewhere.
@bjmt seems worth throwing into release as well since this is a bug? Up to you.
from universalmotif.
Sorry for taking forever to get around to this. I had no idea the MEME alphabet definition could change like that. Though admittedly I've never used custom alphabets for my own work, so I never really tested read_meme()
for such cases too thoroughly.
@snystrom Thanks for the fix, I'm testing it now and will merge into release today/tomorrow. You're right that it's a bit hacky but it doesn't change previous behaviour so I'm happy to merge it in until the next bug report/time I become inclined to fix it properly/etc.
Assuming I get this into the release branch today or tomorrow, it should show up in the Bioconductor release (3.15) sometime in the middle of the week. Thanks for reporting this!
from universalmotif.
Related Issues (18)
- shuffle_sequence truncated HOT 2
- read_meme() yields out that "alphabet type cannot be detected" although "ALPHABET= ACGT" is provided HOT 1
- importing pwd by read_cisbp HOT 3
- Cannot get scan_sequences to report p-values HOT 11
- Improve support for metadata-based manipulations HOT 22
- new view_motifs() function plots in alphabetical order instead of list input order HOT 2
- error in install HOT 15
- update_motif on motifs with 1 extrainfo column gives misnamed results
- Error with read_meme HOT 5
- Option to deprotect `motif` column in universalmotif_df HOT 3
- validObject_universalmotif(motifs) fails when strand is "*" HOT 2
- R session aborted / fatal error when running read_homer from a list of motifs HOT 5
- How to merge motifs with variable length of gap HOT 7
- Error in convert_motifs for TFBStools-PFMatrixList HOT 10
- `create_motif` makes incorrect motif for amino acid sequences HOT 1
- read_meme fails when alphabet is DNA/RNA/AA-LIKE or custom
- altname is dropped during summarise_motif
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from universalmotif.