Comments (16)
Hi, Robin.
Can your provide the full error message you are receiving when trying to run cbBuild for these datasets?
from cellbrowser.
Thanks matthew.
Here is it.
root@bioinformatics:/home/CellBrowser/OPC/otx169to176integrated# cbBuild -i cellbrowser.conf -o /home/CellBrowser
INFO:root:Determining if /home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz needs to be created
INFO:root:input matrix has input file size that is different from previously processed matrix. Expression matrix must be reindexed. Old file(s): {'fname': '/home/anaconda/opc211101/otx169to176integrated/exprMatrix.tsv.gz', 'md5': '970a1f0448', 'size': 177807570, 'mtime': '2022-03-08 03:07:17'}, current file: 120814330
INFO:root:/home/CellBrowser/OPC/otx169to176integrated/meta.tsv has the same md5 as in /home/CellBrowser/OPC/otx169to176integrated/dataset.json, no need to rebuild meta data
INFO:root:Reading sample names from /home/CellBrowser/OPC/otx169to176integrated/meta.tsv
INFO:root:Checking and reordering meta data to /home/CellBrowser/OPC/otx169to176integrated/meta.tsv
INFO:root:Reading sample names from /home/CellBrowser/OPC/otx169to176integrated/meta.tsv
INFO:root:Reading headers from file /home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz
INFO:root:Data contains 14244 samples/cells
INFO:root:Converting to numbers and compressing meta data fields
INFO:root:Field Cell: type uniqueString, 14244 different values
INFO:root:Field origident: type enum, 4 different values
INFO:root:Field nCount_RNA: type int, 3749 different values
INFO:root:Field nFeature_RNA: type int, 2422 different values
INFO:root:Field percentmt: type float, 7721 different values
INFO:root:Field percentribo: type float, 11120 different values
INFO:root:Field predictedsubclassscore: type float, 13235 different values
INFO:root:Field predictedsubclass: type enum, 11 different values
INFO:root:Field SScore: type float, 14244 different values
INFO:root:Field G2MScore: type float, 14244 different values
INFO:root:Field Phase: type enum, 3 different values
INFO:root:Field seurat_clusters: type enum, 11 different values
INFO:root:Field CellType: type enum, 5 different values
INFO:root:Field integrated_snn_res01: type enum, 5 different values
INFO:root:Field integrated_snn_res02: type enum, 9 different values
INFO:root:Field integrated_snn_res03: type enum, 9 different values
INFO:root:Field integrated_snn_res04: type enum, 11 different values
INFO:root:Field integrated_snn_res05: type enum, 11 different values
INFO:root:Field Cluster: type enum, 11 different values
INFO:root:Indexing meta file /home/CellBrowser/OPC/otx169to176integrated/meta.tsv to /home/CellBrowser/OPC/otx169to176integrated/meta.index
INFO:root:Kept 14244 cells present in both meta data file and expression matrix
INFO:root:Auto-detecting number type of /home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz
INFO:root:Auto-detect: Numbers in matrix are of type 'float'
INFO:root:Auto-detected gene IDs type: symbols
INFO:root:Copying/compressing /home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz to /home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz
cp: '/home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz' and '/home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz' are the same file
ERROR:root:Could not run: cp "/home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz" "/home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz"
ERROR:root:Unexpected error: (<class 'SystemExit'>, SystemExit(1), <traceback object at 0x7fae7e6d3340>)
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/cellbrowser/cellbrowser.py", line 4783, in cbBuildCli
build(confFnames, outDir, port, redo=options.redo)
File "/usr/local/lib/python3.8/dist-packages/cellbrowser/cellbrowser.py", line 4598, in build
convertDataset(inDir, inConf, outConf, datasetDir, redo)
File "/usr/local/lib/python3.8/dist-packages/cellbrowser/cellbrowser.py", line 3955, in convertDataset
convertExprMatrix(inConf, outMatrixFname, outConf, sampleNames, geneToSym, datasetDir, needFilterMatrix)
File "/usr/local/lib/python3.8/dist-packages/cellbrowser/cellbrowser.py", line 3285, in convertExprMatrix
matType = copyMatrixTrim(matrixFname, outMatrixFname, metaSampleNames, needFilterMatrix, geneToSym, outConf, matType)
File "/usr/local/lib/python3.8/dist-packages/cellbrowser/cellbrowser.py", line 2525, in copyMatrixTrim
ret = runCommand(cmd)
File "/usr/local/lib/python3.8/dist-packages/cellbrowser/cellbrowser.py", line 2459, in runCommand
errAbort("Could not run: %s" % cmd)
File "/usr/local/lib/python3.8/dist-packages/cellbrowser/cellbrowser.py", line 168, in errAbort
sys.exit(1)
SystemExit: 1
root@bioinformatics:/home/CellBrowser/OPC/otx169to176integrated#
last time I run cbBuild in "/home/anaconda/opc211101/otx169to176integrated/" successfully.
After I re-set the dataRoot and group my dataset. I copy the old cellbrowser.conf to the subdirectory and try to rebuild everything.
I found the inMatrix md5 is different from outMatrix in the dataset.json
{
"fileVersions": {
"inMeta": {
"fname": "/home/anaconda/opc211101/otx169to176integrated/meta.tsv",
"md5": "5d5e981856",
"size": 2189471,
"mtime": "2022-03-08 04:19:11"
},
"outMeta": {
"fname": "/home/UCSCcellbrowser/otx169to176integrated/meta.tsv",
"md5": "5d5e981856",
"size": 2189471,
"mtime": "2022-03-08 04:26:16"
},
"inMatrix": {
"fname": "/home/anaconda/opc211101/otx169to176integrated/exprMatrix.tsv.gz",
"md5": "970a1f0448",
"size": 177807570,
"mtime": "2022-03-08 03:07:17"
},
"outMatrix": {
"fname": "/home/UCSCcellbrowser/otx169to176integrated/exprMatrix.tsv.gz",
"md5": "0c55cac114",
"size": 120814330,
"mtime": "2022-03-08 04:29:46"
},
"conf": {
"fname": "/home/anaconda/opc211101/otx169to176integrated/cellbrowser.conf",
"md5": "85c7479abb",
"size": 1170,
"mtime": "2022-03-08 04:25:14"
}
},
"sampleCount": 14244,
"matrixWasFiltered": true,
"metaFields": [
{
"name": "Cell",
"label": "Cell",
"type": "uniqueString",
"maxSize": 20,
"diffValCount": 14244,
"md5": "770c0e2419"
},
from cellbrowser.
Robin, can you share more details about how you installed the cellBrowser package (i.e. pip, conda)? And maybe what operating system you're running on (i.e. Windows, Mac OSX, or Linux)?
@maximilianh Do you have ideas? I've never seen this error before:
ERROR:root:Could not run: cp "/home/CellBrowser/OPC/otx169to176integrated/exprMatrix.tsv.gz"
from cellbrowser.
Hi Matthew,
I installed it by pip two years ago and upgrade it recently.
I have an ubuntu and host an ucsc cellbrowser well for about 20 datasets (20 directories) without hierarchy at /home/UCSCcellbrowser/.
Now I have to organize those datasets with hierarchy.
So I configured my cellbrowser with a dataroot "/home/CellBrowser/", made several directories there as collections, made "cellbrowser.conf" there too. Then I copy the old 20 directories to these collections. I try to rebuilt each dataset.
would you mind to suggest what is the best way to re-set the cellbrowser with hierarchy?
Thanks
Robin
from cellbrowser.
Hmm, that's odd that the hierarchy stuff didn't work for you.
Just to be sure, you've removed the 'dataRoot' line from the .cellbrowser.conf file in your home directory?
from cellbrowser.
I did not have a .cellbrowser.conf before.
Just made one with line of "dataRoot=/home/CellBrowser/".
from cellbrowser.
Hi Matthew,
How do you add a dataset to a collection with hierarchy?
If I run cbBuild in a directory not under dataRoot, it just de-activated the hierarchy.
If I move the output folder of cbSeurat under a collection of dataRoot and run cbBuild there, I got the same errors as showed.
Thanks
Robin
from cellbrowser.
Could you try setting up a .cellbrowser.conf (note the '.' at the beginning of the file name) in your home directory with the dataRoot line to see if that helps?
from cellbrowser.
from cellbrowser.
I had ".cellbrowser.conf" with dataRoot line. For redundancy, I also set CBDATAROOT=/home/CellBrowser/.
my error came from here
2515 shutil.copyfile(inFname, outFname)
when I move the default outputs of cbSeurat to a subdirector of dataRoot and run cbBuild there:
outDir == inDir, inFname == outFname and errors happened.
So I renamed file 'exprMatrix.tsv.gz' to 'oldMatrix.tsv.gz' in the output of cbSeurat and run cbBuild again, it works.
from cellbrowser.
from cellbrowser.
I managed an internal bioinformatic server and used it to host the cell browser for my colleagues of 30-40 people. Several colleagues might add some datasets sometime independently. I did not make a specific htmlDir or dataRoot.
I just simply made one directory and have httpd visit that directory.
So my case could be special and I thought I could set outDir as the dataRoot. I have not realized that hierarchy need a specific "dataRoot" other than outDir for 'cbBuild -o'.
I have thought hierarchy only need a tree of outputs of cbBuild, not related to where and how you run cbBuild.
I figured out that I just need to put the input files needed for cbBuild in other places, put their paths in the cellbrowser.conf under a tree of directories, run cbBuild under each subdirectory.
I also modified the codes and make it works when outDir == inDir :), not re-write the exprMatrix, which make it easy to rebuild the tree of outputs anytime.
Thank you very much for your great of work of build the cellbrowser!
from cellbrowser.
Hi, @yesonse. Can we close this ticket? Or are you still running into issues?
from cellbrowser.
I am fine now.
from cellbrowser.
from cellbrowser.
from cellbrowser.
Related Issues (20)
- mtx nomenclature in conf file HOT 2
- Question - Linking cellBrowser through another web-app with dynamic data selection HOT 9
- gene expression and proteins together HOT 4
- stop() instead of error() in the cellbrowser.R script? HOT 1
- Special characters in the column names HOT 1
- os.getLogin() function breaks seurat.py HOT 2
- Question/Issue: Index not showing HOT 28
- Split view by dataset HOT 1
- abstract.html do not working HOT 8
- cache problem of google chrome browser or server httpd setting? HOT 3
- Custom colors for each attribute HOT 8
- Cannot find dataset when refreshing HOT 7
- Cannot read single sample name from the matrix HOT 3
- Spatial data in cell browser HOT 9
- [Feature Request] Find Cells by named selection HOT 6
- --useMtx is ignored if too.big == TRUE in runSeurat.R HOT 6
- On killing cbScanpyImport due to Out of memory, process exits with 0 exit code HOT 1
- `scanpy` docs bug still not resolved HOT 5
- Issue w/ running cbSeurat: argument "counts" is missing, with no default HOT 17
- Error in `[.data.frame`(feature.names, , gene.column) : undefined columns selected HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cellbrowser.