Comments (3)
Hi!
I'm not entirely sure what you're asking.
Are you trying to add more weight to a specific meaning of an ambiguous word (name)?
Are you trying to avoid recognition of certain words (names) altogether?
Something else?
In general, though, there is no way to add more weight to a specific concept or to a specific name of a concept.
With that said, the training set will have a significant impact on which concepts and/or names the model is able to effectively identify.
Then again, if you wish to limit the concepts your model is working with, you can always filter out the CUIs you don't need (CDB.filter_by_cui).
Or you could add the CUIs to a filter in the config (config.linking.filters.cuis).
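As a rough illustration of the config-based filter mentioned above, here is a minimal sketch. It assumes the loaded model object (called `cat` here) exposes the `config.linking.filters.cuis` attribute path named in this comment and that the filter accepts a set of CUI strings. The helper name `restrict_linking_to_cuis`, the stand-in object, and the CUI values are hypothetical placeholders so the snippet runs without MedCAT or a model pack installed.

```python
from types import SimpleNamespace


def restrict_linking_to_cuis(cat, cuis_to_keep):
    """Set the linking filter so that only the given CUIs are linked.

    Assumes `cat` exposes `config.linking.filters.cuis`, the attribute
    path named in the comment above (hypothetical helper, not a MedCAT API).
    """
    cat.config.linking.filters.cuis = set(cuis_to_keep)
    return cat.config.linking.filters.cuis


# Stand-in object so the sketch runs on its own; with a real model this
# would be the loaded MedCAT model instance instead.
demo_cat = SimpleNamespace(
    config=SimpleNamespace(
        linking=SimpleNamespace(filters=SimpleNamespace(cuis=set()))
    )
)

# Placeholder CUIs, not real concept identifiers.
active = restrict_linking_to_cuis(demo_cat, ["C0000001", "C0000002"])
print(sorted(active))
```

With a real model, the same helper would be called on the loaded model object before annotating text. Whether an empty set means "no filtering" may depend on the MedCAT version, so it is worth checking against the installed release.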
from medcat.
Thanks for your reply.
What I mean is whether I can 'modify the concept vector' of a specific word in the vocabulary.
Or can I further train an already trained (downloaded) model with my additional document?
I want to transfer and adjust this model for my experiment.
from medcat.
Yes, you are more than welcome to further train and/or fine-tune an existing model. The additional training data can significantly change what the model can recognise. But it all depends on the training route you're taking (whether unsupervised or supervised) as well as the specific training set.
So all in all, by using your own dataset to further train the model, you can probably achieve what you're trying to do.
But it almost certainly won't be possible with a single document.
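For the unsupervised route, continuing training on your own texts might look roughly like the sketch below. It assumes the model object has a `train` method that takes an iterable of raw texts plus `nepochs` and `fine_tune` keyword arguments; those names are an assumption here and should be checked against the installed MedCAT version. The helper `continue_unsupervised_training` and the `_StubCAT` class are hypothetical, included only so the snippet runs without MedCAT.

```python
def continue_unsupervised_training(cat, documents, epochs=1):
    """Pass non-empty texts to the model's unsupervised training call.

    Assumes `cat.train(texts, nepochs=..., fine_tune=...)` exists with
    this shape (an assumption; check your MedCAT version).
    """
    texts = [doc for doc in documents if doc and doc.strip()]
    cat.train(texts, nepochs=epochs, fine_tune=True)
    return len(texts)


class _StubCAT:
    """Stand-in that records training calls instead of training."""

    def __init__(self):
        self.calls = []

    def train(self, data_iterator, nepochs=1, fine_tune=True):
        self.calls.append((list(data_iterator), nepochs, fine_tune))


stub = _StubCAT()
n_used = continue_unsupervised_training(
    stub, ["first clinical note", "", "second clinical note"], epochs=2
)
print(n_used)
```

In practice the `documents` iterable would hold many texts from your own corpus, in line with the point above that a single document is almost certainly not enough.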
from medcat.