- Aim
- Business Values
- Process
- Data Collection
- Data Preprocessing
- Optical Character Recognition
- Natural Language Processing
- Recommendations
- Result
- Conclusion
- Next Step
Assist recruiters to find talents Jobmigo brings the job world to you
- Simple to use: Drag and Drop
- Save time
- Tailor-made dashboard
For job seeker:
Firstly, Input the CV and then it will be processed by OCR. After that, the CV and job posts will undergo NLP process. And then both CV and job post will be compared by varies methods to find the similarity. Lastly, the system will list out recomendation of jobs.
For recruiter:
Firstly, Input the job post and it will undergo NLP process along with CV in database. And then both CV and job post will be compared by varies methods to find the similarity. Lastly, the system will list out recomendation of candidates.
Web Scraping Scraping useful information from websites, including Title, Company, Location, Salary and Post Date etc. Extract useful content from scraped data. E.g. HK$ 11,000 - HK$ 25, 000/month to 11000, 25000.
- Job:
- 20,137 jobs
- Across 5 main industries (including main industry such as finance, banking, logistics, and IT)
- Method: Web scraping from Jobsdb
- CV:
- 54 CVs
- From friends & google search
1.Word tokenization:
3.Lemmatization
Converting a word to its base form.
"motivated" to "motivate", "Learning" to "Learn".
4.Bigram Collection Finder
Finding meaning double words.
CountVectorizer
- It converts a set of strings into a sparse matrix - One hot encode the text document
TF-IDF (Term Frequency-Inverse Document Frequency)
- Term Frequency, TF: The number of times a term occurs in a document
- Inverse Document Frequency, IDF: It measures how important a term is
(Please note that for the word "earth", the IDF should be log(2/1)=0.3)
KNN
Streamlit Demo
edited.mp4
* Engage with job posting companies to gain access to database / API
* Invite applicants to join the CV database
* More features on the dashboard