Python library for parsing resumes using natural language processing and machine learning

Last update: Jul 29, 2021

Overview

CVParser

Python library for parsing resumes using natural language processing and machine learning.

Setup

Installation on Linux and Mac OS

Follow the guide here on how to clone or fork a repo
Follow the guide here on how to create virtualenv

To create a normal virtualenv (example myvenv) and activate it (see Code below).

$ virtualenv --python=python3 myvenv

$ source myvenv/bin/activate

(myvenv) $ pip install -r requirements.txt

Usage

from cvparser.parser import CVParser

CVParser.download_nlk_data()


parser = CVParser(file_path="path/to/file.[pdf|doc|docx|png|jpeg]")
parser.parse()
print(parser.json())

Re-training the Model

cd into the train folder.
Delete the folder model and the file train.json.
Copy your new training data into the train folder. The train data must be in json. This can be generated using the data annotation tool called Dataturk. The file containing the training data must be named train.json.
Then, start re-training the model by execute the python script in the train folder named manual_training.py.
Then test your new model by #usage .

Python library for parsing resumes using natural language processing and machine learning

Related tags

Overview

CVParser

Setup

Installation on Linux and Mac OS

Usage

Re-training the Model

Owner

nafiu

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together

Codes for coreference-aware machine reading comprehension

Creating a python chatbot that Starbucks users can text to place an order + help cut wait time of a normal coffee.

Sentiment-Analysis and EDA on the IMDB Movie Review Dataset

Multilingual text (NLP) processing toolkit

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

The official repository of the ISBI 2022 KNIGHT Challenge

Random-Word-Generator - Generates meaningful words from dictionary with given no. of letters and words.

An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.

Persian Bert For Long-Range Sequences

Image2pcl - Enter the metaverse with 2D image to 3D projections

Neural network sequence labeling model

A multi-voice TTS system trained with an emphasis on quality

The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques

fastai ulmfit - Pretraining the Language Model, Fine-Tuning and training a Classifier

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

Text to speech converter with GUI made in Python.

Resources for "Natural Language Processing" Coursera course.

Easy-to-use CPM for Chinese text generation