DocAI
- Prepare training data
- Use this tool to convert text or PDF files to JSONL and CSV for training.
- This article has more information on preparing data for training
custom
- How to group values of similar keys (e.g. Company / Name / Corporate / Title)
- Using Word2vec
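
A minimal sketch of the key-grouping idea, assuming each key already has an embedding vector. The toy 3-dimensional vectors, the extra keys `Date` and `Issued`, the helper names `cosine` and `group_keys`, and the 0.8 threshold are all made up for illustration; in practice the vectors would presumably come from a trained Word2vec model rather than being written by hand.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def group_keys(vectors, threshold=0.8):
    """Greedy grouping: a key joins the first group whose first member
    is at least `threshold` similar to it, otherwise it starts a new group."""
    groups = []
    for key, vec in vectors.items():
        for group in groups:
            if cosine(vec, vectors[group[0]]) >= threshold:
                group.append(key)
                break
        else:
            groups.append([key])
    return groups


# Hypothetical stand-ins for Word2vec vectors (hand-written, not trained).
toy_vectors = {
    "Company": [0.9, 0.1, 0.0],
    "Name": [0.8, 0.3, 0.1],
    "Corporate": [0.85, 0.2, 0.05],
    "Title": [0.75, 0.35, 0.1],
    "Date": [0.05, 0.1, 0.95],
    "Issued": [0.1, 0.2, 0.9],
}

groups = group_keys(toy_vectors)
print(groups)
```

The greedy pass is order-dependent and compares only against each group's first member, which keeps the sketch short; a clustering method over the full similarity matrix may give more stable groups on real data.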