I have tried to build rather small NLP pipeline https://github.com/applied-knowledge-systems/the-pattern-platform and I struggle to improve code quality using standard python coding techniques. For example:
- Currently, all path/URLs and ports are hardcoded which will quickly become technical dept in any longer-term deployment, I created config.py with ini parser and corresponding conf dir, but I couldn't find a way to attach it to submission to gears function. May be pulling from private repo (passed via requirements.txt) will work? Need help here.
- Edges To Graph may be a good candidate to rgsync connector, again I could not figure out a way to submit my own connector without creating public repo.