Discussion about this post

User's avatar
Andreas Schurch's avatar

Appreciated the way you separate taxonomy work (controlled vocab, hierarchy, synonyms) from ontology work (relationship types that let you reason across concepts). The two failure modes I see in the field line up with your scope and granularity section: (1) trying to model the whole domain up front, and (2) over-specifying relationships that no downstream workflow actually uses. Both end with an ontology that is expensive to maintain and quietly ignored.

A pragmatic pattern that holds up: start from a concrete use case and real text, build a thin set of canonical concepts with strong synonym/acronym coverage, then add relationships only when they drive a decision (routing, retrieval, analytics). And treat change management as first-class: versioning, review, and feedback from how practitioners actually write. That is what keeps the ontology a living asset, not a one-off taxonomy dump.

Rainbow Roxy's avatar

Thanks for writing this, it realy clarifies a lot. It highlights how crucial and difficult making sense of 'raw' text data remains, even with modern NLP and AI models.

No posts

Ready for more?