How do you account for term variation in legacy documents? Surely you can capture only a fraction of the synonyms (and other term variants) in the taxonomy tool.

A combination of techniques, including stemming, sentence rules (reordering terms), text mining, and synonym enrichment, will help to overcome much of this variation.
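
As a rough illustration of how three of these techniques can work together, here is a minimal Python sketch. Everything in it is an assumption made for the example rather than a description of any particular taxonomy tool: the `SYNONYMS` and `STOPWORDS` tables, the `crude_stem` suffix stripper (a stand-in for a real stemmer), and the `normalise` and `matches` helpers are all hypothetical. Text mining, which would typically be used to discover candidate variants in the first place, is not shown.

```python
import re

# Hypothetical synonym table: variant stem -> preferred taxonomy term.
SYNONYMS = {"automobile": "car", "vehicle": "car"}

# Hypothetical list of function words to ignore when comparing phrases.
STOPWORDS = {"a", "an", "of", "the", "for"}

# Suffixes removed by the crude stemmer below (not a real stemming algorithm).
SUFFIXES = ("ing", "s")


def crude_stem(token):
    """Strip one common suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def normalise(phrase):
    """Reduce a phrase to an order-independent tuple of preferred stems."""
    tokens = re.findall(r"[a-z0-9]+", phrase.lower())
    tokens = [t for t in tokens if t not in STOPWORDS]
    stems = [crude_stem(t) for t in tokens]            # stemming
    preferred = [SYNONYMS.get(s, s) for s in stems]    # synonym enrichment
    return tuple(sorted(preferred))                    # ignore term order


def matches(taxonomy_term, document_phrase):
    """True if both phrases normalise to the same set of terms."""
    return normalise(taxonomy_term) == normalise(document_phrase)


print(matches("car repairs", "Repair of automobiles"))
print(matches("car repairs", "car sales"))
```

With these assumed tables, the taxonomy term "car repairs" matches the legacy phrase "Repair of automobiles" even though the two share no identical word, while "car sales" does not match. In practice the synonym table would be far larger, and a proper stemmer or lemmatiser would replace the suffix stripper.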