How big training corpus?
Unlike most Deep Learning systems ours doesn't require millions of examples. A dozen paraphrase questions (queries) per class (answer/article) shall be enough. The system is pre-trained on generic English. You only have to add your jargon. Still, the bigger the training corpus the better. The system becomes more accurate as the training corpus grows.