Publications


  1. (In review) Khouja et al. “LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation” (2025) [Paper, Website]

  2. Hajij et al. “TopoX: A Suite of Python Packages for Machine Learning on Topological Domains.” Journal of Machine Learning Research, (2024) [Paper, GitHub]

  3. Khouja, Jude. “Stance Prediction and Claim Verification: An Arabic Perspective.” Proceedings of the Third Workshop on Fact Extraction and VERification (FEVER) at ACL 2020, (2020). [Paper, Data]

  4. Mudunuri et al. “Knowledge and theme discovery across very large biological data sets using distributed queries: a prototype combining unstructured and structured data.” PLoS ONE 8(12), (2013) [HTML]

  5. Zhai et al. “Mr. LDA: A Flexible Large Scale Topic Modeling Package using Variational Inference in MapReduce.” ACM International Conference on World Wide Web, (2012). [PDF]