TU Wien Informatics

  • Arzt, V., Azarbeik, M. M., Lasy, I., Kerl, T., & Recski, G. (2024). TU Wien at SemEval-2024 Task 6: Unifying Model-Agnostic and Model-Aware Techniques for Hallucination Detection. In A. K. Ojha, A. S. Doğruöz, H. Tayyar Madabushi, G. Da San Martino, S. Rosenthal, & A. Rosá (Eds.), Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024) (pp. 1183–1196). Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.semeval-1.173
  • Arzt, V., & Hanbury, A. (2024). Beyond the Numbers: Transparency in Relation Extraction Benchmark Creation and Leaderboards. In D. Hupkes, V. Dankers, K. Batsuren, A. Kazemnejad, C. Christodoulopoulos, M. Giulianelli, & R. Cotterell (Eds.), Proceedings of the 2nd GenBench Workshop on Generalisation (Benchmarking) in NLP (pp. 120–130). Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.genbench-1.8