Huayu Sha

23302010032@m.fudan.edu.cn

Shanghai, Shanghai, CN

Summary

Software Engineering student at Fudan University working on NLP, trustworthy evaluation of large language models, medical benchmarks, and scientific intelligence.

Education

Software Engineering

Present

Fudan University

Skills

Research Areas

Natural Language Processing
Trustworthy LLM Evaluation
Medical NLP
Scientific Intelligence

Publications

OpenNovelty: An Open-domain Benchmark for Evaluating the Open-ended Novelty of Language Models

2026

arXiv

A benchmark for evaluating whether language models can assess the novelty of open-ended research ideas and claims.

View Publication
LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models

2025

arXiv

A dynamic evaluation framework for robust, contamination-resistant, and fair assessment of large language models.

View Publication
LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation

2025

Findings of EMNLP 2025

A physician-validated real-world clinical benchmark for evaluating medical LLMs across diverse medical scenarios.

View Publication

Interests

Research Interests

Robust and fair evaluation of language models, Medical benchmarks with expert validation, Open-ended novelty assessment

CV