每日Paper进步屋
LLM LLM
2024-03-30 更新MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended Text EvaluationAuthors:Yu Li, Shenyu
2024-03-30
LLM LLM
2024-01-22 更新Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image SequencesAut
2024-01-22
LLM LLM
2024-01-19 更新R-Judge: Benchmarking Safety Risk Awareness for LLM AgentsAuthors:Tongxin Yuan, Zhiwei He, Lingzhong Dong,
2024-01-19
LLM LLM
2024-01-18 更新Augmenting Math Word Problems via Iterative Question ComposingAuthors:Haoxiong Liu, Andrew Chi-Chih Yao D
2024-01-18
LLM LLM
2024-01-17 更新AesBench: An Expert Benchmark for Multimodal Large Language Models on Image Aesthetics PerceptionAuthors:
2024-01-17
LLM LLM
2024-01-11 更新Generating Diverse and High-Quality Texts by Minimum Bayes Risk DecodingAuthors:Yuu Jinnai, Ukyo Honda, Tet
2024-01-11
LLM LLM
2024-01-05 更新Revisiting Zero-Shot Abstractive Summarization in the Era of Large Language Models from the Perspective o
2024-01-05
LLM LLM
2024-01-04 更新GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social AbuseAuthors:Hongzhan Li
2024-01-04
LLM LLM
2023-12-28 更新Black-Box Tuning of Vision-Language Models with Effective Gradient ApproximationAuthors:Zixian Guo, Yuxia
2023-12-28
2 / 4