Selective Preference Optimization via Token-Level Reward Function Estimation

Type of publication:	Inproceedings
Citation:	yang:2025
Publication status:	Accepted
Booktitle:	Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Year:	In Press
URL:	https://arxiv.org/abs/2408.135...
Keywords:
Authors:	Yang, K., Liu, Z., Xie, Q., Huang, J., Min, E., Ananiadou, S.