Large Model Algorithms: Reinforcement Learning, Fine-Tuning, and Alignment (大模型算法:强化学习、微调与对齐)