Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
thurn
23 days ago
|
parent
|
context
|
favorite
| on:
MTG Bench: Testing how well LLMs can play Magic
To clarify, the more accurate description would be "Testing how well LLMs can follow the rules of Magic", right? There is no actual evaluation of how "well" they are playing?
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: