This tool translates your natural-language text into structured API queries for searching NOMAD. We want to test and evaluate the model's capabilities and identify its shortcomings so that we can improve both the model and the search functionality.
The LLM generates two candidate queries for you to evaluate. Select the one that performed better (1 or 2), or indicate that both performed equally well (Tie) or that neither was satisfactory (Both are bad). Note: the two options may sometimes be identical.
You can also edit either generated query to show what the correct query should be, or provide free-form feedback in the notes section.