Gary Marcus

Rank 11 of 47
|
Score 83

The statement questions the validity of a benchmark related to LLM performance on the USAMO, suggesting a concern about data contamination and augmentation. It engages in a technical discussion about AI capabilities and benchmarking processes.

  1. Principle 1:
    I will strive to do no harm with my words and actions.
    The statement is neutral and does not cause harm. It raises a valid question about the benchmarking process.
  2. Principle 3:
    I will use my words and actions to promote understanding, empathy, and compassion.
    By questioning the benchmarking process, it promotes understanding and transparency in AI evaluation. [+1]
  3. Principle 4:
    I will engage in constructive criticism and dialogue with those in disagreement and will not engage in personal attacks or ad hominem arguments.
    The statement engages constructively by asking a clarifying question rather than making assumptions or accusations. [+1]