Gary Marcus

Rank 17 of 47 | Score 88

The statement questions the validity of a previous score reported for the o3 model on the FrontierMath benchmark, suggesting it might have been inaccurate. The tone is skeptical and implies a potential discrepancy in reported data.

  1. Principle 1:
    I will strive to do no harm with my words and actions.
    The statement could cause harm by casting doubt on the credibility of the reported scores without providing evidence, which may affect perceptions of the parties involved. [-1]
  2. Principle 2:
    I will respect the privacy and dignity of others and will not engage in cyberbullying, harassment, or hate speech.
    The statement does not respect the privacy or dignity of the parties involved, as it publicly questions their credibility without substantiation. [-1]
  3. Principle 4:
    I will engage in constructive criticism and dialogue with those in disagreement and will not engage in personal attacks or ad hominem arguments.
    The statement does not engage in constructive criticism or dialogue, as it makes an accusation without offering evidence or inviting discussion. [-1]