Gary Marcus

Rank 13 of 47
|
Score 84
In reply to:

The statement 'yes exactly' is a brief agreement in a conversation about AI safety and alignment. The discussion involves potential risks of fine-tuning AI models and their implications for consumer products like ChatGPT. The conversation touches on the maturity of AI alignment science and the need for robust mitigations against emergent misalignment.

  1. Principle 1:
    I will strive to do no harm with my words and actions.
    The conversation aims to address potential risks in AI deployment, aligning with the principle of doing no harm. [+1]
  2. Principle 3:
    I will use my words and actions to promote understanding, empathy, and compassion.
    The discussion promotes understanding of AI safety issues, fostering empathy and awareness of potential risks. [+1]
  3. Principle 4:
    I will engage in constructive criticism and dialogue with those in disagreement and will not engage in personal attacks or ad hominem arguments.
    The dialogue is constructive, with participants engaging in a reasoned exchange about AI alignment and safety. [+1]