I've been doing this for one of the major companies in the space for a few years now. It has been interesting to watch how much more complex the projects have gotten, and how many issues the models still have. I have a humanities background, which has actually served me well here, as what constitutes a "better" AI model response is often so subjective.
I can answer any questions people have about the experience (within code of conduct guidelines so I don't get in trouble...)
Thank you, I'll bite. If within your code of conduct:
- Are you providing reasoning traces, responses or both?
- Are you evaluating reasoning traces, responses or both?
- Has your work shifted towards multi-turn or long-horizon tasks?
- If you also work with chat logs of actual users, do you think that they are properly anonymized? Or do you believe that you could de-anonymize them without major effort?
- Do you have contact with other evaluators?
- How do you (and your colleagues) feel about the work (e.g., moral qualms because you're "training your replacement", pride because you're furthering civilization, or is it just about the money...)?
What kinds of data are you working on? Coding? Something else?
I've been curious how much these AI companies are looking for more niche coding-language expertise, and what other knowledge frontiers they're focusing on (like law, medicine, finance, etc.).