Home AI safety and research company Anthropic calls for proposals to evaluate advanced models

AI safety and research company Anthropic calls for proposals to evaluate advanced models

Anthropic, a company that does research into AI safety, is calling for proposals on ways to evaluate advanced learning models.

The rapid growth of AI means there are new AI providers and models all the time. Every major tech company has its own model, while there are dozens more smaller ones as well. That means that the industry has a rising benchmarking problem, making it tough to accurately evaluate how well an AI model performs.

Not only is it hard to state how effective a model is, but it’s also hard to evaluate the risks involved with AI safety. Anthropic is calling for proposals to plug this gap in AI evaluation.

“Developing high-quality, safety-relevant evaluations remains challenging, and the demand is outpacing the supply,” the company writes. “To address this, today we’re introducing a new initiative to fund evaluations developed by third-party organizations that can effectively measure advanced capabilities in AI models.”

The main focus of the proposals should be centered around: AI Safety Level assessments; advanced capability and safety metrics; and infrastructure, tools, and methods for developing evaluations. There should be details on tests that can assess an AI model’s ability to accomplish tasks ranging from cyberattacks, working on weapons of mass destruction, including nuclear weapons, and creating deepfakes or misinformation).

How to submit a proposal on AI safety evaluation

You can read more details on each of the sections that Anthropic is calling for proposals on and submit a proposal on the research company’s website. The team is reviewing submissions on a rolling basis and will follow up with select proposals to discuss next steps.

Anthropic has allocated various levels of funding to help get those research options to their next stages, as well as offering the possibility to talk directly with experts from across their in-house research and safety teams.

Featured image: Ideogram

About ReadWrite’s Editorial Process

The ReadWrite Editorial policy involves closely monitoring the tech industry for major developments, new product launches, AI breakthroughs, video game releases and other newsworthy events. Editors assign relevant stories to staff writers or freelance contributors with expertise in each particular topic area. Before publication, articles go through a rigorous round of editing for accuracy, clarity, and to ensure adherence to ReadWrite's style guidelines.

Rachael Davies
Tech Journalist

Rachael Davies has spent six years reporting on tech and entertainment, writing for publications like the Evening Standard, Huffington Post, Dazed, and more. From niche topics like the latest gaming mods to consumer-faced guides on the latest tech, she puts her MA in Convergent Journalism to work, following avenues guided by a variety of interests. As well as writing, she also has experience in editing as the UK Editor of The Mary Sue , as well as speaking on the important of SEO in journalism at the Student Press Association National Conference. You can find her full portfolio over on…

Get the biggest tech headlines of the day delivered to your inbox

    By signing up, you agree to our Terms and Privacy Policy. Unsubscribe anytime.

    Tech News

    Explore the latest in tech with our Tech News. We cut through the noise for concise, relevant updates, keeping you informed about the rapidly evolving tech landscape with curated content that separates signal from noise.

    In-Depth Tech Stories

    Explore tech impact in In-Depth Stories. Narrative data journalism offers comprehensive analyses, revealing stories behind data. Understand industry trends for a deeper perspective on tech's intricate relationships with society.

    Expert Reviews

    Empower decisions with Expert Reviews, merging industry expertise and insightful analysis. Delve into tech intricacies, get the best deals, and stay ahead with our trustworthy guide to navigating the ever-changing tech market.