NIST Launches Generative AI Testing Program -- Campus Technology

Breaking News

NIST Launches Generative AI Testing Program

By Gladys Rama
05/06/24

The National Institute of Standards and Technology (NIST) is taking incremental steps toward establishing a more standardized national approach to AI safety. The government agency has announced the launch of NIST GenAI, described as an "evaluation program to support research in Generative AI technologies."

The launch comes six months after the Biden White House signed an Executive Order requiring LLM makers to implement guardrails around AI technologies that protect the privacy and security of consumer data. For instance, the order mandated the development of "standards, tools, and tests to help ensure that AI systems are safe, secure, and trustworthy," and of "standards and best practices for detecting AI-generated content and authenticating official content."

The NIST GenAI program is part of the department's effort to address those mandates.

A companion NIST program, dubbed Aria, is set to launch soon. Aria's stated goal is "to advance measurement science for safe and trustworthy AI."

In a press release Monday, the U.S. Department of Commerce, of which NIST is part, described the GenAI program as a platform to "evaluate and measure generative AI technologies."

"The NIST GenAI program will issue a series of challenge problems designed to evaluate and measure the capabilities and limitations of generative AI technologies," said the agency. "These evaluations will be used to identify strategies to promote information integrity and guide the safe and responsible use of digital content."

The first of these challenges aims to evaluate the efficacy of text-to-text (T2T) AI models -- those that generate human-like text ("generators"), as well as those that purport to detect AI-generated text ("discriminators"). Findings from the challenge will help guide the NIST's eventual recommendations to LLM makers for how to convey the provenance of content made using their AI systems. This is how NIST describes the challenge in its Overview page:

NIST GenAI T2T is an evaluation series that supports research in Generative AI Text-to-Text modality. Which generative AI models are capable of producing synthetic content that can deceive the best discriminators as well as humans? The performance of generative AI models can be measured by (a) humans and (b) discriminative AI models. To evaluate the "best" generative AI models, we need the most competent humans and discriminators. The most proficient discriminators are those that possess the highest accuracy in detecting the "best" generative AI models. Therefore, it is crucial to evaluate both generative AI models (generators) and discriminative AI models (discriminators).

The challenge is open to academics, researchers and LLM makers; those interested can read the participation guidelines here. A similar challenge to evaluate text-to-image models is set to start soon.

Besides the GenAI program launch, NIST this week released preliminary versions of four papers about the secure development and implementation of AI. These papers, which are described as "initial drafts," are as follows:

"Artificial Intelligence Risk Management Framework: Generative Artificial Intelligence Profile": Identifies 13 potential problems caused by generative AI -- including "easier access to information related to chemical, biological, radiological or nuclear weapons; a lowered barrier to entry for hacking, malware, phishing, and other cybersecurity attacks; and the production of hate speech and toxic, denigrating or stereotyping content" -- and 400 potential solutions for developers to implement against them.
"Secure Software Development Practices for Generative AI and Dual-Use Foundation Models": Focuses on risks stemming from AI training data and proposes strategies to mitigate them such as "analyzing training data for signs of poisoning, bias, homogeneity and tampering."
"Reducing Risks Posed by Synthetic Content": Provides guidelines for identifying and labeling AI-generated content, for instance through metadata or watermarks.
"A Plan for Global Engagement on AI Standards": Provides a case for, and suggestions for implementing, a worldwide framework for AI development, testing and usage.

Each draft is still subject to change based on public input. The NIST is accepting feedback for each publication until June 2. Final versions will be published "later this year," the agency said.

About the Author

Gladys Rama (@GladysRama3) is the editorial director of Converge360.

E-Mail this page

Printable Format

Featured

Report: Cloud Certifications Bring Biggest Salary Payoff

It pays to be conversant in cloud, according to a new study from Skillsoft The company's annual IT skills and salary survey report found that the top three certifications resulting in the highest payoffs salarywise are for skills in the cloud, specifically related to Amazon Web Services (AWS), Google Cloud, and Nutanix.
Ditch the DIY Approach to AI on Campus

Institutions that do not adopt AI will quickly fall behind. The question is, how can colleges and universities do this systematically, securely, cost-effectively, and efficiently?
Windows Server 2025 Release Offers Cloud, Security, and AI Capabilities

Microsoft has announced the general availability of Windows Server 2025. The release will enable organizations to deploy applications on-premises, in hybrid setups, or fully in the cloud, the company said.
AI Dominates Key Technologies and Practices in Cybersecurity and Privacy

AI governance, AI-enabled workforce expansion, and AI-supported cybersecurity training are three of the six key technologies and practices anticipated to have a significant impact on the future of cybersecurity and privacy in higher education, according to the latest Cybersecurity and Privacy edition of the Educause Horizon Report.

CAMPUS TECHNOLOGY NEWS

Email Address*Country*Select primary job title/function*

Please type the letters/numbers you see above.

NIST Launches Generative AI Testing Program

Featured

Report: Cloud Certifications Bring Biggest Salary Payoff

Ditch the DIY Approach to AI on Campus

Windows Server 2025 Release Offers Cloud, Security, and AI Capabilities

AI Dominates Key Technologies and Practices in Cybersecurity and Privacy

Portals

Artificial Intelligence

Cybersecurity

Data & Analytics

Learning Tools

Student Services

WEBCASTS

From Applicant to Alumni: Integrating the Student Lifecycle into Identity Management

Getting Identity Right: Why Flexibility Is Key to a Modern IAM Solution

An AI-Native Network Puts Location Services at the Core of the Student Experience

Upskill Your Students with the Right Mix of Training in Evolving Tech

Whitepapers

The Faculty Guide to Getting Started with Gen AI

From Complexity to Clarity: Securing Cloud Environments in Higher Education

4 Causes of Student Disengagement (& How to Overcome Them)

How to Overcome 4 Common Work Execution Challenges in Higher Education With Smartsheet