This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AIM Intelligence and BMW Group Examine Gaps in Evaluating Enterprise AI Policy Compliance

Research reveals LLMs follow allowlist policies but systematically fail to enforce organizational prohibitions, exposing a critical gap in enterprise AI safety

SF, CA, UNITED STATES, February 12, 2026 /EINPresswire.com/ — Seoul, South Korea / Munich, Germany – January 2026 – BMW Group and AIM Intelligence, a leading AI safety startup, today announced the publication of COMPASS (Company/Organization Policy Alignment Assessment), the first systematic framework for evaluating whether large language models (LLMs) comply with organization-specific policies. The research, now available on arXiv, reveals a critical gap that remains under-measured in current evaluation practices: models that pass standard safety benchmarks often fail dramatically when enforcing the nuanced, context-dependent rules that govern real-world business operations.

Why Enterprise AI Policies Break Down in Practice

As organizations across healthcare, finance, automotive, and government sectors rapidly adopt LLMs for customer-facing applications, the research team discovered a fundamental asymmetry that poses significant risks for policy-critical deployments.
Key Findings:
Strong Allowlist Compliance: Models reliably handle legitimate requests with over 95% accuracy
Critical Denylist Failures: Models fail to correctly refuse prohibited requests in up to 97% of cases
Catastrophic Adversarial Vulnerability: Under adversarial conditions, some models refuse fewer than 5% of policy-violating requests
“Most AI safety tests focus on whether a model behaves safely in general,” said Dasol Choi, AI Safety Researcher at AIM Intelligence. “COMPASS looks at a more practical question: can an AI system reliably follow the specific rules of an organization? Our findings show that, in many real-world deployments today, the answer is often no.”

Why Generic AI Safety Isn’t Enough

The research addresses a critical disconnect between how AI systems are evaluated and how they are deployed. While existing safety benchmarks focus on universal harms such as toxicity and violence, real enterprises operate under complex internal policies—compliance manuals, operational playbooks, legal edge cases, and brand-specific constraints.
COMPASS evaluates models across four dimensions that typical benchmarks ignore:
1. Policy Selection: Can the model identify which policy applies to a given situation?
2. Policy Interpretation: Can it reason through conditionals, exceptions, and vague clauses?
3. Conflict Resolution: When rules collide, does the model resolve conflicts as the organization intends?
4. Justification: Can the model ground its decisions in actual policy text?

“Our evaluation revealed a striking asymmetry,” noted DongGeon Lee, AI Safety Researcher at AIM Intelligence. “While models achieve near-perfect accuracy on what they can do, they remain structurally vulnerable in enforcing what they must not do. This gap persists across model scales and architectures, indicating that scaling alone cannot solve the problem.”

Industry-Scale Validation

The research team applied COMPASS across eight diverse industry scenarios—Automotive, Government, Financial, Healthcare, Travel, Telecom, Education, and Recruiting—generating and validating 5,920 queries that test both routine compliance and adversarial robustness. Fifteen state-of-the-art models were evaluated, including leading proprietary and open-source systems.

Making Misalignment Measurable

Perhaps the most significant contribution of COMPASS is transforming alignment from a philosophical concern into an engineering problem. The framework and benchmark datasets are publicly available on GitHub and Hugging Face, enabling organizations to evaluate their AI systems against their own policies.

About the Research Collaboration

This research represents a collaboration between AIM Intelligence, BMW Group, Yonsei University, Pohang University of Science and Technology, and Seoul National University. The full paper, “COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs,” is available at https://arxiv.org/abs/2601.01836.

About AIM Intelligence

AIM Intelligence is a Seoul-based AI safety company specializing in automated red-teaming, real-time guardrails, and AI monitoring solutions. Founded in 2024, AIM Intelligence serves major enterprises and conducts research across large language models, multimodal systems, autonomous agents, and emerging physical AI. The company has published over 15 research papers at top-tier conferences including ICML, ACL, NeurIPS, and IEEE.

Team Cookie Official
Team Cookie
email us here
Visit us on social media:
LinkedIn
Facebook

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

Selling a Business in Florida 2026: Complete Guide Released for Business Owners

Selling a Business in Florida 2026: Complete Guide Released for Business Owners

Consumers can learn about how to sell a business in Florida successfully through IRAEmpire's new and updated guide for

February 17, 2026

Steinberg Law Firm Attorneys Honored with Elite Lawyer 2026 Recognition

Steinberg Law Firm Attorneys Honored with Elite Lawyer 2026 Recognition

North Charleston, South Carolina – Eight attorneys at Steinberg Law Firm have been recognized for excellence in the

February 17, 2026

Winter-Ready Water and Aggregate Heating Options for Concrete Production

Winter-Ready Water and Aggregate Heating Options for Concrete Production

Process Water Solutions portfolio covers rapid-start industrial hot water, dry-heat aggregate warming, and

February 17, 2026

Spotter AI Launches Sentinel To Cut Hiring Costs And Improve Trucking Safety

Spotter AI Launches Sentinel To Cut Hiring Costs And Improve Trucking Safety

Spotter Sentinel brings automated hiring, compliance monitoring, scoring, and recruiting tools to the trucking industry

February 17, 2026

Next Day Access Greensboro & Winston-Salem: New Franchise Location

Next Day Access Greensboro & Winston-Salem: New Franchise Location

Next Day Access, a national leader in mobility and accessibility solutions, is honored to announce the opening of Next

February 17, 2026

Avionica Defines the Future of Airline Operations with Real-Time Aircraft Data

Avionica Defines the Future of Airline Operations with Real-Time Aircraft Data

Live aircraft sensor streaming platform connecting planes directly to cloud analytics, transforming safety,

February 17, 2026

Cut to Black Prize Recognized as the Industry’s Only Genuine Invitation-Only Screenwriting Contest

Cut to Black Prize Recognized as the Industry’s Only Genuine Invitation-Only Screenwriting Contest

The industry’s only true invitation-only screenwriting contest, Cut to Black Prize prioritizes blind judging, fairness,

February 17, 2026

Premier Media Group Launches ‘Southwest Travel & Life’

Premier Media Group Launches ‘Southwest Travel & Life’

The new magazine is the seventh brand and the second travel brand in the company’s portfolio. We are excited to bring a

February 17, 2026

Digital Neighbor Receives 2026 Workplace Award for Excellence in SEO Agency Culture

Digital Neighbor Receives 2026 Workplace Award for Excellence in SEO Agency Culture

TAMPA, FL – February 17, 2026 – PRESSADVANTAGE – Digital Neighbor, a Tampa-based search engine optimization firm, has

February 17, 2026

RestoPros of Northeast Georgia Outlines Seven-Step Recovery Process

RestoPros of Northeast Georgia Outlines Seven-Step Recovery Process

LAWRENCEVILLE, GA – February 17, 2026 – PRESSADVANTAGE – A comprehensive guide detailing the systematic approach to

February 17, 2026

Grace Point Treatment Center Introduces New Website Resource Examining the Signs of Functional Alcoholism in Relationships

Grace Point Treatment Center Introduces New Website Resource Examining the Signs of Functional Alcoholism in Relationships

FORT LAUDERDALE, FL – February 17, 2026 – PRESSADVANTAGE – A newly released educational resource from Grace Point

February 17, 2026

Carrot LASIK & Eye Center Outlines Key Lens Implant Options for Refractive Lens Exchange in New Educational Article

Carrot LASIK & Eye Center Outlines Key Lens Implant Options for Refractive Lens Exchange in New Educational Article

MESA, AZ – February 17, 2026 – PRESSADVANTAGE – Carrot LASIK & Eye Center has released a detailed overview titled

February 17, 2026

Rocket CRM Announces Expanded Marketing Automation Feature to Strengthen Structured Customer Communication Workflows

Rocket CRM Announces Expanded Marketing Automation Feature to Strengthen Structured Customer Communication Workflows

Los Angeles, California – February 17, 2026 – PRESSADVANTAGE – Rocket CRM has announced the expansion of its Marketing

February 17, 2026

Like Father Like Son Roofing Expands Roof Repair Services Throughout Monett Missouri

Like Father Like Son Roofing Expands Roof Repair Services Throughout Monett Missouri

PURDY, MO – February 17, 2026 – PRESSADVANTAGE – Like Father Like Son Roofing and Construction LLC has expanded its

February 17, 2026

Conifer Gutter Service Launches Energy Efficient Drainage Systems for Colorado Properties

Conifer Gutter Service Launches Energy Efficient Drainage Systems for Colorado Properties

Conifer, Colorado – February 17, 2026 – PRESSADVANTAGE – Conifer Gutter Service has launched a new line of energy

February 17, 2026

Webmaster Pub Introduces Comprehensive WordPress Development Services for Swiss Businesses in Winterthur

Webmaster Pub Introduces Comprehensive WordPress Development Services for Swiss Businesses in Winterthur

WINTERTHUR, CH – February 17, 2026 – PRESSADVANTAGE – Webmaster Pub, a professional web design company based in

February 17, 2026

BEYOND SLOWING DOWN: GRIECO AUTOMOTIVE GROUP WARNS OF LESSER-KNOWN WINTER DRIVING DANGERS

BEYOND SLOWING DOWN: GRIECO AUTOMOTIVE GROUP WARNS OF LESSER-KNOWN WINTER DRIVING DANGERS

JOHNSTON, RI, UNITED STATES, February 17, 2026 /EINPresswire.com/ — As winter conditions continue across Rhode Island,

February 17, 2026

Alejandro Hernandez Secures Texas Life Insurance License, Expands Integrated UHNW Advisory & Attorney Referral Platforme

Alejandro Hernandez Secures Texas Life Insurance License, Expands Integrated UHNW Advisory & Attorney Referral Platforme

Alejandro Hernandez Secures Texas Life Insurance License, Expands Integrated UHNW Advisory & Attorney Referral

February 17, 2026

Michael Martin Murphey Joins the Lone Star Cowboy Poetry Gathering Lineup in Bastrop

Michael Martin Murphey Joins the Lone Star Cowboy Poetry Gathering Lineup in Bastrop

Award-winning singer-songwriter added to “Tres Amigos” performance at Bastrop Convention & Exhibit Center It's a

February 17, 2026

Next Hour Voted Best Santa Clarita Garage Door Repair; Fleet Expands for 24/7 Near Me Service

Next Hour Voted Best Santa Clarita Garage Door Repair; Fleet Expands for 24/7 Near Me Service

Voted Best of Santa Clarita. Next Hour expands 24/7 fleet for garage door repair near me in Valencia, Saugus &

February 17, 2026

Virtue Solar Now Offers TPO Solar Financing

Virtue Solar Now Offers TPO Solar Financing

Rising electricity rates, falling solar costs, and new $0-down financing options mean the answer is still yes —

February 17, 2026

African Adventure Specialists Reports Operations Across Five East African Destinations

African Adventure Specialists Reports Operations Across Five East African Destinations

African Adventure Specialists – Travel experiences across Kenya, Tanzania, Uganda, Rwanda, and Zanzibar. the company

February 17, 2026

BPI Recognized as 3M Supplier of the Year

BPI Recognized as 3M Supplier of the Year

Contract manufacturing partner recognized for year-over-year performance and collaboration We approach our relationship

February 17, 2026

MSI² fortalece cooperación de defensa hemisférica con nombramientos estratégicos en Chile

MSI² fortalece cooperación de defensa hemisférica con nombramientos estratégicos en Chile

El Instituto incorpora a Enzo Ibaceta como Enlace para Chile y al Contraalmirante (R) Leonardo Quijarro Santibáñez como

February 17, 2026

Eccentex and Enforce Assist Announce Strategic Investment in Next-Generation Law Enforcement Records Management

Eccentex and Enforce Assist Announce Strategic Investment in Next-Generation Law Enforcement Records Management

Workflow-centric Records Management System aims to redefine usability, officer productivity, and data lifecycle

February 17, 2026

Meditative Animal Deals Royal Flush in Hearts with New Album Metaphysical Sherpa: Karmic Poker & Announces Digital Event

Meditative Animal Deals Royal Flush in Hearts with New Album Metaphysical Sherpa: Karmic Poker & Announces Digital Event

Dr. Duddha releases the official audio companion to his book Metaphysical Sherpa: Misunderstood Mystic (Karmic Poker

February 17, 2026

Ramzi Najjar Advances the Law of Alignment with Empirical Evidence on the Path to Systemic Collapse

Ramzi Najjar Advances the Law of Alignment with Empirical Evidence on the Path to Systemic Collapse

System Theorist and Creator of Post-Performance Philosophy (PPP) Releases Data-Driven Study on Structural Drift and

February 17, 2026

Rayse and OneKey MLS Partner to Bring Agent Value and Transparency to New York Metro REALTORS®

Rayse and OneKey MLS Partner to Bring Agent Value and Transparency to New York Metro REALTORS®

Rayse continues 2026 momentum with another major MLS launch — delivering agent‑centric education and engagement support

February 17, 2026

Grease Management Experts Highlight Operational and Compliance Risks as Beef Tallow Use Expands in Commercial Kitchens

Grease Management Experts Highlight Operational and Compliance Risks as Beef Tallow Use Expands in Commercial Kitchens

Beef tallow use rises in SoCal; The Grease Company ensures safe, compliant collection, disposal, and sustainable

February 17, 2026

Signature Leaders Announces Acquisition by TiER1 Impact

Signature Leaders Announces Acquisition by TiER1 Impact

Signature Leaders has been acquired by TiER1 Impact, an employee-owned professional services development company

February 17, 2026

Call HR: New Video Podcast Brings Unfiltered Conversations with HR Leaders from Google, Snowflake, Netflix, Hulu & More

Call HR: New Video Podcast Brings Unfiltered Conversations with HR Leaders from Google, Snowflake, Netflix, Hulu & More

CEO Michelle Volberg launches the first podcast dedicated to the real stories behind talent and people leadership at

February 17, 2026

Teqtivity Analysis: IT Staffing Crisis Leaves Organizations Blind to Hardware Assets, Creating Security Gaps

Teqtivity Analysis: IT Staffing Crisis Leaves Organizations Blind to Hardware Assets, Creating Security Gaps

Lean IT Teams Struggle to Track Devices as Manual Asset Management Breaks Down at Scale CERRITOS, CA, UNITED STATES,

February 17, 2026

Green Fields School Ranked #1 Private School in Tucson by Niche

Green Fields School Ranked #1 Private School in Tucson by Niche

Green Fields named #1 private school in Tucson by Niche, recognized for academic excellence, college prep, and strong

February 17, 2026

Houston Premises Liability Attorney Joe I. Zaid Expands Focus on Injuries from Unsafe Properties Across Texas

Houston Premises Liability Attorney Joe I. Zaid Expands Focus on Injuries from Unsafe Properties Across Texas

HOUSTON, TX, UNITED STATES, February 17, 2026 /EINPresswire.com/ — Houston personal injury lawyer Joe I. Zaid of Joe

February 17, 2026

io Health Expands Care Optimized™ Platform to Support Home Health Agencies with Real-Time HOPE Assessment Validation

io Health Expands Care Optimized™ Platform to Support Home Health Agencies with Real-Time HOPE Assessment Validation

Advancing Documentation Infrastructure to Ensure Compliance with Federal HOPE Assessment Mandates and Reduce Clinician

February 17, 2026

Beyond Ride and Côte Bonneville Hint at a More Accessible, Joy-Filled Future for Seniors in Tacoma

Beyond Ride and Côte Bonneville Hint at a More Accessible, Joy-Filled Future for Seniors in Tacoma

Beyond Ride and Côte Bonneville Hint at a More Accessible, Joy-Filled Future for Seniors in Tacoma TACOMA, WASHINGTON,

February 17, 2026

Statement of Attorney Jeffery M. Leving on the Death of the Rev. Jesse Jackson

Statement of Attorney Jeffery M. Leving on the Death of the Rev. Jesse Jackson

CHICAGO, IL, UNITED STATES, February 17, 2026 /EINPresswire.com/ — The following is the statement of attorney Jeffery

February 17, 2026

Meljestic Spa Offers Expert Laser Hair Removal and Facial Treatments in Cooper City, FL

Meljestic Spa Offers Expert Laser Hair Removal and Facial Treatments in Cooper City, FL

Cooper City med spa provides customized skincare treatments including laser hair removal, microneedling, and anti-aging

February 17, 2026

Serendipity Labs Costa Mesa Hosts Sunset Business Social & Open House for Orange County Professionals

Serendipity Labs Costa Mesa Hosts Sunset Business Social & Open House for Orange County Professionals

Serendipity Labs Costa Mesa welcomes local leaders for an evening of networking, tours, sponsored bites and drinks,

February 17, 2026

Congresswoman Sara Jacobs Visits GALT Aerospace

Congresswoman Sara Jacobs Visits GALT Aerospace

GALT Aerospace welcomed Congresswoman Sara Jacobs to its HQ for a visit focused on defense innovation, workforce

February 17, 2026