Enterprise AI Safety, Without the Hype - Trends in AI: October 2025

Dinos Papakostas
Oct 13
4 min read

For enterprises and organizations deploying AI applications in production, AI Safety is no longer optional. And by AI Safety we don't mean some abstract "AI risk", but the practical policies, controls, and engineering that keep systems secure, compliant, and trustworthy.

In this post, we recap our latest Trends in AI webinar, highlight the main concerns around AI enterprise deployments, and outline pragmatic steps to safely and securely bring AI apps to production.

*** Join our Luma community to stay up to date with our upcoming events! ***

Adoption is real; trust is the bottleneck

The latest State of AI report shows enterprise adoption and retention of AI is high, with pilots turning into large-scale deployments and budgets following suit. What slows production isn't model quality so much as trust: compliance, security, and privacy consistently top the reasons executives hesitate to green-light use cases, according to recent executive reports from McKinsey and Strategy.

AI adoption and spending rise with increased retention in enterprises. Graphs show trends, highlighting Ramp's projected growth to $1M by 2026. — *Figure from the* *State of AI Report 2025*.

Meanwhile, Shadow AI is growing. Surveys suggest roughly nine in ten employees now use LLMs for work, while fewer than half of organizations hold enterprise subscriptions. That gap means personal accounts, personal devices, and public networks are touching company data. Risks range from leaks and unvetted coding suggestions to incorrect answers based on public rather than internal knowledge. It's a productivity surge with a governance hole.

Bar chart titled "Exhibit: shadow AI economy". Light blue shows 40% company subscriptions, dark blue shows 90% employee usage of LLMs. — *Figure from the* *State of AI in Business 2025*.

Turning policy into practice with ISO 42001

Our guest, Maarten Stolk of Deeploy, walked us through ISO 42001 - the emerging standard for AI management systems (AIMS) - as a practical blueprint. Start small but real: define roles, minimal policies, and a lifecycle you can actually run. Map those principles to concrete controls per use case and phase (from ideation to monitoring). Build an AI registry that covers both built and procured systems; avoid static spreadsheets and connect it to code, inference, and logs where possible. Document artifacts with model and data cards, and capture required assessments (e.g., DPIA/FRIA) with accountable sign-offs.

From there, instrumentation matters. Trace inputs, outputs, versions, and performance so you can answer "what changed?" and act fast when drift or degradation appears. Keep humans in the loop with oversight and explainability that makes outputs inspectable and correctable. The point isn't paperwork, but rather integrating governance with your MLOps to scale AI without flying blind.

Regulation without the whiplash

Rules are diverging by region, but the underlying themes rhyme. On the research front, Credo AI proposed the Unified Control Framework to translate different jurisdictions into a shared map of risks and controls, which is useful when you operate across borders.

Flowchart of policy requirements and risk taxonomy mapping to a unified control library, featuring a focus on AI system access controls. — *Figure from Eisenberg et al. (2025)*

California's new SB 53 is notably more pragmatic than last year's SB 1047. It emphasizes disclosure and risk reporting for the largest frontier developers (above a high compute threshold), drops ideas like mandated kill switches, and keeps penalties meaningful but targeted. Contrast that with the EU AI Act's broader scope and tougher fines. However, the US patchwork is anticipated to keep evolving at the state and even city level, while Europe's single framework is easier to plan against, even if it's not universally loved.

Comparison chart of California SB 53 and EU AI Act features like scope, thresholds, transparency, penalties, and enforcement.

What safe-by-design looks like

On the engineering side, a few foundations keep coming up. Identity and isolation come first: multi-tenancy, Single Sign-On (SSO), and Role-Based Access Control (RBAC) must propagate into search, RAG, and agents so users only see what they're entitled to see. Logging and auditability across models, prompts, tools, and retrieval enable explainability and safe rollbacks, while encryption at rest and in transit is table stakes.

Model strategy is a trade-off; on-premise open weights give maximum control and minimal red tape, but require serious infrastructure and MLOps, and they still trail the very top models on capability. AI-infra-as-a-service providers (e.g., Together AI, Groq) buy flexibility and cost control, but add vendor diligence in the deployment requirements. Direct access to frontier labs offers the best ergonomics and performance, but for sensitive data you'll need to negotiate residency, retention, and contractual protections.

Comparison table of AI models: Performance, Ease of Use, and Red Tape. Emojis show varying satisfaction levels: sad, shrug, 100, crying, thumbs up, okay, grimacing, thinking.

The new threat surface

As agents gain tools and autonomy, the attack surface expands. A widely discussed browser-agent demo showed how a single malicious instruction could coax an agent into posting a sensitive token - a classic example of prompt injection that happens when web content is not treated as untrusted input.

In research, the Silent Saboteur shows how a single adversarial document can reliably manipulate the answers of a black-box RAG system, while other work, such as "Your Agent May Misevolve" and "Breaking the Code", finds that memory and tool use (i.e., context overloading) can quietly undo alignment, producing harmful responses or executing erroneous actions despite safeguards.

Flowchart of an adversarial example generation process using RL with surrogate retriever and LLM. Documents ranked, rewards assessed. — *Figure from Song et al. (2025)*

Diagram showing robot agent's self-evolution issues, with misuse scenarios and undesirable outcomes like refund errors, data leaks, and privacy risks. — *Figure from Shao et al. (2025)*

Diagram showing three attacker levels (naive, capable, expert) and Python code prompts interacting with code agents. Includes a judge framework. — *Figure from Saha et al. (2025)*

On the bright side, defenses are evolving too. Google DeepMind previewed CodeMender, a system that proactively scans open-source repositories and proposes fixes with humans in the loop. And while guardrails help - with major cloud providers now offering hosted options, and open source models like NVIDIA's NeMo GuardRails and Meta's Llama Guard serving as a solid foundation - they aren't a silver bullet. As No Free Lunch with Guardrails shows, coverage is imperfect and LLM-in-the-loop filters add latency. Treat guardrails as defense-in-depth, not as the whole plan.

Table comparing AI models: Open-Source, Provider, LLM-Based. Columns: PHTest, GuardrailsAI, Arena, Awesome, NoRobots, Latency. — *Table from Kumar et al. (2025)*

Practical takeaways

In this post, we've covered the reality of adoption (and why trust is the bottleneck), the risks of shadow AI, how to operationalize governance with ISO 42001, how to navigate a shifting regulatory map, what secure-by-design engineering looks like in practice, and why the agent era expands the threat surface alongside the defenses.

At Zeta Alpha, we frequently work with sensitive, proprietary R&D data for knowledge-intensive companies such as BASF, Envalior, and Festo. That's why we integrate SSO and RBAC across our Agentic RAG and Deep Research solutions, and deploy our AI-native knowledge management platform in clients' private clouds or on-premises. Do you want to explore how to safely bring AI apps to production with a clear path to value? Reach out to set up an introductory call.

For more insights and detailed coverage, watch the full webinar recording below, and join our Luma community for upcoming discussions and events.

Until next time, enjoy discovery!