Metano SkillTracer: SAST and DAST for Skills | San Francisco .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

June 25, 2026 · San Francisco

Metano SkillTracer: Agent Skill Scanner

Learn how Metano SkillTracer uses SAST and DAST to scan agent skills for malicious code before loading, mapping findings to OWASP Agentic and MITRE ATLAS.

Overview
Links
Tech stack
  • OWASP Agentic
    A critical security framework mapping the top ten vulnerabilities for autonomous, tool-wielding AI systems.
    The OWASP Agentic Security Initiative (ASI) establishes a definitive, peer-reviewed security framework specifically for autonomous AI agents that plan, use tools, and execute actions independently. Moving beyond standard LLM risks, this framework targets agent-specific vulnerabilities: including Agent Goal Hijacking (ASI01), Tool Misuse (ASI02), and Cascading Multi-Agent Failures (ASI08). It equips security engineers and system architects with concrete mitigation strategies (such as sandboxed execution environments, strict identity delegation, and human-in-the-loop guardrails) to safely deploy agentic workflows in enterprise environments.
  • MITRE ATLAS
    MITRE ATLAS: The definitive, globally accessible knowledge base cataloging adversary tactics and techniques against AI and machine learning systems.
    MITRE ATLAS (Adversarial Threat Landscape for Artificial-Intelligence Systems) is the community-driven, structured knowledge base for AI security. Modeled directly after MITRE ATT&CK, it catalogs adversary tactics, techniques, and procedures (TTPs) specifically targeting AI and machine learning systems. The framework currently details 15 tactics, 66 techniques, and 33 real-world case studies (as of its October 2025 update) to inform defense strategy. Security teams leverage ATLAS for threat modeling, red teaming, and developing threat-informed defenses against attacks like model poisoning and prompt injection. It provides the necessary common vocabulary to align security analysts and AI developers, ensuring comprehensive coverage across the entire AI lifecycle.
  • LLM
    Large Language Models (LLMs) are deep learning models, built on the Transformer architecture, that process and generate human-quality text and code at scale.
    LLMs are a class of foundation models: massive, pre-trained neural networks (often with billions to trillions of parameters) that leverage the self-attention mechanism of the Transformer architecture (introduced in 2017) to predict the next token in a sequence. Trained on vast datasets (e.g., Common Crawl's 50 billion+ web pages), these models—like GPT-4, Gemini, and Claude—acquire predictive power over syntax and semantics. They function as general-purpose sequence models, enabling critical applications such as complex content generation, language translation, and automated code completion (e.g., GitHub Copilot). Their core value: generalizing across diverse tasks with minimal task-specific fine-tuning.
  • Sandbox
    A secure, isolated desktop environment to run untrusted software without risking host system integrity.
    Sandboxing isolates active processes and untrusted code within a restricted virtual boundary, preventing malicious software from accessing host system files or local network resources. In enterprise security, tools like Windows Sandbox spin up lightweight, disposable virtual machines using hardware-based virtualization (Hyper-V technology) to execute suspicious files or test new applications. Once the user closes the session, the entire environment is permanently discarded: all generated files, registry changes, and temporary data are wiped in milliseconds, leaving zero footprint on the physical machine.
  • Static Analysis
    Static analysis inspects source code without execution to identify security vulnerabilities, logic flaws, and compliance violations early in the SDLC.
    Static Analysis (SAST) operates as a high-speed debugger for uncompiled code. By scanning the codebase against predefined rule sets (like OWASP Top 10 or CWE), tools such as SonarQube and Snyk identify critical risks like SQL injection or buffer overflows before a single line of code runs. This shift-left approach reduces remediation costs by up to 100x compared to post-release fixes. It ensures consistent code quality across large engineering teams by automating peer reviews and enforcing strict syntax standards.