Cognium publishes original research on AI agent security. Advisories, benchmarks, quarterly reports, and methodology documentation.
Monthly category-level disclosures on AI agent vulnerabilities. Numbered CA-YYYY-NNN format with permanent URLs.
Reproducible vulnerability detection benchmarks. We measured 42.5% (SAST-only) and 81.7% (SAST+LLM) on CWE-Bench-Java.
Quarterly reports on the AI agent ecosystem. Q1: Skills. Q2: Agents. Q3: OSS. Q4: Supply Chain.
How we scan, score, and verify. Dataset documentation, evaluation harness, and limitations. arXiv preprint coming.
Reproducible benchmark summaries, demo Spaces, and evaluation metadata are published under CogniumHQ on Hugging Face.
CWE-Bench-Java, 120 projects. Cognium SAST alone. CodeQL on same dataset: 22.5%. Reproduce it yourself.
Same dataset. SAST + LLM verification layer. 3.6x improvement over CodeQL baseline. Repo ships Week 3.