openclaw - 💡(How to fix) Fix [Ballast] Research agents — automated PubMed/web sourcing for SKU vetting [1 comments, 1 participants]

dbental · 2026-04-22T08:42:08Z

[openclaw] Build automated research agents that search the web and PubMed, extract study data, and produce professional vetting conclusions for supplement cand… Build automated research agents that search the web and PubMed, extract study data, and produce professional vetting conclusions for supplement candidates. This automates the evidence research layer of the vetting pipeline. ## Overview Build automated research agents that search the web and PubMed, extract study data, and produce professional vetting conclusions for supplement candidates. This automates the evidence research layer of the vetting pipeline. ## Problem Manual SKU vetting (issue #69867) requires reading PubMed papers and extracting: - Mechanism of action - Study design (RCT, observational, etc.) - Effect size (Cohen's d, odds ratio, etc.) - Sample size and population - Dose used in study vs. product dose - Limitations and conflicts of interest This is time-consuming to do manually at scale. An agent pipeline can do the first-pass research, leaving the human to review conclusions rather than read raw papers. ## Proposed agent design ### Agent 1 — PubMed Scout - Input: ingredient name + health category - Searches PubMed via Entrez API (free, no key needed) - Returns: top 5–10 most relevant studies by relevance + recency - Extracts: title, authors, journal, year, abstract, PMID, DOI - Output: structured study list with links ### Agent 2 — Evidence Extractor - Input: PubMed abstract (or full text if available via PMC) - Uses LLM to extract: - Study type (RCT / meta-analysis / observational / case study) - Population (n, age, condition) - Intervention (dose, form, duration) - Primary outcome + effect size - Limitations acknowledged by authors - Output: structured evidence record matching vetting.mjs schema ### Agent 3 — Vetting Analyst - Input: evidence records from Agent 2 + product spec from supplier - Applies T1–T4 rubric + G1–G4 gates (from PRODUCT_VETTING_PROTOCOL.md) - Produces verdict: approve / hitl_required / auto_reject - Writes mechanism_one_sentence, nonclaims[], weak_evidence notes - Flags: hallucinated URLs, under-dosed products, unsupported claims - Output: vetting verdict matching runVetting() return schema ### Agent 4 — Supplier Cross-check - Input: approved ingredient name - Searches CJ Dropshipping catalog for matching products - Checks: spec sheet availability, US warehouse, dose per serving - Returns: sourcing_url, spec_sheet_url, price, moq ## Stack - PubMed: NCBI Entrez E-utilities API (free) - Web search: Tavily or Firecrawl (already in extensions/) - LLM: chatWithFallback() from lib/ollama_cloud.mjs - Output: feeds directly into vetting.mjs and initial_pass JSON ## Integration point These agents slot into the pipeline between Agent A (radar/trend discovery) and Agent B (store builder): ``` radar_candidates → Research Agents (PubMed + web) → vetting.mjs → B.5 product listing ``` ## Acceptance criteria - Agent 1 returns ≥5 relevant PubMed studies for any of the 15 initial SKU candidates - Agent 2 correctly extracts study type, n, dose, effect size from abstracts (spot-check 5 papers) - Agent 3 produces verdicts matching manual vetting for ≥80% of initial SKU pass (#69867) - All PubMed URLs verified live (HEAD fetch — no hallucinated PMIDs) - Full pipeline run on one candidate takes < 60 seconds ## Related - #69867 Initial SKU research (manual baseline to validate against) - #69866 vetting.mjs (consumes agent output) - data/scripts/PRODUCT_VETTING_PROTOCOL.md - extensions/firecrawl/, extensions/tavily/ (existing search tools)

openclaw2026-04-22 08:42:08

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

openclaw/openclaw#70077•Fetched 2026-04-23 07:29:36

View on GitHub

Comments

Participants

Timeline

Reactions

Author

dbental

Participants

dbental

Timeline (top)

commented ×1

Build automated research agents that search the web and PubMed, extract study data, and produce professional vetting conclusions for supplement candidates. This automates the evidence research layer of the vetting pipeline.

Root Cause

Code Example

radar_candidates → Research Agents (PubMed + web) → vetting.mjs → B.5 product listing

RAW_BUFFERClick to expand / collapse

Overview

Problem

Manual SKU vetting (issue #69867) requires reading PubMed papers and extracting:

Mechanism of action
Study design (RCT, observational, etc.)
Effect size (Cohen's d, odds ratio, etc.)
Sample size and population
Dose used in study vs. product dose
Limitations and conflicts of interest

This is time-consuming to do manually at scale. An agent pipeline can do the first-pass research, leaving the human to review conclusions rather than read raw papers.

Proposed agent design

Agent 1 — PubMed Scout

Input: ingredient name + health category
Searches PubMed via Entrez API (free, no key needed)
Returns: top 5–10 most relevant studies by relevance + recency
Extracts: title, authors, journal, year, abstract, PMID, DOI
Output: structured study list with links

Agent 2 — Evidence Extractor

Input: PubMed abstract (or full text if available via PMC)
Uses LLM to extract:
- Study type (RCT / meta-analysis / observational / case study)
- Population (n, age, condition)
- Intervention (dose, form, duration)
- Primary outcome + effect size
- Limitations acknowledged by authors
Output: structured evidence record matching vetting.mjs schema

Agent 3 — Vetting Analyst

Input: evidence records from Agent 2 + product spec from supplier
Applies T1–T4 rubric + G1–G4 gates (from PRODUCT_VETTING_PROTOCOL.md)
Produces verdict: approve / hitl_required / auto_reject
Writes mechanism_one_sentence, nonclaims[], weak_evidence notes
Flags: hallucinated URLs, under-dosed products, unsupported claims
Output: vetting verdict matching runVetting() return schema

Agent 4 — Supplier Cross-check

Input: approved ingredient name
Searches CJ Dropshipping catalog for matching products
Checks: spec sheet availability, US warehouse, dose per serving
Returns: sourcing_url, spec_sheet_url, price, moq

Stack

PubMed: NCBI Entrez E-utilities API (free)
Web search: Tavily or Firecrawl (already in extensions/)
LLM: chatWithFallback() from lib/ollama_cloud.mjs
Output: feeds directly into vetting.mjs and initial_pass JSON

Integration point

These agents slot into the pipeline between Agent A (radar/trend discovery) and Agent B (store builder):

radar_candidates → Research Agents (PubMed + web) → vetting.mjs → B.5 product listing

Acceptance criteria

Agent 1 returns ≥5 relevant PubMed studies for any of the 15 initial SKU candidates
Agent 2 correctly extracts study type, n, dose, effect size from abstracts (spot-check 5 papers)
Agent 3 produces verdicts matching manual vetting for ≥80% of initial SKU pass (#69867)
All PubMed URLs verified live (HEAD fetch — no hallucinated PMIDs)
Full pipeline run on one candidate takes < 60 seconds

#69867 Initial SKU research (manual baseline to validate against)
#69866 vetting.mjs (consumes agent output)
data/scripts/PRODUCT_VETTING_PROTOCOL.md
extensions/firecrawl/, extensions/tavily/ (existing search tools)

extent analysis

TL;DR

Implementing the proposed agent pipeline with four specialized agents can automate the evidence research layer of the vetting pipeline, reducing manual effort in researching and vetting supplement candidates.

Guidance

Review the proposed agent design and ensure that each agent's input and output match the requirements of the vetting pipeline.
Verify that the Entrez API and LLM are properly integrated and configured to extract relevant study data from PubMed.
Test the pipeline with the initial 15 SKU candidates to ensure that Agent 1 returns relevant studies and Agent 2 correctly extracts study data.
Validate the output of Agent 3 against manual vetting results to ensure accuracy and consistency.

Example

No code snippet is provided as the issue does not contain specific code-related problems.

Notes

The success of the pipeline depends on the accuracy of the LLM in extracting study data and the effectiveness of the T1-T4 rubric and G1-G4 gates in producing verdicts. Regular monitoring and evaluation of the pipeline's performance are necessary to ensure its reliability and accuracy.

Recommendation

Apply the proposed agent pipeline workaround to automate the evidence research layer of the vetting pipeline, as it has the potential to significantly reduce manual effort and improve efficiency.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #ISR setup #authentication setup #request error #file not found

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

openclaw - 💡(How to fix) Fix [Ballast] Research agents — automated PubMed/web sourcing for SKU vetting [1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

Overview

Problem

Proposed agent design

Agent 1 — PubMed Scout

Agent 2 — Evidence Extractor

Agent 3 — Vetting Analyst

Agent 4 — Supplier Cross-check

Stack

Integration point

Acceptance criteria

Related

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

TRENDING

openclaw - 💡(How to fix) Fix [Ballast] Research agents — automated PubMed/web sourcing for SKU vetting [1 comments, 1 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Code Example

Overview

Problem

Proposed agent design

Agent 1 — PubMed Scout

Agent 2 — Evidence Extractor

Agent 3 — Vetting Analyst

Agent 4 — Supplier Cross-check

Stack

Integration point

Acceptance criteria

Related

extent analysis

TL;DR

Guidance

Example

Notes

Recommendation

Still need to ship something?

RELATED_DISCOVERY

TRENDING