Overview
Data sources are the foundation of your AI agent’s knowledge base. They provide the information the agent uses to answer customer questions.Types of Data Sources
Files
Upload documents (PDF, Word, text, etc.) that are automatically processed: Supported formats:- PDF (
application/pdf) - Word Documents (
.docx,.doc) - Text files (
.txt) - HTML (
.html) - Markdown (
.md) - PowerPoint (
.pptx,.ppt)
Q&A Pairs
Create question-answer pairs for common queries:Text Content
Add custom text snippets:Websites
Scrape website contentAdding Data Sources
Via Web Interface
- Navigate to Agents → Your Agent → Data Sources
- Choose data source type
- Upload file, add Q&A, or enter text
- Data source is automatically linked to agent
Processing Status
File data sources track processing status:Querying Data Sources
Once processed, data sources are searchable via vector search:Best Practices
- Organize by topic: Group related data sources together
- Keep content updated: Update data sources as information changes
- Use Q&As for common questions: Faster and more accurate than searching documents
- Test after adding: Verify agent can find and use new content
- Monitor processing: Check file processing status before relying on new files
Troubleshooting
Files not processing
- Check SolidQueue is running:
bin/rails solid_queue:start - Verify environment variables:
UNSTRUCTURED_API_KEY,OPENAI_API_KEY,TURBOPUFFER_API_KEY - Check job dashboard:
http://localhost:3000/jobs
Agent not finding content
- Verify data source is linked to agent
- Check processing status is “completed”
- Test vector search directly
- Review agent’s knowledge base in UI
Next Steps
Procedures
Create structured procedures with conditional logic
File Processing Setup
Configure file processing pipeline