Comprehensive AI-powered workflow for extracting, transcribing, summarizing, and analyzing content from various file types using Gemini LLM
Pipeline Overview

Pipeline Components
- Web Hook: Receives uploaded files via HTTP
- Parse/Process/Embed: Extracts and processes document content
- Audio – Transcribe: Converts audio/video to text (when applicable)
- Text – Summarization: LLM: Creates intelligent summaries using Gemini
- Response: Returns structured JSON output
How to Use the Pipeline
Start the Pipeline
- Run your pipeline in the Aparavi Engine
- Look for the Webhook URL message in the Project Log
- Copy the webhook URL (e.g., http://localhost:8080/webhook/...)
Configure API Testing Tool
We recommend using Talend API Tester for easy testing:
- Download: Talend API Tester Extension
- Alternative: Postman, curl, or any HTTP client
Request Configuration
| Field | Value | Description |
|---|---|---|
| Method | PUT | HTTP method for file upload |
| URL | Your webhook URL | The URL from Step 1 |
| Content-Type | Auto | Automatically set based on file type |
| Authorization | Your API key | Found in the webhook URL parameters |
Body Configuration
- Type: File
- Upload Method: Drag & drop or click to browse
- Supported Formats: PDF, DOC, DOCX, TXT, MP3, MP4, and more
Send and Process
- Upload your file to the request body
- Click “Send” to submit the request
- Wait for processing (typically 10-30 seconds)
- Check response status:
  - 200 OK: Success
  - Error codes: Check file format and size
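The same request can also be sent from a script instead of a GUI client. The following is a minimal sketch using only Python's standard library. It assumes the API key is sent as the raw value of the Authorization header (the exact header format is not specified here) and that the Content-Type can be guessed from the file name. The names `build_upload_request` and `upload_file` are illustrative and not part of the Aparavi Engine.

```python
import json
import mimetypes
import urllib.request
from pathlib import Path


def build_upload_request(webhook_url: str, api_key: str, file_path: str) -> urllib.request.Request:
    """Build a PUT request whose body is the raw file content."""
    path = Path(file_path)
    # Guess the Content-Type from the file extension; fall back to a generic binary type.
    content_type = mimetypes.guess_type(path.name)[0] or "application/octet-stream"
    return urllib.request.Request(
        webhook_url,
        data=path.read_bytes(),
        method="PUT",
        headers={
            "Content-Type": content_type,
            # Assumption: the key is passed as-is; adjust if your deployment expects a prefix.
            "Authorization": api_key,
        },
    )


def upload_file(webhook_url: str, api_key: str, file_path: str, timeout: float = 120.0) -> dict:
    """Send the file and return the parsed JSON response."""
    request = build_upload_request(webhook_url, api_key, file_path)
    # urlopen raises urllib.error.HTTPError for 4xx/5xx responses.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Call `upload_file` with the webhook URL copied from the Project Log. The 120-second default timeout is an arbitrary choice that leaves headroom over the typical 10-30 second processing time.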
Extract Results
Response Structure
{
  "data": {
    "objects": {
      "cce2fa78-f7fb-5a2e-b391-7c896aeda5cf": {
        "text": "Your processed content here..."
      }
    }
  }
}
Extracting Content
- Open the response JSON
- Navigate to: data/objects/[object-id]/text
- Copy the text content; this is your processed output
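The navigation steps above can be automated. This sketch assumes the response has already been parsed into a Python dictionary with the structure shown under Response Structure. The function name `extract_texts` is illustrative.

```python
def extract_texts(response: dict) -> dict[str, str]:
    """Map each object ID to its processed text, skipping objects without a "text" field."""
    objects = response.get("data", {}).get("objects", {})
    return {
        object_id: obj["text"]
        for object_id, obj in objects.items()
        if isinstance(obj, dict) and "text" in obj
    }


# Sample payload copied from the Response Structure section.
sample = {
    "data": {
        "objects": {
            "cce2fa78-f7fb-5a2e-b391-7c896aeda5cf": {
                "text": "Your processed content here..."
            }
        }
    }
}

for object_id, text in extract_texts(sample).items():
    print(object_id, text)
```

Iterating over all objects avoids hard-coding the object ID, which is assumed here to differ between uploads.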
Component Details
1. Web Hook Connector
Purpose: Receives HTTP file uploads and triggers pipeline processing
Configuration:
- Protocol: webhook://
- Class Type: source
- Capabilities: noinclude
- Register: endpoint
Supported Input Types: tags, text, audio, video, image
2. Parse/Process/Embed Connector
Purpose: Extracts content from various document formats and prepares for processing
Configuration:
- Protocol: autopipe://
- Class Type: other
- Capabilities: internal
- Register: filter
3. Audio – Transcribe Connector
Purpose: Converts audio and video content to text using Whisper models
Configuration:
- Protocol: audio_transcribe://
- Class Type: audio
- Register: filter
Model Options
| Model | Speed | Accuracy | Use Case |
|---|---|---|---|
| Tiny | Fastest | Lowest | Quick processing |
| Base | Fast | Low | General use |
| Small | Medium | Medium | Balanced |
| Medium | Slow | High | Quality focus |
| Large | Slowest | Highest | Best quality |
4. Text – Summarization: LLM Connector
Purpose: Creates intelligent summaries, key points, and entity extraction using Gemini LLM
Configuration:
- Protocol: summarization://
- Class Type: text
- Register: filter
- Invoke: Requires LLM connection
Configuration Options
| Setting | Description | Default |
|---|---|---|
| Number of Summaries | Chunks to summarize after document split | Optional |
| Summary Words | Words per summary (0 = disabled) | Optional |
| Key Point Words | Words per key point (0 = disabled) | Optional |
| Entities | Number of entities to extract (0 = disabled) | Optional |
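The "0 = disabled" convention in the table can be illustrated with a small settings object. This is a hypothetical sketch: the class and field names are invented for illustration and do not reflect the connector's actual configuration keys, and the defaults of 0 are an assumption.

```python
from dataclasses import dataclass


@dataclass
class SummarizationSettings:
    """Hypothetical container mirroring the settings table; field names are illustrative."""

    number_of_summaries: int = 0
    summary_words: int = 0
    key_point_words: int = 0
    entities: int = 0

    def __post_init__(self) -> None:
        # Reject negative values, since every setting is a count.
        for name, value in vars(self).items():
            if value < 0:
                raise ValueError(f"{name} must be >= 0, got {value}")

    def enabled_outputs(self) -> list[str]:
        """Return the outputs that are switched on (a value of 0 disables an output)."""
        outputs = []
        if self.summary_words > 0:
            outputs.append("summary")
        if self.key_point_words > 0:
            outputs.append("key_points")
        if self.entities > 0:
            outputs.append("entities")
        return outputs
```

For example, `SummarizationSettings(summary_words=100, entities=5)` enables summaries and entity extraction while leaving key points disabled.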
5. Response Connector
Purpose: Returns structured JSON responses with processed content
Configuration:
- Protocol: response://
- Class Type: target
- Register: filter
Supported File Types
Documents
- PDF (.pdf)
- Microsoft Word (.doc, .docx)
- Text Files (.txt)
- Rich Text (.rtf)
Media Files
- Audio: MP3, WAV, M4A, FLAC
- Video: MP4, AVI, MOV, MKV
- Images: JPG, PNG, GIF, TIFF
Other Formats
- Presentations: PPT, PPTX
- Spreadsheets: XLS, XLSX
- Web Content: HTML, XML
Error Handling
Common HTTP Status Codes
| Code | Meaning | Solution |
|---|---|---|
| 200 | Success | Processing completed |
| 400 | Bad Request | Check file format and size |
| 401 | Unauthorized | Verify API key |
| 404 | Not Found | Check webhook URL |
| 500 | Server Error | Restart pipeline |
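In a client script, the table above can be turned into a simple lookup. This is a sketch only: the dictionary and function names are illustrative, and the fallback message for undocumented codes is an assumption.

```python
# Status codes and suggested solutions, copied from the table above.
SUGGESTED_ACTIONS = {
    200: "Processing completed",
    400: "Check file format and size",
    401: "Verify API key",
    404: "Check webhook URL",
    500: "Restart pipeline",
}


def suggest_action(status_code: int) -> str:
    """Return the documented solution for a status code, or a generic note if none is listed."""
    return SUGGESTED_ACTIONS.get(
        status_code, f"No documented solution for status {status_code}"
    )


print(suggest_action(401))
```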
Troubleshooting Tips
- File Size: Ensure files are under 100MB
- Format: Use supported file types
- API Key: Verify authorization header
- Pipeline: Ensure all components are running
- Network: Check connectivity to webhook endpoint
Performance Considerations
Processing Times
| File Type | Size or Duration | Estimated Time |
|---|---|---|
| Text Document | < 1MB | 5-10 seconds |
| PDF Document | 1-10MB | 10-30 seconds |
| Audio File | 5-30 minutes | 30-60 seconds |
| Video File | 1-10 minutes | 1-3 minutes |
Security and Authentication
API Key Management
- Location: Found in webhook URL parameters
- Format: Long alphanumeric string
- Security: Keep private and secure
- Rotation: Change regularly for production use
Request Validation
The pipeline validates:
- File format compatibility
- File size limits
- API key authenticity
- Request method (PUT only)
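Some of these checks can be mirrored on the client before uploading, which avoids a round trip for requests that would be rejected. The sketch below is not the pipeline's own validation logic. It assumes "100MB" means 100 × 1024 × 1024 bytes and derives the extension spellings from the Supported File Types list, so the set may be incomplete. All names are illustrative.

```python
from pathlib import Path

# Assumption: "under 100MB" is interpreted as fewer than 100 * 1024 * 1024 bytes.
MAX_FILE_SIZE_BYTES = 100 * 1024 * 1024

# Assumption: extensions inferred from the Supported File Types list.
SUPPORTED_EXTENSIONS = {
    ".pdf", ".doc", ".docx", ".txt", ".rtf",
    ".mp3", ".wav", ".m4a", ".flac",
    ".mp4", ".avi", ".mov", ".mkv",
    ".jpg", ".png", ".gif", ".tiff",
    ".ppt", ".pptx", ".xls", ".xlsx", ".html", ".xml",
}


def preflight_problems(file_path: str, method: str = "PUT") -> list[str]:
    """Return human-readable problems; an empty list means the request looks valid."""
    problems = []
    path = Path(file_path)
    if method.upper() != "PUT":
        problems.append(f"method {method} is not accepted; use PUT")
    if path.suffix.lower() not in SUPPORTED_EXTENSIONS:
        problems.append(f"extension {path.suffix or '(none)'} is not in the listed formats")
    if not path.is_file():
        problems.append(f"{path} does not exist or is not a file")
    elif path.stat().st_size >= MAX_FILE_SIZE_BYTES:
        problems.append("file is 100MB or larger")
    return problems
```

API key authenticity can only be checked by the server, so it is not covered by this sketch.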
