Supported Models for Image Processing
| Model Name | Capabilities |
|---|---|
| meta-llama/Llama-3.2-90B-Vision-Instruct | Multi-modal vision model supporting image understanding. |
| Qwen/Qwen2-VL-7B-Instruct | Supports both text and image-based inputs for AI interactions. |
Sending an Image via API Request
The API supports two methods for sending an image:
- Passing an Image URL (recommended for publicly hosted images)
- Sending a Base64 Encoded Image (for local images)
The image URL must be publicly accessible. Private or authentication-required URLs will not work.
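For the image-URL method, the request body might look like the following minimal sketch. It assumes an OpenAI-compatible chat-completions schema; the endpoint placeholder and helper name are hypothetical, while the model name comes from the table above.

```python
import json

# Hypothetical endpoint placeholder; substitute your actual base URL.
API_URL = "https://YOUR_IONET_ENDPOINT/v1/chat/completions"

def build_image_url_payload(image_url: str, prompt: str) -> dict:
    """Build a chat-completions payload with one text part and one image part."""
    return {
        # Model name taken from the supported-models table above.
        "model": "meta-llama/Llama-3.2-90B-Vision-Instruct",
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
    }

payload = build_image_url_payload(
    "https://example.com/cat.png", "What is in this image?"
)
print(json.dumps(payload, indent=2))
```

POST this body with your usual HTTP client and an `Authorization: Bearer <API_KEY>` header; remember that the URL must be publicly reachable.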
Image Input Requirements
To ensure successful processing, images must meet the following requirements:

| Requirement | Details |
|---|---|
| Format | JPEG, PNG, WEBP, or GIF (static) |
| Max File Size | 20MB |
| Resolution | At least 512×512 pixels (recommended) |
| Max Dimensions | 4096×4096 pixels |
| Accessibility | If using a URL, ensure it is publicly accessible |
| Multi-Image Support | Up to 10 images per request |
Best Practices
- Optimize File Size: The limit is 20MB, but smaller files (1-5MB) process faster.
- Use Clear Images: Avoid blurry or low-resolution images for better AI analysis.
- Ensure Public URLs: If passing a URL, test it in a browser to confirm that it is accessible.
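The format and file-size requirements above can be pre-checked locally before sending a request. The sketch below uses only the standard library; the helper name is hypothetical, and dimension checks (512×512 minimum, 4096×4096 maximum) are omitted because they require an imaging library such as Pillow.

```python
import os

# Accepted formats and 20MB limit taken from the requirements table above.
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp", ".gif"}
MAX_BYTES = 20 * 1024 * 1024

def precheck_image(path: str) -> list[str]:
    """Return a list of problems found; an empty list means these checks pass."""
    problems = []
    ext = os.path.splitext(path)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or 'no extension'}")
    size = os.path.getsize(path)
    if size > MAX_BYTES:
        problems.append(f"file too large: {size} bytes (max {MAX_BYTES})")
    elif size > 5 * 1024 * 1024:
        problems.append("consider shrinking below 5MB for faster processing")
    return problems

# Demo with tiny placeholder files (not real images).
with open("sample.png", "wb") as f:
    f.write(b"\x89PNG placeholder")
with open("sample.bmp", "wb") as f:
    f.write(b"x")

print(precheck_image("sample.png"))
print(precheck_image("sample.bmp"))
```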
Expected API Response
Upon successful submission, the API returns a structured response with AI-generated insights based on the image.

Common Issues & Troubleshooting
| Issue | Possible Cause | Solution |
|---|---|---|
| "An image? I'm in text format, so I can't see it…" | The model does not support image input. | Use one of the supported vision models listed above. |
| "Invalid image format" | The image was not encoded properly. | Convert the image to base64 before sending. |
| "Unauthorized" | The API key is missing or incorrect. | Check that your API key is valid and correctly formatted. |
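The "convert image to base64" fix for local files can be sketched as follows. The helper name is hypothetical; the resulting data URL goes in the same slot as a public image URL, assuming an OpenAI-compatible request schema.

```python
import base64

def to_data_url(path: str, mime: str = "image/png") -> str:
    """Encode a local image file as a base64 data URL."""
    with open(path, "rb") as f:
        encoded = base64.b64encode(f.read()).decode("ascii")
    return f"data:{mime};base64,{encoded}"

# Demo with a tiny placeholder file (not a real image).
with open("pixel.png", "wb") as f:
    f.write(b"\x89PNG")

data_url = to_data_url("pixel.png")
print(data_url)
```

Match the `mime` argument to the actual file type (e.g. `image/jpeg` for JPEGs), and keep the encoded payload under the 20MB limit noted above.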