Models
Available models, capabilities, and pricing for each Persly API.
Persly offers specialized models for different medical AI tasks. Each model is optimized for its specific use case.
Model Overview
| Model | API | Pricing | Description |
|---|---|---|---|
| persly-chat-v1 | Chat | $0.02/request | Standard medical Q&A with source citations |
| persly-chat-pro-v1 | Chat | $0.07/request | Advanced reasoning with tool use (web search, medical databases) |
| persly-embed-v1 | Embeddings | $0.30/1M tokens | 768-dimensional medical text embeddings |
| persly-rerank-v1 | Rerank | $2.00/1K searches | Medical document relevance ranking |
| persly-finder-v1 | Finder | $0.005–$0.01/request | AI-powered medical information search |
Chat Models
persly-chat-v1
Standard medical chat model. Best for general medical Q&A where speed and cost matter.
- Pricing: $0.02 per request
- Features: Source citations, follow-up question suggestions
- Best for: Patient-facing FAQs, general medical queries, high-volume applications
persly-chat-pro-v1
Advanced medical chat model with enhanced reasoning capabilities and access to external tools.
- Pricing: $0.07 per request
- Features: Everything in persly-chat-v1, plus web search, medical database access, and deeper reasoning
- Best for: Complex medical questions, differential diagnosis support, clinical decision support
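To make the price difference between the two chat models concrete, here is a minimal sketch that estimates spend for a given request volume. It uses only the per-request prices listed above; the helper name `chat_cost` and the example volume are illustrative assumptions, not part of the Persly API.

```python
from decimal import Decimal

# Per-request prices in USD, copied from the model descriptions above.
CHAT_PRICE_PER_REQUEST = {
    "persly-chat-v1": Decimal("0.02"),
    "persly-chat-pro-v1": Decimal("0.07"),
}


def chat_cost(model: str, requests: int) -> Decimal:
    """Estimate the cost in USD of `requests` chat requests (hypothetical helper)."""
    if requests < 0:
        raise ValueError("requests must be non-negative")
    return CHAT_PRICE_PER_REQUEST[model] * requests


# Compare both models at an assumed volume of 10,000 requests.
for model in CHAT_PRICE_PER_REQUEST:
    print(model, chat_cost(model, 10_000))
```

At equal volume the pro model costs 3.5 times as much per request, so one reasonable pattern is to send routine questions to persly-chat-v1 and reserve persly-chat-pro-v1 for queries that need tool use or deeper reasoning.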
Embeddings Model
persly-embed-v1
Generates 768-dimensional embedding vectors optimized for medical text similarity and retrieval.
- Pricing: $0.30 per 1M tokens
- Dimensions: 768
- Max batch size: 100 texts per request
- Best for: Medical document search, semantic similarity, RAG pipelines
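Because a single request accepts at most 100 texts, larger corpora have to be split before they are sent. The sketch below shows one way to chunk a list of texts and estimate token cost from the listed price. The helper names `chunk_texts` and `embedding_cost` are hypothetical, and the token count is assumed to come from your own tokenizer or from usage data; no Persly client call is shown.

```python
from decimal import Decimal
from typing import Iterator, Sequence

MAX_BATCH_SIZE = 100  # documented limit: 100 texts per request
PRICE_PER_MILLION_TOKENS = Decimal("0.30")  # documented price in USD


def chunk_texts(texts: Sequence[str], size: int = MAX_BATCH_SIZE) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of `texts`, each holding at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(texts), size):
        yield texts[start:start + size]


def embedding_cost(total_tokens: int) -> Decimal:
    """Estimate the cost in USD of embedding `total_tokens` tokens."""
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    return PRICE_PER_MILLION_TOKENS * total_tokens / Decimal(1_000_000)


texts = [f"clinical note {i}" for i in range(250)]
print([len(batch) for batch in chunk_texts(texts)])  # [100, 100, 50]
print(embedding_cost(5_000_000))  # estimated USD for 5M tokens
```

Each yielded batch would then be sent as one embeddings request.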
Rerank Model
persly-rerank-v1
Reranks documents by medical relevance to a query. Applied to the output of an initial retrieval step, it can improve the ordering of search results.
- Pricing: $2.00 per 1K searches (1 search = up to 100 documents)
- Max documents: 1,000 per request
- Best for: Search result optimization, RAG pipeline reranking, medical literature retrieval
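The pricing unit (1 search = up to 100 documents) and the request limit (1,000 documents) differ, so a cost estimate needs a rule for requests that carry more than 100 documents. The sketch below assumes that such a request is billed as multiple searches, rounded up to the next 100 documents. This page does not state that rule explicitly, so treat it as an assumption to verify against your invoices; the helper names are hypothetical.

```python
import math
from decimal import Decimal

PRICE_PER_THOUSAND_SEARCHES = Decimal("2.00")  # documented price in USD
DOCS_PER_SEARCH = 100  # documented: 1 search = up to 100 documents
MAX_DOCS_PER_REQUEST = 1_000  # documented request limit


def searches_for_request(num_documents: int) -> int:
    """Return the billable searches for one request.

    ASSUMPTION: every started block of 100 documents counts as one search.
    """
    if not 1 <= num_documents <= MAX_DOCS_PER_REQUEST:
        raise ValueError(f"num_documents must be between 1 and {MAX_DOCS_PER_REQUEST}")
    return math.ceil(num_documents / DOCS_PER_SEARCH)


def rerank_cost(num_requests: int, docs_per_request: int) -> Decimal:
    """Estimate the cost in USD of `num_requests` equally sized rerank requests."""
    searches = num_requests * searches_for_request(docs_per_request)
    return PRICE_PER_THOUSAND_SEARCHES * searches / 1000


print(searches_for_request(250))  # 3 under the rounding assumption
print(rerank_cost(1_000, 250))
```

Under this assumption, keeping candidate lists at or below 100 documents holds each request to a single billable search.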
Finder Model
persly-finder-v1
AI-powered search across medical information sources with optional answer generation.
| Depth | Pricing | Description |
|---|---|---|
| basic | $0.005/request | Quick search, fewer sources |
| advanced | $0.01/request | Deep search, more comprehensive results |
- Max results: configurable from 1 to 20 per request (default: 5)
- Best for: Medical information retrieval, evidence gathering, fact-checking
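The depth tiers and result limits above can be checked on the client side before a request is sent. The following is a minimal sketch under stated assumptions: the field names `model`, `depth`, and `max_results` are hypothetical and may not match the actual Finder request schema; only the prices, the 1–20 range, and the default of 5 come from this page.

```python
from decimal import Decimal

# Per-request prices in USD by search depth, copied from the table above.
FINDER_PRICE_PER_REQUEST = {
    "basic": Decimal("0.005"),
    "advanced": Decimal("0.01"),
}
MIN_RESULTS, MAX_RESULTS, DEFAULT_RESULTS = 1, 20, 5


def build_finder_params(depth: str = "basic", max_results: int = DEFAULT_RESULTS) -> dict:
    """Validate inputs and return request parameters (field names are assumed)."""
    if depth not in FINDER_PRICE_PER_REQUEST:
        raise ValueError(f"depth must be one of {sorted(FINDER_PRICE_PER_REQUEST)}")
    if not MIN_RESULTS <= max_results <= MAX_RESULTS:
        raise ValueError(f"max_results must be between {MIN_RESULTS} and {MAX_RESULTS}")
    return {"model": "persly-finder-v1", "depth": depth, "max_results": max_results}


def finder_cost(depth: str, requests: int) -> Decimal:
    """Estimate the cost in USD of `requests` Finder requests at one depth."""
    if requests < 0:
        raise ValueError("requests must be non-negative")
    return FINDER_PRICE_PER_REQUEST[depth] * requests


print(build_finder_params(depth="advanced", max_results=10))
print(finder_cost("basic", 1_000), finder_cost("advanced", 1_000))
```

Validating locally avoids paying for a request that would be rejected, and it makes the doubling in price from basic to advanced depth explicit when budgeting.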