FlashLabs Introduces OrcaRouter to Dify Marketplace
FlashLabs, based in Chiyoda, Tokyo, has announced the introduction of their innovative smart routing plugin, OrcaRouter, developed in partnership with Continuum AI, a leader in next-generation AI infrastructure from the United States. This groundbreaking plugin is now available on the Dify Marketplace, enabling users to seamlessly integrate access to over 200 large language models (LLMs) through a single API. By automatically selecting the most suitable model based on the prompt content, OrcaRouter promises to reduce AI inference costs by as much as 70% while maintaining high output quality.
Addressing Key Challenges
As a no-code/low-code platform rapidly gaining traction, Dify allows developers to create AI applications easily. However, using multiple LLM providers presents some significant challenges:
- - Complex Contract Management: Dealing with individual contracts and administration for multiple model providers.
- - Excess Costs from Fixed Model Operations: Operating with static models can lead to unnecessary expenditures.
- - Difficulties in Optimal Model Selection: Choosing the most suitable model based on the complexity of the prompt can be challenging.
- - Slow Response to New Model Releases: Keeping up with the latest models can be a cumbersome process.
OrcaRouter tackles these issues head-on by utilizing an adaptive routing technology that enables Dify users to select the optimal model automatically.
Overview of OrcaRouter and Dify Integration
The OrcaRouter plugin provides several key features within the Dify workflow:
- - Integration of 200+ LLMs through a Single API: Access models from over 15 providers, including OpenAI, Anthropic, Google, xAI, Meta, and DeepSeek, simplifying endpoint management, API keys, and billing.
- - Intelligent Routing Measured in Milliseconds: For each request, the system calculates which model can process the prompt according to specified quality standards at the lowest cost. For simple tasks like email summarization, a less complex model might suffice, whereas more intricate tasks like code generation would require a more advanced option.
- - Continuous Optimization: Quality signals from reference model scoring, downstream success rates, and user feedback are continually integrated into routing policies, enabling the system to improve both efficiency and cost-effectiveness weekly without needing any modifications on the client's side.
- - Real-Time Market Responsiveness: The system keeps an eye on provider pricing, latency, error rates, and new model launches. For example, if Anthropic releases a new Sonnet model, OrcaRouter can immediately reroute coding prompts to tap into this new resource.
Supported Models and Latest APIs
Dify users can immediately access the following latest models through OrcaRouter:
- - DeepSeek V4 Pro API
- - Anthropic Claude Opus 4.7 API
- - OpenAI GPT 5.5 API
- - Qwen3.7 Max
Out of these 200+ LLMs, Dify users benefit from automatic model selection that optimizes their workflow seamlessly.
Benefits of Implementing OrcaRouter
1.
Cost Reduction: Compared to fixed model operations, users can expect savings in inference expenses ranging from 47% to 71%, tailored to their workload.
2.
Quality Maintenance: End-users will experience no measurable degradation in quality, particularly in agent-related workloads that require both simple processing and advanced inference.
3.
Operational Efficiency: Existing Dify workflows remain functional with just a modification of the Base URL and API key; no redesign or additional procurement cycles are required.
4.
Transparency and Predictability: Each response model's performance is entirely visible, ensuring that users always select the most cost-effective model capable of fulfilling their requests.
How to Get Started with OrcaRouter
The OrcaRouter plugin is available today on the Dify Marketplace. Users can implement it by following these steps:
1. Install the OrcaRouter plugin from the Dify Marketplace.
2. Obtain the OrcaRouter API key at
orcarouter.ai.
3. Integrate the OrcaRouter node into your Dify workflow and set the routing strategies.
4. Existing prompts remain unchanged; the system automatically selects the best model for each task.
The usage fee for the pay-as-you-go plan includes just the provider's cost plus a standard platform fee, ensuring no markup is applied.
Comments from the Executives
Yoichi Hosoi, CEO of FlashLabs
"Dify is a platform aimed at democratizing AI application development without coding skills. However, many users face challenges with model selection when dealing with multiple LLMs. OrcaRouter’s adaptive routing automates this selection process, relieving Dify users of the need to focus on which model to choose, as the optimal one is selected automatically every time."
Continuum AI Representative
"The integration with Dify represents a strategic extension for OrcaRouter. The no-code/low-code philosophy of Dify complements the intelligent routing technologies behind OrcaRouter. This partnership allows Dify users to free themselves from intricate model management and focus on application development itself."
Company Overview
FlashLabs
FlashLabs is an AI research institution aimed at automating and ultimately achieving self-sufficiency in sales and customer experience processes. By merging the speed and accuracy of machine processing with human strategic insights, we deliver outcomes that surpass conventional methods. As the creator of the open-source voice model Chroma series, our technology is leveraged by developers and machine learning engineers at esteemed companies such as NTT Data, Xiaomi, Kakao, Intel, G42, and MBZUAI.
Visit us at: flashlabs.ai
Continuum AI
Continuum AI develops next-generation AI infrastructures, offering zero-markup LLM routing technologies such as OrcaRouter and OrcaRouter Lite to democratize AI foundational layers.
Learn more at: continuum01.ai
Contact For More Information
FlashLabs Inc. Marketing Department
Contact: Koki Kobayashi
Email:
[email protected]