Find Your Tool in 6 Steps
Result: 18

Archives: AI Tools

KIFOX - The Search Portal for AI Applications ?

Which AI tool is GDPR-compliant? Compare features, hosting, and data protection with real user reviews and an editorial score. Compare for free now.

Here, users can get an overview of which AI tools and LLMs are currently available on the market. Every user is free to submit a review and share their experience with AI applications, and can click a link to go directly to the provider’s website.

KIFOX is not an online store, but a comparison and information portal for AI applications.

Results: 18
Provider login

Build with the latest DeepSeek models

DeepSeek currently offers two LLM access points via its API: deepseek-chat and deepseek-reasoner. According to the official documentation, both currently correspond to DeepSeek-V3.2 with a 128K context window; deepseek-chat stands for Non-Thinking mode, deepseek-reasoner for Thinking mode. The API is OpenAI-compatible and supports, among other things, JSON Output, Tool Calls, Chat Prefix Completion, and, in the case of deepseek-chat, additionally FIM Completion.
DeepSeek API

LLM "Build with the latest DeepSeek models"

4.2/10 KIFOX Score – Limited
Other Token-based API usage Billing based on input, output, and cache-hit/cache-miss tokens.

deepseek-v4-flash Faster, more efficient model with Thinking and Non-Thinking mode, 1M context, JSON output, tool calls, and chat prefix completion.

deepseek-v4-pro More powerful model for more complex reasoning, coding, agents, and long contexts; also supports 1M context, JSON output, and tool calls.

OpenAI-/Anthropic-compatible API Usage via an OpenAI-compatible base URL or an Anthropic-compatible endpoint; suitable for existing SDKs and agent tools.

Open Weights / Self-hosting path DeepSeek-V4 was released as an open-weights family; self-hosting requires your own infrastructure and should be considered separately from the official API.
(0)

Link

"co-create intelligence with everyone"

MiniMax is a multimodal foundation model provider with models for text, coding, agents, speech, video, music, image, and multimodal applications. Its products include MiniMax Agent, Hailuo AI, MiniMax Audio, Talkie, and an Open Platform for developers and enterprises.
MiniMax

LLM “co-create intelligence with everyone”

6.1/10 KIFOX Score – Solid
Subscription Token Plan – Starter Subscription access for developers with access to MiniMax models via Token Plan API key; text models with a 5-hour rolling window, plan-dependent limited multimodal quotas.

Token Plan – Plus Expanded Token Plan with a higher M2.7 quota and additional daily multimodal quotas for Speech, Image, and other models.

Token Plan – Max Higher standard Token Plan with a larger request quota and expanded daily quotas for multimodal models.

Plus-Highspeed / Max-Highspeed / Ultra-Highspeed High-speed Token Plans with access to faster M2.7/M2.5 Highspeed models and expanded quotas for coding-related workflows.
Other Pay-as-you-go API Standard Open Platform API key for usage-based billing according to actual consumption; supports text, video, speech, image, and additional modalities.

Audio Subscription / Video Packages Separate packages for speech and video generation with product-specific quotas and billing logic.

Local / Private Deployment Open M2 model weights can be run locally or privately via Hugging Face and frameworks such as SGLang, vLLM, Transformers, ModelScope, or NVIDIA NIM.
(0)

Link

Anthropic offers current LLMs via the Claude API for language processing, reasoning, coding, agentic workflows, tool use, and document-centric tasks. According to the official model overview, all current Claude models support text and image input, text output, multilingual capabilities, and vision. For direct API access, Anthropic refers users to the Messages API; in addition, there are Managed Agents for longer-running tasks.Anthrophic Claude API Docs

LLM “highly performant, trustworthy, and intelligent AI platform”

7.1/10 KIFOX Score – Good
Free Anthropic documents that new users receive a small amount of free credits to test the API. However, this is not a classic permanent free plan in the SaaS sense, but rather a trial credit. Other Token-based Claude API Billing by model family such as Opus, Sonnet, and Haiku, as well as input, output, cache-write, and cache-read tokens.

Prompt Caching Reuse of large prompts, system instructions, or document contexts to reduce costs and latency. Batch API Asynchronous processing of large request volumes with a reduced billing model.

Long Context / 1M Context Available for certain current models; suitable for very large documents, codebases, and analysis contexts.

Data Residency / Third-Party Platforms Claude is also available via AWS Bedrock, Google Vertex AI, and Microsoft Foundry; regional pricing and data handling depend on the respective platform.
(0)

Link

OpenAI offers a broad range of models via the API for text generation, reasoning, coding, tool use, structured outputs, and document-centric workflows.

According to the official model overview, the current models support text and image input, text output, multilingual capabilities, and vision; they are available through the Responses API and client SDKs. For complex tasks, OpenAI recommends gpt-5.4 by default; for lower latency and cost, OpenAI points to gpt-5.4-mini and gpt-5.4-nano
Open AI

LLM “Access our frontier models and APIs.”

8.0/10 KIFOX Score – Very good
Free There is a free usage tier in the API rate limit system for users in permitted geographies Other Token-based API usage Billing by model, input/output tokens, cached input, audio/image/tool usage, and other usage-dependent factors.

Batch / Flex / Priority / Scale Tier Options for controlling costs and latency for larger or plannable workloads.

Fine-Tuning / Evals / Tools / Agents Additional API features for customization, evaluation, agents, web search, file search, code interpreter, realtime, and structured outputs.

Data Residency / ZDR / EKM Enterprise-grade data controls with regional storage/processing, Zero Data Retention or Modified Abuse Monitoring, and external key management.
(0)

Link

“Truly usable and practical AI”

Tencent Hunyuan is Tencent Cloud’s AI model family. It includes text models, reasoning models, vision, video understanding, image generation, translation, 3D generation, and open-source models. Tencent positions Hunyuan for content creation, mathematics, code, dialogue, enterprise scenarios, and multimodal workflows.
Tencent Hunyuan / Tencent HY

LLM “Truly usable and practical AI”

4.3/10 KIFOX Score – Limited
Free Free Resource Package / Test Quota
Upon initial activation of Tencent HY Text Generation Global, a one-time free test quota is provided as a Resource Package; after it is used up or expires, postpaid activation is required.
Other Token Postpaid / Pay-as-you-go API billing based on token consumption for Hunyuan text functions; billing via Tencent Cloud according to usage and with postpaid enabled.

Tencent HY 3D Global Separate product for 3D generation from text, image, or sketch with its own billing and API/cloud usage.

Enterprise / Tencent Cloud Agreements Individual cloud, contractual, and compliance setups are possible via Tencent Cloud; specific terms and data regions must be reviewed contractually.
(0)

Link

"Trustworthy artificial intelligence that powers humanity towards superproductivity"

AI21 Labs is an Israeli provider of large language models and AI orchestration systems for enterprises. Its core product in the model space is the Jamba family, a hybrid SSM/Transformer model family for long contexts, RAG, question-answering systems, document processing, and secure enterprise deployments. In addition, with Maestro, AI21 offers a model-agnostic orchestration system for validated RAG agents and complex business tasks.
AI21 Labs

LLM "Trustworthy artificial intelligence that powers humanity towards superproductivity"

7.9/10 KIFOX Score – Good
Free New accounts receive time-limited trial credits for AI21 Platform, APIs, SDK, and Playground. Truly usable for testing, prototyping, and evaluation; billing or a paid plan is required for continuous production use. Subscription Pay As You Go: usage-based access to Foundation Model APIs, SDK, and unlimited seats.

Custom Plan: includes pay-as-you-go features plus volume agreements, premium rate limits, private cloud hosting, priority support, a dedicated account manager, and AI consulting. No direct prices listed.
Other AI21 uses token-based API billing, custom payment/enterprise plans, and cloud provider billing through partners such as AWS, Microsoft Azure, Google Cloud / Vertex AI Model Garden, or SageMaker/Bedrock. Additionally, self-deployment, fine-tuning, quantization, and custom AI systems are relevant, depending on the contract and infrastructure. No direct prices listed.
(0)

Link

Command A is Cohere’s most powerful enterprise LLM for real business tasks such as tool use, retrieval-augmented generation, agents, and multilingual workflows.

The model has 111 billion parameters, supports 23 languages, features a 256k context window, and according to Cohere is designed for a comparatively low inference footprint.
Command A

LLM “Our largest, most performant model, ideal for building enterprise agents with a low compute footprint.” - “Max performance, minimal compute”

8.3/10 KIFOX Score – Very good
Free Yes, limited. Publicly primarily API/enterprise use; free trial or evaluation access may depend on the contract/account. Other API Usage Model access via the Cohere API, usage-based billing by model and tokens.

Enterprise / Private Deployment VPC, on-premises, or air-gapped deployment for companies with strict data protection, security, and data residency requirements.

North / Compass / Embed / Rerank Complementary Cohere products for agents, enterprise search, embeddings, and retrieval
(0)

Link

Inspiring AGI to Benefit Humanity - free AI chatbot & agent powered by GLM

Zhipu AI is a Chinese large language model provider that operates internationally under the name Z.ai. At its core is the GLM model family for text, reasoning, agents, coding, vision, OCR, image, video, and audio capabilities. The platform offers API access, coding plans, web search, Translation Agent, and Slide/Poster Agent.
Zhipu AI / Z.ai – GLM

LLM "Inspiring AGI to Benefit Humanity - free AI chatbot & agent powered by GLM"

6.7/10 KIFOX Score – Solid
Free GLM-4.5-Flash / free quotas
Z.AI describes GLM-4.5-Flash as a free model variant for reasoning, coding, and agents; in addition, free trial quotas may be available depending on the platform status.
Subscription GLM Coding Plan
Personally assigned coding subscription for officially supported coding tools; not intended for general API use, resale, or use by third parties.
Other API Usage / Usage-Based Billing API access to GLM models, SDKs, OpenAI-compatible usage, streaming, function calling, structured output, context caching, and tool usage; billing is usage-based depending on the model and platform rules.

Enterprise / Individual Agreements Separate written agreements are possible; no confirmed information is available regarding standardized EU enterprise plans or EU hosting.
(0)

Link

Amazon Nova is Amazon's own family of foundation models for text, image/video understanding, document analysis, agents, tool use, and speech.

Nova is used via Amazon Bedrock APIs, in particular InvokeModel, InvokeModelWithResponseStream, Converse, ConverseStream, and, for Sonic, via bidirectional streaming.
Amazon Nova API

LLM “frontier intelligence” - “industry-leading price-performance”

7.4/10 KIFOX Score – Good
Free AWS shows “Get started for free,” but for Amazon Nova/Bedrock, publicly documented billing is primarily usage-based; specific free quotas depend on AWS offerings, region, and account. Other On-Demand / Standard Tier Usage-based inference by model, modality, and tokens or image/video/special usage.

Flex / Priority / Reserved Tiers Bedrock supports different service tiers to manage cost, availability, latency, and throughput.

Batch Inference Asynchronous processing of larger workloads; according to AWS, cheaper than On-Demand for selected models.

Provisioned Throughput Reserved capacity for higher or predictable throughput; required for certain custom or production scenarios.

Fine-Tuning / Custom Models Customization using your own training/validation data; use of individual models typically via provisioned capacity.

Guardrails / Knowledge Bases / Agents / Prompt Routing Additional Bedrock features for security, RAG, agent orchestration, model routing, and governance.
(0)

Link

"Where knowledge begins"

Perplexity Sonar is a family of models for AI search, web-based answers, research, reasoning, and deep research. The API is particularly strong in current facts, citations, product comparisons, summaries, and research workflows.
Perplexity Sonar

LLM “Where knowledge begins”

7.6/10 KIFOX Score – Good
(0)

Link

xAI offers Grok models via its API for text generation, reasoning, coding, tool use, document-centric workflows, and agentic research. The current docs focus primarily on Grok 4.20 as the new flagship, as well as on server-side tools such as Web Search, X Search, Code Execution, and Collections Search.

Additionally, xAI documents classic model-listing endpoints such as /v1/models and /v1/language-models.
xAI API – Grok

LLM “Build with Grok, the AI model designed to deliver truthful, insightful answers.”

7.4/10 KIFOX Score – Good
Other Token-based API usage Billing based on input, reasoning, completion, image, and cached prompt tokens per model.

Server-side Tools Additional billing for tool invocations; costs may increase with the complexity of agentic requests.

Credits / API Key API usage takes place via an xAI account, API key, and purchased credits.

Enterprise / ZDR Enterprise customers can use Zero Data Retention so that API requests and responses are not stored.

Voice / Imagine / Batch / Tools Additional product areas for real-time conversations, TTS/STT, image/video generation, batch processing, web search, and structured outputs.
(0)

Link

Google offers a family of models with the Gemini API for text generation, reasoning, coding, agent workflows, tool use, multimodal prompts, and document-centric processing.

For current API LLMs, Gemini 3.1 Pro Preview, Gemini 3 Flash Preview, Gemini 3.1 Flash-Lite Preview, Gemini 2.5 Pro, Gemini 2.5 Flash, and Gemini 2.5 Flash-Lite are particularly relevant. Older Gemini 2.0 Flash variants are still available, but are already marked as deprecated.
Google Gemini API

LLM “AI for every developer”

7.3/10 KIFOX Score – Good
Free Free or unpaid use with limits; content may be used for product improvement and should not contain sensitive or confidential data. Other Gemini API Paid Tier For production applications with higher limits, context caching, Batch API, access to advanced models, and without using content for product improvement.

Batch / Context Caching / Priority / Flex Additional billing and operational options for controlling cost, latency, and throughput.

Vertex AI / Google Cloud Enterprise-oriented operation with Cloud DPA, IAM, regional endpoints, data residency, monitoring, and zero-data-retention configurations.

Grounding / Tuning / Embeddings / Live API Advanced features for search, context enrichment, model customization, vector search, real-time audio, and multimodal applications.
(0)

Link

"Frontier intelligence, customized to you."

The Mistral API is the developer and enterprise interface for Mistral models.

Through Mistral AI Studio, companies and developers can use models via API, test prompts, build agents, implement RAG workflows, use fine-tuning, manage workspaces, and bill API usage. Mistral offers both open-weight and commercial/premier models.
Mistral API

LLM - build, customize, and deploy AI, your way

8.3/10 KIFOX Score – Very good
Free Le Chat Free Personal AI assistant for chat, search, learning, images, projects, memories, and connectors; not to be equated with productive API usage. Other API / La Plateforme Usage-based API for Mistral models, chat, embeddings, OCR, agents, coding, multimodal models, and developer workflows.

Self-Deployment / Open-Weight Models Selected models can be operated independently or via cloud/enterprise deployments; the range of features depends on the respective model.

Enterprise Private Deployment Customized private deployment for organizations with increased control, security, and scalability requirements.
(0)

Link

Alibaba Cloud Qwen is Alibaba Cloud's LLM/multimodal model family. Through Model Studio / DashScope, developers can use Qwen models via API, including text models, multimodal models, reasoning models, coding models, translation models, and open-source/open-weight variants. The API is OpenAI-compatible and can be used via different endpoints depending on the region.Alibaba Cloud Qwen API

LLM “one-stop model service platform”,

7.4/10 KIFOX Score – Good
Free Free quotas for certain models/regions; Free Quota applies only to real-time inference and not to batch calls, context cache, fine-tuning, deployment, or custom models. Other Pay-as-you-go / Model Invocation Usage-based billing by model, input/output tokens, thinking/non-thinking mode, region, and deployment mode.

Batch Calls Separate processing of large workloads; not covered by the Free Quota.

Context Cache Cache function to reduce repeated context costs; not covered by the Free Quota.

Fine-Tuning / Deployment / Custom Models Model customization and deployment of proprietary or fine-tuned models; billed separately and not covered by the Free Quota.

OpenAI-/Responses-compatible API Qwen models support OpenAI-compatible interfaces and the Responses API for agentic applications.
(0)

Link

“The AI community building the future.”

Hugging Face is not a single proprietary LLM provider, but a platform for hosting, discovering, distributing, evaluating, and deploying AI and LLM models. The Model Hub is used for storing, discovering, and using model checkpoints; LLMs can be used via Inference Providers, Inference Endpoints, or locally through libraries such as Transformers.
Hugging Face

LLM “The AI community building the future.”

7.6/10 KIFOX Score – Good
Free You can test API access with a free Hugging Face account. There are monthly free credits. According to the current Hugging Face documentation, free users receive monthly credits, currently listed as $0.10, subject to change. After that, you need additional credits or pay based on usage. Subscription PRO With Hugging Face PRO, you get significantly more included inference credits. The pricing page lists, among other things, 20× included inference credits for PRO; the Inference docs currently mention $2.00 in monthly credits for PRO users.

Team & Enterprise For organizations, there are Team and Enterprise. These plans also include Inference Provider benefits or credits per seat and enable centralized billing, limits, and administration. According to Hugging Face, Team/Enterprise organizations currently receive $2.00 per seat in monthly credits.
Other Pay-as-you-go If your credits are used up, you can continue making API requests by purchasing additional credits or paying based on usage. The costs depend on the specific model, provider, and usage.

Your own provider key In some cases, you can also use your own API keys from external providers. In that case, billing does not go through Hugging Face, but directly through the respective provider; according to the documentation, Hugging Face does not charge for this call.
(0)

Link

Llama is Meta's family of generative foundation models for text and, in part, image/text understanding.

Meta positions Llama as a flexibly deployable model series that can be fine-tuned, distilled, and deployed “anywhere”; this includes self-hosting, private cloud, and hosting through partners. Llama 4 brings native multimodality, while Llama 3.x continues to address important text, coding, translation, and agent use cases.
Meta Llama

LLM “Industry Leading, Open-Source AI”

6.9/10 KIFOX Score – Solid
Free Llama model weights / Download Llama models can be downloaded, fine-tuned, distilled, and self-hosted under the Meta license; infrastructure costs for self-hosting are incurred separately.

Meta Llama API Preview / Waitlist The Llama API is officially positioned via waitlist/login; I could not reliably substantiate a permanently freely usable public API free version with guaranteed limits.
Other Managed Llama API API access to current Llama models, API key, playground, SDKs, OpenAI-like integration, tool calling, and models such as Llama 4 Maverick/Scout according to the official Llama API page.

Self-hosting / own cloud / edge Operation of the model weights on your own infrastructure, with cloud providers, or locally; suitable for data protection, cost control, and individual optimization.

Cloud provider / third-party hosting Llama models are available through various cloud and inference providers; data protection, pricing, and server locations then depend on the respective provider.

Fine-tuning / distillation / Llama Stack Customization and integration into your own AI architectures, depending on the model license, infrastructure, and technical setup.
(0)

Link

"Seeking the optimal conversion from energy to intelligence"

Moonshot AI is the provider behind Kimi, an LLM and agent platform focused on long contexts, coding, deep research, documents, spreadsheets, slides, web search, and multimodal processing. The Kimi API offers models such as Kimi K2.6, K2.5, K2, and Moonshot-v1.
Moonshot AI

LLM “Seeking the optimal conversion from energy to intelligence”

4.3/10 KIFOX Score – Limited
Other Pay-as-you-go API Flexible, usage-based billing according to input and output tokens, model selection, and, where applicable, document/tool usage; suitable for developers and small teams.

Kimi K2.6 / K2.5 / K2 / Moonshot V1 Model families for multimodal or text-based tasks, long context, coding, agents, reasoning, and dialogue tasks.

Enterprise Solutions & Customization For medium-sized and large companies with flexible rate limits, multi-project deployments, support, SLA-oriented reliability, and individual agreements according to the platform page.
(0)

Link

Leading AI models on a secure platform operated in Germany.

DeutschlandGPT provides models such as GPT, Claude, Gemini, Llama, and Mistral through a single interface. Users can chat, analyze documents and images, perform real-time web searches, configure their own assistants or specialized applications, and integrate external business systems. The platform is designed for individual users, teams, companies, and public or data-sensitive organizations.
GermanyGPT

Leading AI models on a secure platform operated in Germany.

7.7/10 KIFOX Score – Good
Free Try it for free: A permanently free plan with a limited number of daily messages, custom prompts, and limited file and image analysis. Subscription Pro All leading AI models, projects, integrations, specialized applications, and higher usage limits.

5x Pro Pro features with significantly higher usage limits for more intensive individual use.

Business Pro features plus centralized user management, white labeling, prioritized support, and configurable retention periods; for SMBs and teams.

Enterprise Business features plus custom domain, SAML SSO, SCIM provisioning, Customer Success Manager, dedicated onboarding, invoice billing, and AI training.
Other Enterprise Trial Access Time-limited trial access for larger organizations, available by arrangement.

Germany GPT API Usage-based access to various AI models; billing depends on the model and token consumption.

Fair Use Model Paid platform plans have higher limits but are subject to rules against automated bulk usage and resale.
(0)

Link