Humaan.ai Introduction
Humaan.ai is an AI platform that offers human-like intelligence capabilities for app development. With just a few lines of code, developers can integrate vision, language, speech, and audio functionalities into their apps using AI building blocks. The platform is currently in private alpha, featuring state-of-the-art models and ready-to-use APIs. Humaan.ai Features
Foundation Models
Humaan.ai provides a range of foundation models that can be easily integrated into apps. These models are designed to enhance the app development process with human-like capabilities in various domains. Stable Diffusion: A generative model for image creation based on text prompts.
- Whisper: Converts spoken language to text, trained on 680k hours of multilingual data.
- AudioLM: A framework for high-quality audio generation with long-term consistency.
- GPT-Neo: A transformer-based language model inspired by the GPT architecture.
- Imagen: A text-to-image generative model that creates photorealistic images from input text. Advanced Capabilities
Humaan.ai also offers advanced capabilities that can be integrated into apps: DocumentAI: Extracts information from complex content across various document types and formats.
- Translate SQL: Translates natural language questions into SQL queries for relational databases.
- Summarization: Summarizes long text into shorter versions while retaining important information.
- Sentiment Analysis: Analyzes the emotional tone of messages in multiple languages.
- Body Pose: Tracks movements of a person or object from images or videos.
- Named Entity Recognition: Identifies and classifies named entities in unstructured text.
- Sound Classification: Identifies and categorizes different types of sounds or audio signals. Simplicity and Speed
Humaan.ai is designed to simplify and accelerate the integration of AI capabilities into apps. The platform offers a seamless integration process with AI-as-a-Service, enabling developers to focus on creating innovative solutions. REST API
Humaan.ai provides a REST API that allows developers to seamlessly integrate and infer models. The API supports various programming languages such as Python, Node.js, Go, and Kotlin.
Example code for using Humaan.ai REST API
import requests
endpoint = "https://inference.humaan.ai/openai/whisper-base" headers = { "Authorization": "Bearer humaan_****************", "Content-Type": "audio/mpeg" } open("audio.mp3", "rb") as audio_file: audio = audio_file.read() = requests.post(endpoint, headers=headers, data=audio) parse(response.text)
Humaan.ai offers a simplified fine-tuning service that allows developers to customize models for their specific applications. This no-code tool enables training, evaluating, and deploying customized models using their own datasets. The platform also provides serverless GPUs, eliminating the need for managing infrastructure. Flexible Integration
Humaan.ai provides a flexible way to integrate intelligence into workflows. Developers can create custom workflow apps tailored to their unique needs, streamlining processes and enhancing productivity. The platform offers modular blocks and advanced AI models to unlock creativity and redefine the work experience. Example Workflow
- **Input**: Invoice from Amazon S3
- **Inference Model**: DocumentAI
- **Task**: OCR text extraction
- **Notification**: Daily invoice report
- **Message**: Date, total invoice, and total amount
- **Database**: Invoices (PostgreSQL)
INSERT INTO invoices(date, customer, amount)
VALUES({{date}}, {{customer}}, {{amount}})
``` **Slack**: Daily invoice report
Humaan Workflow:
- Date: May 18, 2023
- Total invoice: 9
- Total amount: $4,190.00