Changelog

Stay up to date with the latest features, enhancements, and updates.

Subscribe for updates
7/25/2024

New models available

  • Llama 3.1 (via Groq, Together AI and Meta nodes):

    • World's largest and most capable open-source foundation model with 405 billion parameters (smaller versions are also available in Stack AI -> 70B and 8B for simpler tasks and faster execution)
    • Supports advanced use cases like long-form text summarization, multilingual conversational agents, and coding assistants, with a context length of 128K tokens.
  • Mistral Large 2

    • 123 billion-parameter model with a 128K-token context window** (less parameters than Llama 3.1 an similar performance 🤯)
    • Excels in code generation, mathematics, and multilingual tasks, outperforming many leading models in these areas.
  • GPT 4o mini

    • Cost-efficient small model that surpasses GPT-3.5 Turbo.
    • It offers superior textual intelligence and multimodal reasoning, with 128K-token context window.
    • Enables a broad range of tasks, including real-time text responses and applications requiring large context handling. Best for every day simple tasks
  • Gemini 1.5 Flash

    • Lightweight and fast model optimized for speed and efficiency (comparable to GPT 4o mini)
    • Breakthrough one-million-token context window, making it ideal for processing extensive video, audio, and large codebases.
  • Mistral nemo

    • Specialized model designed for high-precision tasks in scientific and technical domains, with a 128K-token context window.
    • It focuses on reducing hallucinations and improving reasoning capabilities, making it suitable for research and development applications.

Other features

  • Images to LLMs with vision: upload or copy paste images in the chat assistant for processing by GPT 4o or Anthropic Claude 3.5 Sonnet.
  • Google Search: select the country in which to perform the search.
7/5/2024

New Feature

  • Chart Generation in LLM Responses: Now you can enable chart generation directly through your LLM settings. Ideal for financial analysis, data exploration, and reporting, this feature allows for visually appealing charts rendered within the LLM's messages, enhancing data visualization and interpretation.
6/28/2024

New Features

  • Integration of Stable Diffusion 3: Medium, large, large turbo models.
  • YouTube Node: Selection of transcription languages.
  • Claude 3.5 Sonnet: Now available in AWS Bedrock.
  • Qwen 2 72b: Integrated via Together AI.
  • Charts in LLM Responses: Enable it in the settings of any LLM.

Other Enhancements

  • URL Node: Revamped for a better scraping experience.
  • Gmail Node: Now capable of reading email threads.
6/20/2024

New Features

  • Connections Manager

    • Create and manage connections in one place, eliminating the need to include credentials in every knowledge base for each project.
    • Add, refresh, remove, and test connections from the new Connections tab in Settings.
    • Choose from a list of available integrations in Stack AI and view details like creation date and status (e.g., healthy, unhealthy).
  • Deepgram Nova 2 for Medical Transcriptions

    • 20.5% improvement in recognizing medical terms with Nova 2, which is 5-40x faster than SOTA models.
    • Other Available Submodels in the Nova 2 family: finance, meeting, voicemail, automotive
    • Healthcare Use Cases:
      • SOAP Notes: Automatically generate SOAP notes from patient meeting recordings.
      • Patient Intake: Efficient and accurate patient data capture.
      • Clinical Documentation: Improve the quality and consistency of medical records.
    • Read our blog post on AI solutions in Healthcare.
  • Chat Assistant for OpenAI Downtime

    • Reliable Alternative: Use our own Chat Assistant at chat.stack-ai.com when OpenAI is down, powered by Anthropic, Groq, Gemini, Mistral, etc.
    • Consistent Functionality: Same functionality as ChatGPT, ensuring no performance issues.
    • Use this link

Other Enhancements

  • Download Docx: Download responses from your AI form directly as a Docx.
  • Batch Processing: Interface improvements for visualizing outputs and deleting all rows, available through the export tab.
  • Chat Assistant: Fixes made to mobile version and delete functionality.
  • Editable Submit Button: Customize the text on the submit button in the form export.

Other Important Announcements:

6/6/2024

New Features

  • GPT-4o on Azure

    • Dedicated Azure Instance: Access a dedicated Azure instance for GPT-4o, providing improved latency and security.
    • Easy Integration: Drag and drop an Azure node from the sidebar under the LLM section to boost your AI workflows.
    • API Key Integration: Integrate your own API keys if you have an Azure account.
  • Text-to-SQL Enhancements

  • Voice Assistant in Beta

    • Build a voice assistant similar to the OpenAI GPT-4o demo using your proprietary data.
    • Integration Steps:
      • Drag and drop audio-to-text and text-to-audio nodes from the sidebar.
      • In the builder, go to the interface tab and select the audio assistant as your export option.
    • This feature is in beta, and we welcome your feedback.

Enhancements

  • Video and Audio in RAG: Upload .mp4 and .mp3 files into your knowledge base for automatic transcription and integration.
  • Multi-Language Support for Deepgram: Utilize audio-to-text features in various languages.
  • Audio Upload in Forms: Upload audio files directly within the form interface.
  • Download Button: Directly download responses from your AI applications via the form interface.
  • Mistral Enhancements: All Mistral models now support JSON mode.
  • Improved Web Scraping: Enhanced to remove subpages limitations for broader website scraping.
  • Table + Search: New checks to automatically correct erroneous SQL queries.
  • Summarize Node: Now powered by the OpenAI GPT-4o model for faster processing.
  • Batch Processing: Interface improvements for easier batch processing.
  • Chat Assistant: Enhanced with links in descriptions for easy access to relevant information.
5/16/2024

Integrations

  • OpenAI's Latest Model: GPT-4o ("o" for "omni")

    • Faster operations and 50% more cost-efficient in API usage.
    • Maintains high performance in English and coding tasks and enhanced in non-English text processing.
    • Multimodal capabilities to handle and output text, audio, images, and video.
  • Groq

    • Groq is now accessible on Stack AI.
    • Specializes in Language Processing Units (LPUs) with ultra-low latency for AI inference.
    • Supports running open-source models like LLama-3 and Mixtral 8x7b efficiently.

Enhancements

  • Interface Redesign
    • The 'Export tab' has been renamed 'Interface'.
    • All favorite interfaces (Chat, Forms, Embedded chatbots, WhatsApp/SMS, Slack, Batch, API) are now more accessible.
    • User-friendly configurations and real-time previews of modifications.