AI Services Improvements Log   Updated!


LLM Comparison Table

July 2024: Online comparison table

Most capable currently accessible models according to the Chatbot Arena ⚔️ benchmarks leaderboard, early June 2024

| Model | Arena Elo | Coding | Longer Query | German | Code-focused needle haystack | Organization | License | Knowledge Cutoff |
| GPT-4o-2024-05-13 | 1287 | 1298 | 1304 | 1276 | 83 | OpenAI | Proprietary | 2023/10 |
| Gemini-Advanced-0514 | 1267 | 1257 | 1260 | 1259 | | Google | Proprietary | Online |
| Gemini-1.5-Pro-API-0514 | 1265 | 1272 | 1294 | | 49 | Google | Proprietary | 2023/11 |
| Gemini-1.5-Pro-API-0409-Preview | 1257 | 1233 | 1246 | | | Google | Proprietary | 2023/11 |
| GPT-4-Turbo-2024-04-09 | 1256 | 1266 | 1266 | 1251 | 78 | OpenAI | Proprietary | 2023/12 |
| GPT-4-1106-preview | 1251 | 1258 | 1248 | | | OpenAI | Proprietary | 2023/4 |
| Claude 3 Opus | 1249 | 1252 | 1269 | 1249 | 60 | Anthropic | Proprietary | 2023/8 |
| GPT-4-0125-preview | 1246 | 1246 | 1248 | | | OpenAI | Proprietary | 2023/12 |
| Yi-Large-preview | 1239 | 1247 | 1246 | | | 01 AI | Proprietary | Unknown |
| Gemini-1.5-Flash-API-0514 | 1231 | 1238 | 1223 | | 36 | Google | Proprietary | 2023/11 |
| Yi-Large | 1222 | 1245 | 1223 | 1217 | | 01 AI | Proprietary | Unknown |
| Bard (Gemini Pro) | 1208 | 1174 | 1131 | | | Google | Proprietary | Online |
| Llama-3-70b-Instruct | 1207 | 1202 | 1184 | 1160 | 35 | Meta | Llama 3 Community | 2023/12 |
| Claude 3 Sonnet | 1201 | 1216 | 1221 | 1199 | | Anthropic | Proprietary | 2023/8 |
| Reka-Core-20240501 | 1200 | 1192 | 1192 | 1192 | | Reka AI | Proprietary | Unknown |
| Command R+ | 1189 | 1167 | 1213 | 1187 | 30 | Cohere | CC-BY-NC-4.0 | 2024/3 |
| Qwen2-72B-Instruct | 1187 | 1189 | 1199 | 1136 | | Alibaba | Qianwen LICENSE | 2024/6 |
| Mixtral-8x22b-Instruct-v0.1 | 1146 | 1153 | 1156 | 1130 | | Mistral | Apache 2.0 | 2024/4 |
| GPT-3.5-Turbo-0613 | 1117 | 1137 | 1124 | 1117 | 36 | OpenAI | Proprietary | 2021/9 |
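
To give a feel for what a gap in Arena Elo means, here is a minimal Python sketch. It assumes the standard Elo logistic formula (base 10, scale 400) and ignores ties; the source does not say how Chatbot Arena computes its ratings, so the function name, the scale parameter and the choice of models are illustrative assumptions, not the leaderboard's own method. The ratings are copied from the Arena Elo column.

```python
def expected_win_rate(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected score of A against B under the standard Elo logistic model.

    Assumption: base-10 logistic curve with a scale of 400 points, ties ignored.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Arena Elo values copied from the June 2024 table (overall column only).
arena_elo = {
    "GPT-4o-2024-05-13": 1287,
    "Claude 3 Opus": 1249,
    "Llama-3-70b-Instruct": 1207,
    "GPT-3.5-Turbo-0613": 1117,
}

if __name__ == "__main__":
    top = "GPT-4o-2024-05-13"
    for name, rating in arena_elo.items():
        if name == top:
            continue
        # Probability that the top model's answer is preferred in a pairwise vote.
        p = expected_win_rate(arena_elo[top], rating)
        print(f"{top} vs {name}: {p:.2f}")
```

Under these assumptions, the 170-point gap between GPT-4o-2024-05-13 and GPT-3.5-Turbo-0613 corresponds to an expected preference rate of about 0.73, while the 38-point gap to Claude 3 Opus is much closer to a coin flip.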

See also: Aider LLM Leaderboards (editing code); (NEW FOSS model) DeepSeek Coder V2 (no diff edit, hf, Aider 75%, HumanEval 90%); Artificial Analysis (quality, speed, price, context size, etc.); Big Code Leaderboard.

Older table, free/small models possibly added. Source: Perplexity

| | GPT-4o | Gemini Advanced 1.5 Pro | Llama-3-70b-Instruct | MS Command R+ | Claude 3.0 Sonnet | GPT-3.5T |
| Access/license | Account (limited) or subscription $20/month | Subscription €22/month | Meta license / Groq | CC-BY-NC-4.0 | Account (no EU) | Account |
| Latest version/comparison | 2024-05-13 (2023-10) | 2024-05-14 (online) | 2023-12 | | 2024-03-0x | 2023-07-12? |
| Arena Elo | 1287 | 1267 | 1208 | 1189 | 1202 | 1117 |
| Arena Coding | 1299 | 1258 | 1202 | 1167 | 1216 | 1137 |
| Context (total input+output) | 128k | 1000k | 8k | 128k | 100-200k | 4/16k |
| Image recognition | Limited | Limited | ? | ? | x | x |
| Coding | Yes | Yes | Yes | Yes | | |
| Accessible in the EU | Yes | Yes | Yes | | VPN | Yes |
| Needle in Haystack [1][2] | 35% | 41% | 36% | 42% | 35% | 35% |

Most capable currently accessible models according to the Chatbot Arena ⚔️ benchmarks leaderboard, early March 2024; free/small models possibly added

| | GPT-4T | Gemini Pro | Mistral Medium | Mixtral 8x7b | Claude 3.0 Sonnet | GPT-3.5T |
| Access/license | Subscription $20/month | Google account | | Apache 2.0 | Account (no EU) | Account |
| Latest version/comparison | 2023-11-06 | 2024-01-30 | 2023-12-11 | 2023-12-11 | 2024-03-0x | 2023-07-12? |
| Arena Elo | 1250 | 1200 | 1150 | 1115 | 1180 | 1115 |
| Context (total input+output) | 4-128K | 2K | 1K | 0.5-32K | 200K | |
| Image recognition | Limited | Limited | ? | ? | x | x |
| Coding | Yes | Yes | Limited | Limited? | Yes | Yes |
| Accessible in the EU | Yes | Yes | Yes | Yes | Yes | Yes |
| Needle in Haystack [1][2] | 64K 100%; >100K <30% | Bad (Instruct)? | | 32K >90% | 200K Opus >99% | |

June 2024: Operations that are not possible
Claude 3.5 Sonnet: Generate an SVG with text from a PNG map

Science / Research Help

Jan. 2024: In English, with a note on each service and whether a login is required. From Andy Stapleton @ YT

  1. Consensus [1]: An AI-powered search engine that finds and summarizes scientific research papers. To use Consensus, you need to create an account [1].
  2. Scholarly Assistant [2]: An AI assistant developed by Jenni AI to help with academic writing. It offers both free and paid features [2].
  3. Paper Interpreter [3]: A tool that helps users understand academic papers. The usage details, including whether it’s free or requires a login, are not specified in the search results [3].
  4. Scholar AI [4]: A website designed to help students with note-taking and learning. It offers both free and paid features [4].
  5. Scholar GPT [5]: An AI model developed by OpenAI. It offers a subscription service for $20 per month, promising faster response times and priority access to new features. A free trial version is also available [5].
  6. Academic Assistant Pro [6]: Acts as a professional academic assistant, providing support with a professorial touch. The usage details, including whether it’s free or requires a login, are not specified in the search results [6].
  7. Academic Research Reviewer [7]: A literature review generator that uses AI to provide comprehensive and meticulously curated literature reviews. The usage details, including whether it’s free or requires a login, are not specified in the search results [7].
  8. Herisa or Urisa [1]: There may be some confusion here. No AI service named “Herisa” could be found. URISA, however, is an association for GIS professionals [1]. It doesn’t seem to be an AI tool for academic research.
  9. OpenRead [2][3][4]: An AI-driven platform that enables users to efficiently access and interact with papers. It offers a Paper Q&A tool to quickly answer queries, a Paper Espresso feature to build and refine literature reviews, an AI-powered reading assistant, a low-code editor, and an effective notes system [3]. It provides thousands of free, pre-made journal paper templates for getting started quickly [3]. Most features on OpenRead can be unlocked for $5/month [4].
  10. Explain Paper [5][6][7][8]: ExplainPaper is a website that uses AI to summarize scientific research papers. Users can upload a paper, highlight confusing text, and get an explanation [5]. It makes complex scientific papers easier to understand [8].
  11. PaperBrain [9][10][11]: A free AI tool that allows users to explore and understand research papers better [10]. Users search for a topic, and it shows results relevant to their search [10].
  12. Einblick [12][13][14]: An AI-native data notebook that writes and fixes code, plots charts, builds models, and much more [12]. Whether it’s free or requires a login is not specified [12].
  13. Tavaly [15]: An AI tool for rapid insights and comprehensive research [15].

LLM (Large Language Models: text queries/answers)
