LLM Models¶

Proprietary Models¶

Proprietary models are developed and maintained by specific companies or organizations. They often come with advanced features, optimizations, and support, but may require a subscription or usage fees.

Privacy Notice for Proprietary Models

Interactions with proprietary models are typically sent to and stored on the provider's servers. Do not share sensitive data such as:

Personal identifying information
Private company data
Confidential research
Protected health information
Financial data
Authentication credentials

For sensitive data, consider using open source models that can be run locally instead.
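
When a prompt still has to go to a proprietary model, a pre-send check can catch a few obvious cases from the list above. The sketch below is illustrative only: the `redact` helper and its three patterns are hypothetical, cover only emails, US-style social security numbers, and keys with an `sk-`/`pk-` prefix, and are no substitute for keeping sensitive data local.

```python
import re

# Hypothetical patterns for illustration; they catch a few obvious cases
# and are NOT a complete sensitive-data detector.
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "API_KEY": re.compile(r"\b(?:sk|pk)-[A-Za-z0-9]{16,}\b"),
}


def redact(text: str) -> tuple[str, list[str]]:
    """Replace each match with a [LABEL] placeholder and list the labels that fired."""
    found = []
    for label, pattern in PATTERNS.items():
        # subn returns the rewritten text and the number of replacements made
        text, count = pattern.subn(f"[{label}]", text)
        if count:
            found.append(label)
    return text, found


if __name__ == "__main__":
    prompt = "Contact jane.doe@example.org, key sk-abcdefghijklmnop1234"
    cleaned, labels = redact(prompt)
    print(cleaned)
    print(labels)
```

A non-empty label list is a signal to review the prompt by hand before sending it; an empty list does not mean the prompt is safe.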

GPT-4o mini¶

Provider: OpenAI ChatGPT

General purpose, decent context length, cheap, and fast.

Good for:

Quick research note organization and summarization
Basic code debugging and documentation
Initial research question brainstorming
Literature search query refinement

Gemini 1.5 Deep Research¶

Provider: Google Gemini

General purpose, very long context length, but requires a paid subscription.

Good for:

Searching multiple research papers simultaneously
Finding subtle connections across large bodies of research
Deeper literature reviews

Gemini 2.0 Flash¶

Provider: Google Gemini

General purpose, very long context length, free at the time of writing, and extremely fast.

Good for:

Real-time data analysis feedback during experiments
Rapid research presentation preparation
Quick validation of methodology sections
Interactive exploratory data analysis

Claude 3.5 Sonnet¶

Provider: Anthropic Claude

Excels at coding and has a long context length, but the number of messages is limited.

Good for:

Complex research software architecture design
Writing, editing, and executing code with sophisticated reasoning and troubleshooting

Mistral Large¶

Provider: Mistral AI

Excels at multi-modal tasks, with a long context length and fast responses. Has a free tier.

Good for:

Blazing-fast responses (~1000 words/sec)
Advanced document and image analysis
Local code execution and data exploration
Top-tier image generation

Open Source Models¶

Open source models make their weights, and in some cases their architecture and training code, publicly available. They can be run locally on personal hardware or deployed to cloud infrastructure. Most local deployments use quantization to reduce model size and memory requirements. Common options include GGUF, the file format that succeeded GGML, and AWQ, a weight quantization method.
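
To get a feel for why quantization matters on personal hardware, the rough arithmetic is parameters × bits per weight ÷ 8 bytes. The sketch below applies that to the three parameter counts listed in this section. The `weight_memory_gb` helper is a hypothetical name, and the figures cover the weights only: real quantized files run somewhat larger because of scaling factors and metadata, and inference needs extra memory for the context.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    bytes_total = params_billions * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9


if __name__ == "__main__":
    # Parameter counts (in billions) taken from the model names in this section
    for size in (70, 24, 7):
        fp16 = weight_memory_gb(size, 16)  # unquantized 16-bit weights
        q4 = weight_memory_gb(size, 4)     # 4-bit quantized weights
        print(f"{size}B: ~{fp16:.0f} GB at 16-bit, ~{q4:.1f} GB at 4-bit")
```

By this estimate a 7B model drops from about 14 GB to about 3.5 GB at 4-bit, which is the difference between needing a workstation GPU and fitting on a typical laptop.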

Advantages:

Complete control over model deployment and inference
Ability to fine-tune or modify the model for specific use cases
No usage fees beyond infrastructure costs
Privacy benefits since data stays local
Community-driven improvements and customizations

You can use these models through offline gateways.

DeepSeek R1 70B¶

Provider: DeepSeek

Good at reasoning, with a long context length, and free.

Good for:

Theoretical research development
Complex mathematical problem-solving
Research methodology validation
Hypothesis refinement and testing

Qwen 2.5 Coder 7B¶

Provider: Qwen

Good at coding, decent context length.

Good for:

Efficient research automation scripts
Data preprocessing pipelines
Statistical analysis implementations
Scientific visualization code

Mistral Small 24B¶

Provider: Mistral AI

Good at reasoning, decent context length.

Good for:

Logical argument analysis
Experimental design critique
Research bias identification
Methodology comparison