170 blog posts published by month since the start of 2024. Start from a different year:

Blog URL
Posts year-to-date
8 (30 posts by this month last year.)
Average posts per month since 2024
7.1

Post details (2024 to today)

Title Author Date Word count HN points
Encord Monthly Computer Vision Wrap: April Industry Newsletter Stephen Oladele Apr 30, 2024 1035 -
Grok-1.5 Vision: First Multimodal Model from Elon Musk’s xAI Stephen Oladele Apr 16, 2024 111812 -
Top 8 Alternatives to the Open AI CLIP Model Haziqa Sajid Apr 19, 2024 2277 -
Announcing HTJ2K Support for DICOM Files in Encord Akruti Acharya Feb 16, 2024 530 -
Top 8 Applications of Computer Vision in Robotics Haziqa Sajid Jan 11, 2024 2428 -
5 Questions to Ask When Evaluating a Video Annotation Tool Haziqa Sajid Mar 08, 2024 2704 -
Top 10 Data Annotation and Data Labeling Companies [2024] Haziqa Sajid Feb 23, 2024 1998 -
Data-Centric AI: Implement a Data Centered Approach to Your ML Pipeline Akruti Acharya Jan 11, 2024 1391 -
Intelligent Process Automation Vs. Robotic Process Automation: Key Differences David Babuschkin Jun 10, 2024 2414 -
Knowledge Distillation: A Guide to Distilling Knowledge in a Neural Network Haziqa Sajid May 10, 2024 4073 -
AGV vs. AMRs for Warehouse Automation: What's the Key Difference? Nikolaj Buhl Jun 26, 2024 1924 -
Gemini 1.5: Google's Generative AI Model with Mixture of Experts Architecture Stephen Oladele Feb 17, 2024 2924 -
Top 6 Computer Vision Data Management Tools Haziqa Sajid Jan 31, 2024 1852 -
5 Best V7 Alternatives in 2024 Nikolaj Buhl Jan 22, 2024 1205 -
Ray-Ban Meta Smart Glasses are Getting an Upgrade with Multimodal AI Akruti Acharya Apr 26, 2024 433 -
Introduction to Krippendorff's Alpha: Inter-Annotator Data Reliability Metric in ML Stephen Oladele Jan 08, 2024 2979 -
Announcing the launch of Advanced Video Curation Nikolaj Buhl Apr 24, 2024 210 -
Llama 3V: Multimodal Model 100x Smaller than GPT-4 Stephen Oladele May 30, 2024 1692 -
MM1: Apple’s Multimodal Large Language Models (MLLMs) Akruti Acharya Mar 26, 2024 2259 -
Comparative Analysis of YOLOv9 and YOLOv8 Using Custom Dataset on Encord Active Akruti Acharya Mar 01, 2024 1327 -
Top 10 Alternatives to Lightly AI [2024] David Babuschkin Apr 22, 2024 2366 -
Microsoft MORA: Multi-Agent Video Generation Framework Stephen Oladele Mar 26, 2024 3000 -
What is Continuous Validation? Stephen Oladele May 03, 2024 2126 -
Segment Anything Model 2 (SAM 2) & SA-V Dataset from Meta AI Akruti Acharya Jul 30, 2024 2612 -
Automate Text Labeling for Your Image Dataset: A Step-by-Step Guide Akruti Acharya Jun 28, 2024 456 -
Improving Data Quality Using End-to-End Data Pre-Processing Techniques in Encord Active Akruti Acharya Feb 03, 2024 4712 -
Meta’s Llama 3.1 Explained Akruti Acharya Jul 25, 2024 1757 -
Product Updates [January 2024] Justin Sharps Feb 15, 2024 969 -
Google’s Video Gaming Companion: Scalable Instructable Multiworld Agent [SIMA] Stephen Oladele Mar 16, 2024 3265 -
Few Shot Learning in Computer Vision: Approaches & Uses Haziqa Sajid Feb 16, 2024 3095 -
Fine-Tuning VLM: Enhancing Geo-Spatial Embeddings Akruti Acharya Apr 04, 2024 978 -
Top 5 Data Curation Tools for Videos Nikolaj Buhl Jun 07, 2024 2116 -
Claude 3 | AI Model Suite: Introducing Opus, Sonnet, and Haiku Akruti Acharya Mar 05, 2024 1978 -
Vision-based Localization: A Guide to VBL Techniques for GPS-denied Environments Haziqa Sajid Jun 17, 2024 3919 -
How to Pre-Label Your Data with GPT-4o David Babuschkin Jun 26, 2024 809 -
Top 9 Tools for Generative AI Model Validation in Computer Vision Stephen Oladele Mar 06, 2024 2814 -
The Python Developer's Toolkit for PDF Processing Akruti Acharya Jul 17, 2024 760 -
Multiplanar Reconstruction (MPR) in the DICOM Editor David Babuschkin Feb 12, 2024 201 -
Apple Vision PRO - Extending Reality to Radiology Akruti Acharya Feb 22, 2024 3248 -
How Poor Data is Killing Your Models and How to Fix It Akruti Acharya Jul 02, 2024 850 -
Overfitting in Machine Learning: ​​How to Detect and Avoid Overfitting in Computer Vision? Akruti Acharya Apr 19, 2024 2204 -
4 Reasons Why Computer Vision Models Fail in Production Stephen Oladele Apr 24, 2024 2471 -
Automatic Guided Vehicles: The Future of Machine Vision in Warehousing Haziqa Sajid May 28, 2024 2359 -
Top 12 Dimensionality Reduction Techniques for Machine Learning Stephen Oladele Mar 22, 2024 4643 -
Best Practices for Handling Unstructured Data Efficiently Haziqa Sajid May 03, 2024 3271 -
Announcing Auto-Segmentation Tracking For Video Akruti Acharya Mar 22, 2024 1263 -
Mistral Large Explained Akruti Acharya Feb 28, 2024 1269 -
10 Best 3D Slicer Alternatives in 2024 Haziqa Sajid Feb 09, 2024 1729 -
How SAM 2 and Encord Transforms Video Annotation Akruti Acharya Aug 01, 2024 1411 -
Diffusion Transformer (DiT) Models: A Beginner’s Guide Akruti Acharya Mar 18, 2024 3010 -
Panoptic Segmentation Updates in Encord Stephen Oladele Mar 06, 2024 1117 -
Intelligent Character Recognition: Process, Tools and Applications Stephen Oladele May 03, 2024 2046 -
Top 8 ITK-Snap Alternatives in 2024 Haziqa Sajid Mar 22, 2024 2007 -
Encord Monthly Wrap: June Industry Newsletter Stephen Oladele Jul 02, 2024 687 -
How to Use Semantic Search to Curate Images of Products with Encord Active Stephen Oladele Feb 16, 2024 1781 -
15 Interesting Github Repositories for Image Segmentation Haziqa Sajid Mar 15, 2024 2625 -
Announcing the launch of SAM 2 in Encord Justin Sharps Jul 31, 2024 325 -
How to Leverage Computer Vision in Warehouse Automation Haziqa Sajid Jul 03, 2024 2644 -
What is Robotic Process Automation (RPA)? Görkem Polat Mar 15, 2024 3244 -
Applications of Computer Vision in Logistics and Supply Chain (2024) Haziqa Sajid Jul 03, 2024 2254 -
Top 10 Multimodal Datasets Nikolaj Buhl Aug 15, 2024 2415 -
Encord Monthly Wrap: May Industry Newsletter Stephen Oladele May 29, 2024 762 -
Top 8 Use Cases of Computer Vision in Manufacturing Haziqa Sajid Jan 12, 2024 2793 -
Exploring Vision-based Robotic Arm Control with 6 Degrees of Freedom Akruti Acharya May 03, 2024 2036 -
Stable Diffusion 3: Multimodal Diffusion Transformer Model Explained Akruti Acharya Mar 05, 2024 2569 -
ONNX Standardized Format: The Universal Translator for AI Models Alexandre Bonnet Aug 15, 2024 2129 -
Phi-3: Microsoft’s Mini Language Model is Capable of Running on Your Phone Akruti Acharya Apr 25, 2024 1724 -
How to Use GPT-4o for Model Development with Encord Akruti Acharya May 17, 2024 1009 -
DataOps Vs MLOps: What's the Difference? Stephen Oladele Apr 19, 2024 1815 -
From Big Data to Smart Data: How to Manage, Clean and Curate Your Visual Datasets for AI Development Nikolaj Buhl Feb 01, 2024 113 -
Announcing Encord’s $30 million Series B funding Eric Landau Aug 13, 2024 961 -
YOLOv9: SOTA Object Detection Model Explained Akruti Acharya Feb 23, 2024 1862 -
GPT-4o vs. Gemini 1.5 Pro vs. Claude 3 Opus: Multimodal AI Model Comparison Stephen Oladele May 16, 2024 2903 -
Data Lake Explained: A Comprehensive Guide for ML Teams Stephen Oladele Mar 28, 2024 3739 -
Top Alternatives to Voxel51 Haziqa Sajid Jan 26, 2024 1762 -
Panoptic Segmentation Tools: Top 9 Tools to Explore in 2024 Haziqa Sajid Apr 10, 2024 2508 -
PPE Detection Using Computer Vision for Workplace Safety Nikolaj Buhl Jul 16, 2024 3390 -
Setting Up a Computer Vision Testing Platform Stephen Oladele Apr 09, 2024 3429 -
Introducing TTI-Eval: An Open-Source Library for Evaluating Text-to-Image Embedding Models Frederik Hvilshøj Jun 26, 2024 1279 -
Meta Imagine AI Just got an Impressive GIF Update Stephen Oladele May 13, 2024 1606 -
GPT-4 Vision Alternatives Stephen Oladele Jan 31, 2024 2706 -
Qwen-VL and Qwen-VL-Chat: Introduction to Alibaba’s AI Models Akruti Acharya Feb 29, 2024 2050 -
Encord Monthly Wrap: January Industry Newsletter Stephen Oladele Feb 02, 2024 987 -
Meta AI’s Ilama 3: The Most Awaited Intelligent AI-Assistant Stephen Oladele Apr 19, 2024 2947 -
Top 15 DICOM Viewers for Medical Imaging Haziqa Sajid Jan 18, 2024 2331 -
YOLO World Zero-shot Object Detection Model Explained Akruti Acharya Mar 11, 2024 1705 -
Top 12 CVAT Alternatives [2024] Stephen Oladele Apr 25, 2024 2153 -
Vision Language Models: Powering the next chapter in AI Justin Sharps Mar 01, 2024 84 -
Meet Shivant - Technical CSM at Encord Lavanya Cholaraju Jul 19, 2024 1009 -
Meta’s V-JEPA: Video Joint Embedding Predictive Architecture Explained Akruti Acharya Feb 16, 2024 1136 -
VGG Image Annotator Alternatives in 2024 Nikolaj Buhl Jun 21, 2024 2861 -
Video Data Curation Guide for Computer Vision Teams Stephen Oladele Jun 04, 2024 2237 -
Model Drift: Best Practices to Improve ML Model Performance Görkem Polat Jan 04, 2024 2970 -
Validating Model Performance Using Encord Active Stephen Oladele Mar 02, 2024 2361 -
Top 10 Video Object Tracking Algorithms in 2024 Haziqa Sajid Mar 08, 2024 3296 -
Top Text Annotation Tools in 2024: Features, Collaboration, and Industry Applications Nikolaj Buhl Jul 03, 2024 3026 -
ML Observability Tools: Arize AI Alternatives Alexandre Bonnet Apr 20, 2024 3134 -
CVPR 2024: Top Artificial Intelligence and Computer Vision Papers Accepted Akruti Acharya Jun 05, 2024 2312 -
Computer Vision in Agriculture: The Age of Agricultural Automation through Smart Farming Haziqa Sajid May 28, 2024 2166 -
A Guide to Machine Learning Model Observability Haziqa Sajid Jan 19, 2024 3137 -
How Have Foundation Models Redefined Computer Vision Using AI? Stephen Oladele May 01, 2024 2681 -
12 Best Supervisely Alternatives in 2024 Haziqa Sajid Apr 12, 2024 2409 -
Encord Monthly Wrap: March Industry Newsletter Stephen Oladele Apr 08, 2024 933 -
An Overview of the Machine Learning Lifecycle Sundeep Teki Feb 26, 2024 1891 -
Encord Monthly Wrap: February Industry Newsletter Stephen Oladele Mar 08, 2024 702 -
Dataset Distillation: Algorithm, Methods and Applications Haziqa Sajid Apr 26, 2024 2803 -
Top 10 Best AI Avatar Generators for Video in 2024 Nikolaj Buhl Jun 13, 2024 3946 -
Google’s MediaPipe Framework: Deploy Computer Vision Pipelines with Ease [2024] Nikolaj Buhl Jun 21, 2024 2357 -
How to Analyze Failure Modes of Object Detection Models for Debugging Stephen Oladele Feb 19, 2024 2959 -
Top 9 Alternatives to DeepChecks Haziqa Sajid Apr 07, 2024 2175 -
AI as a Service: The Ultimate AIaaS Guide for Business in 2024 Haziqa Sajid Jun 24, 2024 2712 -
Top 10 Open Source Computer Vision Repositories Nikolaj Buhl Mar 15, 2024 4295 -
OpenAI Releases New Text-to-Video Model, Sora Akruti Acharya Feb 15, 2024 1954 -
Announcing the launch of Consensus in Encord Workflows Nikolaj Buhl Apr 02, 2024 155 -
Visualizations in Databricks Haziqa Sajid Mar 28, 2024 2107 -
Top 10 Multimodal Models Haziqa Sajid Jul 16, 2024 3133 -
Machine Learning Trends & Stats for 2024 Frederik Hvilshøj Aug 16, 2024 1785 -
OpenAI o1: A New Era of AI Reasoning Akruti Acharya Sep 13, 2024 1207 -
OpenAI o1: A New Era of AI Reasoning Akruti Acharya Sep 18, 2024 1207 -
Key Insights from the Inaugural AI After Hours Ulrik Stig Hansen Sep 27, 2024 976 -
From Vision to Edge: Meta’s Llama 3.2 Explained Alexandre Bonnet Sep 30, 2024 1342 -
Meet Dillon - Commercial Associate at Encord Lavanya Cholaraju Oct 01, 2024 891 -
Manage And Curate Audio Data In Encord David Babuschkin Oct 04, 2024 557 -
NVLM 1.0: NVIDIA's Open-Source Multimodal AI Model Akruti Acharya Oct 07, 2024 1112 -
Apple’s MM1.5 Explained Akruti Acharya Oct 07, 2024 1352 -
Top 10 Multimodal Use Cases Nikolaj Buhl Oct 07, 2024 4933 -
Vision Fine-Tuning with OpenAI's GPT-4: A Step-by-Step Guide Akruti Acharya Oct 09, 2024 1496 -
Annotate Audio Data In Encord David Babuschkin Oct 11, 2024 666 -
Unifying AI Data Toolstack: How to Streamline Your AI Workflows Haziqa Sajid Oct 16, 2024 2194 -
CoTracker3: Simplified Point Tracking with Pseudo-Labeling by Meta AI Eric Landau Oct 18, 2024 1436 -
Understanding Meta’s Movie Gen Bench: New Generative AI model for Video and Audio Justin Sharps Oct 21, 2024 1799 -
Spirit LM: Meta AI’s Multimodal Model for Seamless Text and Speech Generation Ulrik Stig Hansen Oct 22, 2024 1681 -
SAM 2.1 Explained: Smarter Segmentation and Developer Tools For the Future Ulrik Stig Hansen Oct 22, 2024 1083 -
Top 9 Audio Annotation Tools Justin Sharps Oct 29, 2024 1559 -
The Ultimate Guide on How to Streamline AI Data Pipelines Eric Landau Nov 06, 2024 2209 -
Find the Best PDF Annotator Tool: List of Top Tools Eric Landau Nov 06, 2024 1890 -
Machine Learning Image Classification: A Comprehensive Guide for 2024 Eric Landau Nov 08, 2024 1786 -
Real-World Use Cases of Generative AI in Manufacturing Frederik Hvilshøj Nov 12, 2024 2348 -
Building a Generative AI Evaluation Framework Eric Landau Nov 13, 2024 2377 -
Streamlining LLM Data Workflows: A Deep Dive into Encord's Unified Platform Justin Sharps Nov 14, 2024 913 -
Encord is the world’s first fully multimodal AI data platform Eric Landau Nov 14, 2024 1405 -
Pixtral Large Explained Justin Sharps Nov 20, 2024 1187 -
Data Exploration Made Easy: Tools and Techniques for Better Insights Frederik Hvilshøj Nov 22, 2024 2377 -
Data Visualization 101: Key Tools for Understanding Your Data Frederik Hvilshøj Nov 21, 2024 4684 -
Llava-o1: A Vision-Language Reasoning Model Explained Eric Landau Nov 26, 2024 894 -
How to Manage Data Annotation Pipelines: A Guide to Building Scalable Medical AI Solutions Alexandre Bonnet Dec 02, 2024 2611 -
AI Metrics that Matter: A Guide to Assessing Generative AI Quality Alexandre Bonnet Dec 03, 2024 3802 -
How to Label and Analyze Multimodal Medical AI Data David Babuschkin Dec 04, 2024 1233 -
Why a PDF Text Extraction Software is Key for Quality AI Text Training Data Haziqa Sajid Dec 09, 2024 2548 -
Exploring Audio AI: From Sound Recognition to Intelligent Audio Editing Haziqa Sajid Dec 10, 2024 2276 -
AI Agents in Action: A Guide to Building Agentic AI Workflows Frederik Hvilshøj Dec 11, 2024 2856 -
A Guide to Speaker Recognition: How to Annotate Speech Ulrik Stig Hansen Dec 12, 2024 2125 -
How to Enhance Text AI Quality with Advanced Text Annotation Techniques Alexandre Bonnet Dec 13, 2024 2336 -
Key Features to Look for in an Image Labeling Tool Alexandre Bonnet Dec 17, 2024 2010 -
Exploring Google DeepMind's Latest AI Innovations: Gemini 2.0, Veo 2, and Imagen 3 Ulrik Stig Hansen Dec 19, 2024 1015 -
How to Implement Audio File Classification: Categorize and Annotate Audio Files Alexandre Bonnet Dec 20, 2024 2585 -
What Is Named Entity Recognition? Selecting the Best Tool to Transform Your Model Training Data Alexandre Bonnet Dec 19, 2024 2791 -
PDF OCR: Converting PDFs into Searchable Text Haziqa Sajid Dec 20, 2024 2308 -
Web Agents and LLMs: How AI Agents Navigate the Web and Process Information Alexandre Bonnet Dec 23, 2024 1859 -
Recap 2024 - An Epic Foundational Year Ulrik Stig Hansen Dec 23, 2024 1352 -
Understanding Multiagent Systems: How AI Systems Coordinate and Collaborate Alexandre Bonnet Dec 30, 2024 2215 -
Best Practices for Data Versioning for Building Successful ML Models Haziqa Sajid Dec 31, 2024 2204 -
Data Visibility & Traceability: How to Build Robust AI Models Haziqa Sajid Jan 03, 2025 2459 -
Top Computer Vision Models: Comparing the Best CV Models Haziqa Sajid Jan 10, 2025 2358 -
Teaching Machines to Read: Advances in Text Classification Techniques Alexandre Bonnet Jan 16, 2025 4450 -
What is Natural Language Search? How AI is Transforming Search Alexandre Bonnet Jan 21, 2025 2798 -
Data Classification 101: Structuring the Building Blocks of Machine Learning Akruti Acharya Jan 20, 2025 1917 -
Everything You Need to Know About RAG Pipelines for Smarter AI Models Eric Landau Jan 20, 2025 2086 -
Providing Computer Vision Infrastructure for Project Stormcloud Ulrik Stig Hansen Jan 22, 2025 359 -
Scaling Conversations with AI: Challenges and Opportunities Haziqa Sajid Jan 23, 2025 2121 -