Saturday, February 1, 2025

GenAI and LLMs development, trends and implications (20. - 26.1.2025)

What’s the real ROI of AI in 2025?

Google releases experimental AI reasoning model - Gemini 2.0 Flash Thinking Experimental
DeepSeek open-sources DeepSeek-V3, a 671B parameter mixture of experts LLM
Nvidia Ingest aims to make it easier to extract structured information from documents
Microsoft Research unveils rStar-Math, advancing mathematical reasoning in Small Language Models
Microsoft Phi-4 is a Small Language Model specialized for complex math reasoning
Amazon Bedrock introduces Multi-Agent Systems (MAS) with open-source framework Integration
Luma AI’s Ray2 video model is now available in Amazon Bedrock

Want to integrate AI into your business? Fine-tuning won’t cut it
Building successful AI Apps: The dos and don’ts
Agentic Mesh: Towards enterprise-grade agents

Advancing AI reasoning: Meta-CoT and system 2 thinking

Choose a database with a hybrid vector search for AI apps

A framework for building micro metrics for LLM system evaluation

Why LLMs suck at ASCII art
Large Language Models: A short introduction

Human minds vs. machine learning models - exploring the parallels and differences between psychology and machine learning
Understanding emergent capabilities in LLMs - lessons from biological systems

Chain-of-Thought Prompting - a comprehensive analysis of reasoning techniques in Large Language Models

RAG isn’t immune to LLM hallucination

Designing, building & deploying an AI chat app from scratch - part 1 and part 2
A guide to deploying AI for real-time content moderation
Real-time data streaming with AI

How LLMs are going to change code generation in modern IDEs
Meet Junie, your coding agent by JetBrains
"Fix with AI" button to automate Playwright test fixes
Collaborative Intelligence - maximizing human-AI partnerships in the workplace

Building effective agents with Spring AI (Part 1)
Fresh data for AI with Spring AI function calls
Powering LLMs with Apache Camel and LangChain4j

Saturday, January 25, 2025

GenAI and LLMs development, trends and implications (13. - 19.1.2025)

Prompt engineering has become an essential skill for working effectively with large language models (LLMs) - guide on the best prompt engineering books
Google unveiled PaLiGeMMA 2 - a family of vision-language models (VLM)
NVIDIA’s announces DIGITS - its first personal AI computer

Projects like AYA Expanse are exploring multilingual capabilities

Combining local and cloud models to build a multimodal AI assistant answering complex image questions, with the option to run everything locally

Importance of robust system memory as a key to personalized AI intelligence
Building reliable AI applications - LLM routing

Microsoft's framework for AI-driven cloud operations - AIOpsLab
Introducing Google's Vertex AI RAG engine
Enterprise RAG in Amazon Bedrock - learn details of Amazon Bedrock KnowledgeBases capability

Real-world applications and best practices using Azure AI and GPT-4
Developing an AI-powered smart guide for business planning & entrepreneurship

Supercharging RAG with MAS (Multi-Agent System)

The rise of reasoner models - scaling test-time compute
Advancing complex medical reasoning with HuatuoGPT-o1

Major LLMs have the capability to pursue hidden goals

And the future: