Connect with us

Kunal Kejriwal

"An engineer by profession, a writer by heart". Kunal is a technical writer with a deep love & understanding of AI and ML, dedicated to simplifying complex concepts in these fields through his engaging and informative documentation.

Artificial Intelligence1 week ago
DIAMOND: Visual Details Matter in Atari and Diffusion for World Modeling
It was in 2018, when the idea of reinforcement learning in the context of a neural network world model was first introduced, and soon, this fundamental...
Artificial Intelligence1 week ago
In-Paint3D: Image Generation using Lightning Less Diffusion Models
The advent of deep generative AI models has significantly accelerated the development of AI with remarkable capabilities in natural language generation, 3D generation, image generation, and...
Artificial Intelligence2 weeks ago
MARKLLM: An Open-Source Toolkit for LLM Watermarking
LLM watermarking, which integrates imperceptible yet detectable signals within model outputs to identify text generated by LLMs, is vital for preventing the misuse of large language...
Artificial Intelligence1 month ago
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
Owing to its robust performance and broad applicability when compared to other methods, LoRA or Low-Rank Adaption is one of the most popular PEFT or Parameter...
Artificial Intelligence1 month ago
LightAutoML: AutoML Solution for a Large Financial Services Ecosystem
Although AutoML rose to popularity a few years ago, the ealy work on AutoML dates back to the early 90’s when scientists published the first papers...
Artificial Intelligence2 months ago
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
The recent progress and advancement of Large Language Models has experienced a significant increase in vision-language reasoning, understanding, and interaction capabilities. Modern frameworks achieve this by...
Artificial Intelligence2 months ago
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
The recent advancements in the architecture and performance of Multimodal Large Language Models or MLLMs has highlighted the significance of scalable data and models to enhance...
Artificial Intelligence2 months ago
MambaOut: Do We Really Need Mamba for Vision?
In modern machine learning and artificial intelligence frameworks, transformers are one of the most widely used components across various domains including GPT series, and BERT in...
Artificial Intelligence2 months ago
CameraCtrl: Enabling Camera Control for Text-to-Video Generation
Recent frameworks attempting at text to video or T2V generation leverage diffusion models to add stability in their training process, and the Video Diffusion Model, one...
Artificial Intelligence2 months ago
BrushNet: Plug and Play Image Inpainting with Dual Branch Diffusion
Image inpainting is one of the classic problems in computer vision, and it aims to restore masked regions in an image with plausible and natural content....
Artificial Intelligence3 months ago
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Over the years, the creation of realistic and expressive portraits animations from static images and audio has found a range of applications including gaming, digital media,...
Artificial Intelligence3 months ago
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
The advancements in large language models have significantly accelerated the development of natural language processing, or NLP. The introduction of the transformer framework proved to be...
Artificial Intelligence3 months ago
AIOS: Operating System for LLM Agents
Over the past six decades, operating systems have evolved progressively, advancing from basic systems to the complex and interactive operating systems that power today's devices. Initially,...
Artificial Intelligence3 months ago
Instant-Style: Style-Preservation in Text-to-Image Generation
Over the past few years, tuning-based diffusion models have demonstrated remarkable progress across a wide array of image personalization and customization tasks. However, despite their potential,...
Artificial Intelligence3 months ago
LoReFT: Representation Finetuning for Language Models
Parameter-efficient fine-tuning or PeFT methods seek to adapt large language models via updates to a small number of weights. However, a majority of existing interpretability work...

Page 1 of 512 3 4 5