Published on October 4, 2024 | LLMs for everything, all aboard the hype train? | Tags: llm, t5, decoder, encoder-decoder | Is an LLM always the right choice for text generation? (probably not)
Published on September 13, 2024 | What George Washington taught me about writing with AI | Tags: llm, rag, icl, writing, assistant | An LLM has the potential to be a super writing assistant, but why does it often fall short?
Published on July 25, 2024 | Survey of Current Modified Transformer Attention Designs | Tags: llm, attention | Recent Advancements in Attention Mechanisms for Long-Sequence Understanding and Generation
Published on June 28, 2024 | AI now stands for Apple Intelligence | Tags: llm, apple, lora | Some exciting details from Apple's announcements at WWDC 2024
Published on June 6, 2024 | Picking the right LLM deployment configuration in AWS | Tags: llm, aws | AWS offers several options for LLM usage; which will you choose?
Published on May 16, 2024 | Is your data more or less important in the age of LLMs? | Tags: llm, data | LLMs promise high levels of performance without the need for task-specific training, but that doesn't mean that custom datasets are unnecessary.
Published on May 3, 2024 | How to Deploy Bedrock Agents using AWS CDK | Tags: llm, agents, bedrock | Last week, a Bedrock agent was deployed manually in the AWS Console. Let's now do it the smart way, with Infrastructure as Code using AWS CDK.
Published on April 26, 2024 | AI Agents with AWS Bedrock (Claude-3 Haiku LLM) | Tags: llm, agents, bedrock | A few weeks ago, I used Google's Gemini LLM for function calling. Let's see how it works with AWS Bedrock and Anthropic's Claude.
Published on March 25, 2024 | Function Calling with Google's Gemini LLM | Tags: llm, agents, gemini | When you need an LLM to access new information, function calling is here to help. Let's explore how to use Gemini's function calling to access real information.
Published on March 15, 2024 | Langchain: An Overview of my Experience with the LLM Framework | Tags: llm, rag, langchain | Langchain seems to be a popular choice for developing LLM applications, but when extending it beyond basic use cases, issues emerge.
Published on March 8, 2024 | Why LLMs mess up the simple things | Tags: llm, tokenization, chatgpt | A foundational problem with LLMs: a takeaway from Andrej Karpathy's lecture on GPT Tokenization.
Published on February 23, 2024 | Whose Bias Do You Want? | Tags: llm, gemini, google | The release of Gemini 1.5 Pro offers a glimpse into the difficulty of fine-tuning LLM guardrails.
Published on January 18, 2024 | Fantastic ML/AI and Where to Find Them: A Guide to the Best ML/AI Resources | Tags: LLM | So much is happening in AI; how do you keep up? Here are some of the best resources to follow to stay up to date on AI research and applications.
Published on January 12, 2024 | Exploration of Inference Speed of 🤗 Transformers Auto-Regressive Model | Tags: engineering, LLM | Are there easy ways to improve the inference speed of an auto-regressive model?
Published on December 22, 2023 | Can we trust ChatGPT integrations into consumer sites? | Tags: prompt, gpt, llm | Early adopters of LLM integrations risk unexpected outcomes.
Published on December 15, 2023 | Diving deep into Mistral AI and their newest model, Mixtral-8x7B | Tags: mistral, gpt, llm, mixtral | Looking at the source code to see what makes Mixtral-8x7B so powerful.
Published on December 8, 2023 | The most important AI development of 2023 was maybe not GPT-4 | Tags: llama, gpt, llm | In a year full of AI news and hype, which events are going to have the biggest enduring impact on AI?
Published on December 1, 2023 | Automation Bias in the use of Large Language Models | Tags: aws, llm, safety | Generative AI is showing up everywhere: how do we avoid trusting it more than we should?
Published on November 10, 2023 | Who am I? How to give an LLM access to new data | Tags: data, llm | How to use Retrieval Augmented Generation (RAG) to connect LlaMA-2 with a large number of internet sources.
Published on November 3, 2023 | AI Safety: Snippets of Biden's Executive Order on AI | Tags: safety, regulation, llm | One application of the recent Executive Order from the White House.
Published on October 27, 2023 | Using Google's Bard LLM for Rapid Web Development | Tags: LLM, web-development, bard | Using Google's Bard LLM for Creating a Flask and React Recipe Review Application.
Published on October 20, 2023 | Training in Sagemaker with Huggingface, Tips & Tricks | Tags: LLM, transformers, sagemaker | Some hidden tips and tricks to get the most out of SageMaker training jobs.
Published on October 13, 2023 | The New Huggingface Templates for Chat Models Feature | Tags: LLM, transformers, tokenization, templates | Upgrade your code with chat model templates.
Published on August 28, 2023 | Train LlaMA-2 LLM on your own emails, Part 2. Model Training | Tags: LLM, transformers, tokenization, email, personalization | Train an LLM to answer emails for you.
Published on August 25, 2023 | Train a LlaMA-2 LLM for free to be an email autocomplete assistant using your own email data! Part 1. Introduction and Data Preparation | Tags: LLM, transformers, LlaMA-2, personalization | Prepare a dataset to train an LLM to answer emails for you.
Published on August 2, 2023 | End of Sequence Token Explained | Tags: LLM, transformers, tokenization | How is the pad token handled when training a transformer, and what's the impact of setting the pad token to be the same as the EOS token?