Category: OpenAI API
-
How to use GPT-4 with Vision for Robotics and Other Applications

TLDR: There are multiple ways you can use GPT-4 with Vision to power robotics and other systems. I included several sample apps you can download to experiment with including a robot simulator. Multimodal AI models like GPT-4 with Vision have allowed for entirely new kinds of applications that go far…
-
Improving GPT-4’s Visual Reasoning with Prompting

TLDR: How to boost GPT-4 with Vision’s capabilities with a simple prompt addition. A recent paper How Far Are We from Intelligent Visual Deductive Reasoning? points out the limitations in visual reasoning in image models like GPT-4V. Like other related papers, I think the investigators are directionally correct (these models…
-
Is the Reversal Curse Real?

A recent paper The Reversal Curse points out an apparent failure in large large language models like GPT-4. From the abstract: We expose a surprising failure of generalization in auto-regressive large language models (LLMs). If a model is trained on a sentence of the form “A is B”, it will not automatically…
-
Smarter than you think: Crystalline and fluid intelligence in large language models
Large models like GPT-3 can perform a variety of tasks with little instruction. That said, one of the challenges in working with these models is determining the right way to do something. GPT-3 has acquired knowledge from its training data as well as another kind of “intelligence” from learning the…
-
How to get better Q&A answers from GPT-3
GPT-3 is an exceptional mimic. It looks at the text input and attempts to respond with what text it thinks best completes the input. If the first line sounds like something from a romance novel it will try to continue writing in that style. If it’s a list of video…
-
Video: GPT-3 and long lists
GPT-3 can remember hundreds of items and perform completions with them. This is useful if you want to take your prompts to the next level and do more complex operations.
-
A simple method to keep GPT-3 focused in a conversation

OpenAI’s GPT-3 is a highly capable general language model able to talk about almost anything. While this is an advantage on one hand, it can also make keeping GPT-3 focused on one topic a challenge if you’re trying to create a special purpose chatbot. If you want GPT-3 to talk…
-
OpenAI API Classification Endpoint
We recently added three new endpoints to the API for GPT-3. The Classification Endpoint makes it easy to apply classification from a data set larger than what fits inside a prompt.
-
How large is GPT-3’s short term memory?

TL;DR: In an API call GPT-3 can recall details from a 1,500 word article and even repeat passages verbatim. It can also repeat over 250 items from a list as it creates a completion. The concept of memory with a large language model can be a little fuzzy. There’s how…
-
The GPT-3 Zero Shot approach

TL;DR: For many tasks you don’t need to provide GPT-3 with examples because it already understands what you want. If you look closely at the documentation and prompts for GPT-3 provided by OpenAI you’ll notice that a number of them don’t require any examples to show the model what you…