In today’s column, I am continuing my ongoing coverage of prompt engineering strategies and tactics that aid in getting the most out of using generative AI apps such as ChatGPT, GPT-4, Bard, Gemini, Claude, etc. The focus here is on utilizing imported text of your own choosing when you are carrying on a generative AI conversation and want the AI to particularly lean into the text. This can be handy if you perchance have outside collected text that pertains to whatever endeavor you are opting to use generative for. For my comprehensive review of nearly two dozen other keystone prompting strategies, see the discussion at the link here.
Today’s discussion is on an essential prompting strategy that you definitely ought to know. The technique will indubitably be handy from time to time, even if right now you might not have an immediate need to make use of some extensive brought-in text during your prompting endeavors.
I would guess that most of the casual or occasional users of generative AI are fine with just hand-entering text directly during a generative AI session. For example, a user might enter a list of grocery items and then ask generative AI to tell you how many calories they each have. Alternatively, instead of providing a list of grocery items, another approach would consist of simply telling generative AI to produce a list of common grocery items and indicate their respective calories. In that case, the everyday user is seemingly satisfied with whatever text the generative AI already has in hand.
All of that is a no-sweat, no-fuss style of usage.
Good for them.
At some point, the odds are that you are going to have a file of say your favorite written personal stories that you’d like to examine with generative AI, or maybe your boss has handed you a bunch of text data that consists of transcripts associated with your customer service reps. It’s a lot of text, more so than you would want to hand-enter. Hand-entering the text would be exhausting, exasperating, and a sad woe since you could have imported the text rather than laboriously typing it in.
Let’s consider the text data that your boss wants you to analyze. Suppose that the boss knows you know how to use generative AI (lucky you!) and has asked you to analyze the transcripts with AI. If you’ve never done such an activity in generative AI, I’m sure you will go into pure panic mode and hurriedly grasp at straws to timely deal with the assigned task. The clock is ticking.
No need to grasp at straws. I will be explaining to you the process and best prompting practices that pertain to this type of task in generative AI. One thing to keep in mind is that each generative AI app is different from other ones in terms of how you import text data. I am going to mainly discuss ChatGPT by OpenAI, though my prompting strategies will generally apply to any of the popular generative AI apps. In the end, you’ll need to explore the various online help files and other materials that will inform you about the specifics of the import operations for your chosen generative AI app.
Because the importation act is substantively dependent on which generative AI app you are using, I am going to just sketch how the importation typically works.
My main focus will be the prompting strategies that you can use once you’ve been able to import text of your own choosing. Those prompting approaches are pretty much applicable across the board. No matter which generative AI app that you opt to use, knowing how to word your prompts to glean the most from imported text is a generalized rule of thumb. My tips and techniques will live long and prosper no matter which generative AI app you end up utilizing.
All in all, my overarching goal in today’s discussion is to get you up-to-speed on the recommended prompt engineering skills underlying making the best use of imported text. That’s something that will stay with you for the rest of your proud and long-lasting prompting career.
The Context Window Is Your Friend And Your Enemy
One of the most crucial elements in this realm that you need to be mindful of entails an aspect of today’s generative AI that refers to a context window. Please put the phrase “context window” on your forehead so that you’ll always remember it (well, just kidding about the forehead thing, of course).
Here’s the deal.
There is only so much text that each generative AI app can handle at one time. Once you hit the allowed maximum, the earliest of the text starts to roll out of usage. This kind of makes sense. You would naturally want the most recently available text to likely prevail. The text that was earlier on might be less important.
I am not suggesting this makes sense in the absence of the constraint that there is a maximum allowed amount of text that can be dealt with at any one point in time. Obviously, we would all prefer that all of the text should be considered fully usable all of the time. The good news is that the AI makers realize this desire, and are fervently enlarging the window of allowed text, perhaps ultimately permitting an essentially unlimited size. Keep your fingers crossed.
Right now, lamentedly, you need to keep in mind whatever limit is established for the generative AI app that you are using. Another approach known as RAG (retrieval augmentation generation) can be used, though still entails coping with the context window size, for those of you interested in RAG and in-context modeling, see my discussion at the link here.
Let’s do some brainstorming together.
You intend to import text that consists of transcripts for a customer service agent who had lengthy conversations with many customers. To start easily, you imported an hour’s worth of text. You use generative AI, and the AI readily analyses the imported text. Good so far.
You then import another hour’s worth of text. Let’s pretend that the limit for this generative AI is that it can only focus on one hour’s worth of text at a time. When you analyze this second hour of text, it all works well on that second hour’s worth of text, but when you decide to ask about the first hour’s worth of text, the AI app tells you that this first hour of text is no longer available for analysis.
Say what?
Yes, unfortunately for you, these studious efforts have exceeded the context window size limits and only the most recently imported text is going to be considered available for analysis.
I assure you that upon reaching the limits of the context window, you are going to pull your hair right out of your head. It is very disconcerting to discover you no longer can encompass the prior text. There you are, in the midst of getting things done, and meanwhile, the text that you had earlier imported is no longer within the available scope of your analysis. Yelling and screaming often ensue (maybe a bit of cursing too).
Before I bring up the typical sizes of today’s context windows for various generative AI apps, I will need to introduce you to another important catchphrase known as tokens or tokenization.
Let’s do so.
Tokens Make The World Go Round
I am going to quickly bring you up-to-speed about how generative AI and large language models (LLMs) convert text or words into a numeric format, for my detailed explanation see my coverage at the link here.
All in all, this conversion from text to a numeric format is generally referred to as tokenization. Tokens are numbers or numeric IDs. Tokenization consists of encoding words into their respective numeric IDs, and then later decoding the numeric IDs back into the text words they represent.
This happens inside the generative AI, and you don’t see it happening. You enter text, it gets converted into tokens, the tokens are made use of, and the generated response that consists of tokens is then converted back into text that is presented to you. Voila, all occurring behind the scenes.
The way that generative AI handles this is by establishing a dictionary of words or segments of words and assigning whatever numeric values or IDs are chosen to be used. Different generative AI apps will tend to use different numbering schemes. There are available online tools to do tokenization that make available a kind of semi-common approach if an AI maker wishes to go that route.
Before I get into the details, I would like to say something else about the tokenization process. One approach would be to assign numeric IDs for each word that we might anticipate ever encountering in our generative AI. In the case of the English language, we presumably would need to have a dictionary of perhaps a million words and assign a unique ID to each respective word. The same would need to be done to incorporate other languages and our dictionary would be sized up accordingly.
Rather than aiming to depict words as their whole, another approach consists of dividing up words into a few letters. The word “boomerang” might be divided into three segments such as “boom”, “e”, and “rang”. Or we could decide we want to divide into three-letter or four-letter segments, which might in the case of three-letter be like this “boo”, “mer”, “ang”.
A rule of thumb that is often used entails thinking that 750 words will approximately produce about 1,000 tokens. This is merely a generalized notion.
You can consider the rule of thumb this way:
- Number of words = Number of tokens x 0.75
- Number of tokens = Number of words x 1.33
Each generative AI app takes an idiosyncratic approach to its tokenization. To give you an example of what this might consist of, consider this description by OpenAI about the approach taken with ChatGPT (per their online blog entitled “What are tokens and how to count them?”), excerpted as follows:
- “Tokens can be thought of as pieces of words. Before the API processes the request, the input is broken down into tokens. These tokens are not cut up exactly where the words start or end — tokens can include trailing spaces and even sub-words. Here are some helpful rules of thumb for understanding tokens in terms of lengths.”
- “1 token ~= 4 chars in English”
- “1 token ~= ¾ words”
- “100 tokens ~= 75 words”
They also describe their approach as somewhat converted into words:
- “1-2 sentence ~= 30 tokens”
- “1 paragraph ~= 100 tokens”
- “1,500 words ~= 2048 tokens”
The reason you might care about the number of tokens is that it is customary for generative AI apps to be set up to charge you based on the number of tokens that you use. Also, the limitations of a generative AI app such as the amount of text that you can use in a conversation are often stated in terms of the number of tokens allowed.
That last point is why you will need to be cognizant of tokenization, namely that the context window of generative AI apps is customarily depicted in terms of the number of tokens that a window can maximally reach.
OpenAI posts on its website the existing context window sizes per which of their generative AI apps you are using. For example, at this moment in time, the pricing sheet says that the free version of ChatGPT has an “8K context window” while the ChatGPT Plus that has access to GPT-4 has a “32K context window”. This means you can have a context window of roughly 8,000 tokens for ChatGPT and about 32,000 tokens for GPT-4.
If you are using some other generative AI app, make sure to look at their stated context window size. Furthermore, realize that all the AI makers are rushing to increase their respective context window sizes, thus, it could be that whatever limit you previously knew of has been increased (you’ll want to know if so, allowing you to thankfully deal with more text at a time).
Making Useful Sense Of Tokens And Context Windows
I hope that the rendition of tokens is not unduly dizzying.
I suspect that you are unsure of how to mentally calibrate the world in terms of tokens. My suggestion is that you can approximate the context windows by the number of words allowed. Words are a fine way to conceptualize the matter.
For example, if a context window has a maximum size of approximately 8,000 tokens, use the handy-dandy rule of thumb that I earlier noted and multiply the 8,000 tokens times 0.75, which gives you a calculated result of approximately 6,000 words. OK, for the generative AI app that has an 8,000 tokens context window, you can consider this to be a context window of about 6,000 words.
Now that you’ve got the context window size figured out in terms of the number of words, the next step would be to look at how much text you are desirous of importing. Envision that you have 5,000 words that you wish to import. The imported 5,000 words are less than the 6,000 words context window maximum size. That’s good. You can proceed to do your prompts and analyze the entire imported text all at once (I’ll get to some caveats in a moment).
Suppose that the imported text was actually 8,000 words. Darn, that means that the imported text is going to exceed the context window size. You can still import the 8,000 words, but only the considered most recent set of 6,000 words is going to fall within the context window. The latter part which was the other 2,000 words won’t be taken into account when you enter prompts and seek to analyze the imported text.
I have even worse news for you, sorry to say.
Prepare yourself accordingly.
The prompts that you enter are also counted toward the context window limit. Plus, the results generated by the AI are counted toward the context window limit. This is a double or maybe triple whammy. I say that because suppose you import 6,000 words and you are greatly relieved that you have gotten within the context window limit.
In truth, you really haven’t since the odds are that you’ll want to enter prompts and get generated results from those prompts. That additional text will get counted toward the context window limit. For example, you import 6,000 words of text, then enter a prompt of 500 words and get a generated response of 1,000 words (for a grand total of 1,500 words). This means that you have now chewed up 1,500 words of the available context window. In turn, that means that the first 1,500 words of the 6,000 words that you imported are no longer considered within the context window and will be out of scope when you do your next prompt.
Beguiling.
Frustrating.
Yes, welcome to the prevailing world of generative AI that has context window size limits. I don’t know if this will make you feel any better, but the context window sizes used to be much lower, such as around 1,000 words to maybe if you were fortunate about 2,000 words. The notion of having 6,000 words seems like a huge luxury. Maybe that doesn’t make you feel any better.
Alright, how about the fact that we are now seeing context window size limits of around 200,000 tokens for some generative AI apps? That would be approximately 150,000 words. Does that brighten your day?
I trust it does.
Importing Of Text Into Generative AI Apps
Congratulations on now knowing about context windows and tokens. You are a better person for knowing about such things.
How would you import text into a generative AI app?
I assume that the first thing you would do is find out the context window size limits for whichever generative AI you are going to be using. Once you have that in hand, you would consider whether your imported text is going to fit within the context window or not. I will be sharing with you prompting strategies that take into account the use case of the context window being sufficient for you, along with the use case of when the context window is too short for you.
You might recall that I said that each generative AI app takes a different approach to the importing of text. I’d like to focus on ChatGPT, giving you a semblance of how things work. Again, you’ll need to research the particulars for whichever generative AI app you are using.
Here are the main methods for ChatGPT:
- Simple copy-and-paste of text into ChatGPT.
- Use a ChatGPT plugin.
- Use a ChatGPT API.
- Use the ChatGPT Advanced Data Analysis feature (formerly the Code Interpreter).
- Use a browser extension made for this purpose.
- Use an online tool made for this purpose.
- Other
There are assorted steps that would be taken for each of those methods. I won’t cover the steps here. You can find the steps online depending upon which of those paths you decide to take.
The easiest and least technical approach would be the first method listed, consisting of doing a conventional copy-and-paste into ChatGPT. My guess is that anyone with a relatively small set of text would find this approach sufficient. If you have a whole lot of text, you will probably be wise to try one of the other methods.
I will share with you my favorite importing-text rules of thumb that I cover in my classes on the use of generative AI and prompt engineering:
- (1) Upfront work. Preprocess the to-be-imported text before you try to bring it into the generative AI app.
- (2) Do the chunk. If possible, try to chunk the text and place a needed chunk within the context window one chunk at a time.
- (3) Sequence chunks. Consider sequencing the chunks of text beforehand so that you’ll know when to bring the chunk in.
- (4) Economy matters. Be economical and only bring in as much text as you really need for a given series of prompts.
- (5) Rolling for dough. Roll in new text as needed and realize or allow prior text to fade out of the context window.
- (6) Vigilance always. Keep on your toes that if you have silently faded out of a vital part of the context window you will get potentially confusing results from your prompts.
- (7) Do a start over. Consider starting a fresh conversation if an existing conversation gets overly convoluted and off-putting.
A brief explanation might be helpful to you.
You will need to do upfront work and make sure that the text is viable for being imported. If your text is replete with oddball characters or has other data-related peculiarities, get those straightened out before you import it. The best bet is to have pristine text that approximates conversational text, meaning that it is readable in the same way that a human might have hand-entered the text into the generative AI.
Next, if the size of the text exceeds the context window for the generative AI, you can still import it but will need to do some trickery when doing your prompting, which I’ll get to in the next subsection. I would urge instead that you try to chunk the text into sizes that are less than the context window (also taking into account the likely size of your prompts and generated results).
In the use case of the transcripts of a customer service rep, maybe you can just do the generative AI analysis on a chunk at a time. Perhaps having all of the text within scope at once is not a necessity. You would chunk the text, import it, do your analysis, and proceed to bring in the next chunk.
The tough thing to keep in mind during the heated battle of using the imported text will be that at any moment you might find yourself going beyond the context window size. If that occurs, your prompts will only be scoped amidst the context window size. This could produce misleading results in that the earlier text that no longer is in scope won’t be examined. There will not necessarily be any alarm bells that go off. You will need to realize that the text outside the context window wasn’t considered during the prompt generation response. Figuring this out is primarily by omission, rather than by overly apparent untoward results.
Rules Of Thumb About Prompting Strategies For Imported Text
By hook or crook, let’s assume you have imported the text that meets your heart’s desire.
We can now consider my handy-dandy rules of thumb about prompting strategies that you can use:
- (1) Be mindful. Keep the context window in mind at all times.
- (2) Prompt size counts. Realize that your prompt is counted as part of the context window.
- (3) Response size counts. Realize that a generated response is counted as part of the context window.
- (4) Stay focused. Try to focus your prompting on a narrow enough search area to abide by the context window.
- (5) Explicitly stating scope. You will need to say whether the scope is just your imported text, your imported text along with whatever else is known, or everything but your imported text.
- (6) Use clues or keywords. Make use of contextual clues or keywords to try and remain within the context window.
- (7) Use quotes or excerpts. Potentially use specific quoted portions or excerpts to aim at the desired area in the context window.
- (8) Watch for undershooting. Do not unduly constrain a search such that it only examines a narrow part of the context window and misses out otherwise.
- (9) Summarization ploy. Consider summarizing the text via generative AI and using that instead of the raw text.
- (10) Dialogue and remain cool. Be persistent and judiciously use interactive dialoguing to garner what’s in the context window.
I shall walk you quickly through those rules.
I have already pounded away that the context window size must be always at the forefront of your thinking. This is something that you’ll need to be extremely watchful of. I also noted that the prompts you enter will count toward the context window consumption, as will the generated responses. A lot of precarious juggling will be taking place.
I have a mind-bending twist for you that most casual users of generative AI have not thought about.
With a bunch of text having been imported, you will need to start thinking about your prompts in a new way that perhaps you hadn’t previously considered. Consider three kinds of scope: (1) the text you’ve imported, (2) the text you’ve imported plus whatever else the generative AI already has, (3) whatever else the generative AI already has and with a pretense that you didn’t import any text at all.
I am going to show those three sets of scope in the next section when I show you examples of text about Abraham Lincoln. Hang in there, we will get to specifics.
Beyond the idea of scoping into three sets of scope, another useful tip is to use clues, keywords, quotes, or excerpts in your prompts if you want the generative AI to focus on your imported text. You are going to want to direct attention to the text. One means to do so is to give indications in your prompt that will do this.
A technique that sometimes might also be valuable to use consists of summarizing the imported text. Here’s what I mean. You import a bunch of text and ask generative AI to create a summary. At that juncture, you possibly no longer care about the imported text. You might be fine with just using the summary as your source for whatever prompts you next want to enter.
Suppose that I have 3,000 words of text to be imported. I import it. I’ve used up 3,000 words in the context window. Maybe not every inch of that text is necessary for the analysis I am going to do. I tell generative AI to summarize the text. Imagine that the summary is 500 words in length. Assuming I no longer care about the whole 3,000 words, I will focus my attention on the 500 words. I have saved myself 2,500 words of available context window space.
The rub is that a summary might not be on par with using the entire text that you imported. I discuss at length the issues of summarization in generative AI at the link here. Use this summarization ploy with caution.
The Importance Of Prompt Engineering
I’ve been carrying on so far in this discussion as though we’ve already covered what prompt engineering provides. My passion got the better of me. Some of you might not be familiar with the topic. Please allow me a moment to bring everyone up to speed, thanks.
First, please be aware that composing well-devised prompts is essential to getting robust results from generative AI and large language models (LLMs). It is highly recommended that anyone avidly using generative AI should learn about and regularly practice the fine art and science of devising sound prompts. I purposefully note that prompting is both art and science. Some people are wanton in their prompting, which is not going to get you productive responses. You want to be systematic leverage the science of prompting, and include a suitable dash of artistry, combining to get you the most desirable results.
My golden rule about generative AI is this:
- The use of generative AI can altogether succeed or fail based on the prompt that you enter.
If you provide a prompt that is poorly composed, the odds are that the generative AI will wander all over the map and you won’t get anything demonstrative related to your inquiry. Similarly, if you put distracting words into your prompt, the odds are that the generative AI will pursue an unintended line of consideration. For example, if you include words that suggest levity, there is a solid chance that the generative AI will seemingly go into a humorous mode and no longer emit serious answers to your questions.
Be direct, be obvious, and avoid distractive wording.
Being copiously specific should also be cautiously employed. You see, being painstakingly specific can be off-putting due to giving too much information. Amidst all the details, there is a chance that the generative AI will either get lost in the weeds or will strike upon a particular word or phrase that causes a wild leap into some tangential realm. I am not saying that you should never use detailed prompts. That’s silly. I am saying that you should use detailed prompts in sensible ways, such as telling the generative AI that you are going to include copious details and forewarn the AI accordingly.
You need to compose your prompts in relatively straightforward language and be abundantly clear about what you are asking or what you are telling the generative AI to do.
A wide variety of cheat sheets and training courses for suitable ways to compose and utilize prompts has been rapidly entering the marketplace to try and help people leverage generative AI soundly. In addition, add-ons to generative AI have been devised to aid you when trying to come up with prudent prompts, see my coverage at the link here.
AI Ethics and AI Law also stridently enter into the prompt engineering domain. For example, whatever prompt you opt to compose can directly or inadvertently elicit or foster the potential of generative AI to produce essays and interactions that imbue untoward biases, errors, falsehoods, glitches, and even so-called AI hallucinations (I do not favor the catchphrase of AI hallucinations, though it has admittedly tremendous stickiness in the media; here’s my take on AI hallucinations at the link here).
There is also a marked chance that we will ultimately see lawmakers come to the fore on these matters, possibly devising and putting in place new laws or regulations to try and scope and curtail misuses of generative AI. Regarding prompt engineering, there are likely going to be heated debates over putting boundaries around the kinds of prompts you can use. This might include requiring AI makers to filter and prevent certain presumed inappropriate or unsuitable prompts, a cringe-worthy issue for some that borders on free speech considerations. For my ongoing coverage of these types of AI Ethics and AI Law issues, see the link here and the link here, just to name a few.
All in all, be mindful of how you compose your prompts.
By being careful and thoughtful you will hopefully minimize the possibility of wasting your time and effort. There is also the matter of cost. If you are paying to use a generative AI app, the usage is sometimes based on how much computational activity is required to fulfill your prompt request or instruction. Thus, entering prompts that are off-target could cause the generative AI to take excessive computational resources to respond. You end up paying for stuff that either took longer than required or that doesn’t satisfy your request and you are stuck for the bill anyway.
I like to say at my speaking engagements that prompts and dealing with generative AI is like a box of chocolates. You never know exactly what you are going to get when you enter prompts. The generative AI is devised with a probabilistic and statistical underpinning which pretty much guarantees that the output produced will vary each time. In the parlance of the AI field, we say that generative AI is considered non-deterministic.
My point is that, unlike other apps or systems that you might use, you cannot fully predict what will come out of generative AI when inputting a particular prompt. You must remain flexible. You must always be on your toes. Do not fall into the mental laziness of assuming that the generative AI output will always be correct or apt to your query. It won’t be.
Write that down on a handy snip of paper and tape it onto your laptop or desktop screen.
Examples Of Coping With Imported Text Via Your Prompts
I will next proceed to showcase the use of astute prompting approaches to cope with imported text and having to battle with context window constraints. I am going to use ChatGPT to showcase my examples. ChatGPT is a sensible choice in this case due to its immense popularity as a generative AI app. An estimated one hundred million weekly active users are said to be utilizing ChatGPT. That’s a staggering number.
A few quick comments before we launch into using ChatGPT.
If you are going to try to do the same prompts that I show here, realize that the probabilistic and statistical properties will likely produce slightly different results than what I show here. That’s the nature of generative AI and how it is devised.
You also should expect that different generative AI apps will respond in different ways. Not all generative AI apps are the same. The key is that sometimes a particular prompt will work in one generative AI app and not another. You will need to undertake a series of trial-and-error attempts to ferret this out.
There is also the timing factor. A prompt that worked today might not work the same way tomorrow. Most of the generative AI apps are continually being updated. The updates might alter internal facets that could change how the AI reacts to your prompting.
We are ready to get underway with ChatGPT.
My opening move in this chess gambit will be to ask the generative AI app about how to import text. You can do the same in whichever generative AI app that you are using.
Here we go.
- My entered prompt: “How can I import text into ChatGPT?”
- ChatGPT generated response: “As an AI model, I don’t have the ability to directly import or access external files or text. However, you can easily provide text to me by copying and pasting it into the chat interface here. Once you paste the text, I can analyze it, generate responses based on it, or provide insights as needed. If you have a specific text or document you’d like to discuss or analyze, feel free to paste it here, and I’ll do my best to assist you with it!”
I would say that the response is not quite complete in that there are other ways to import text, as per my earlier discussion. Yes, the copy and paste is the most straightforward approach, but if you have a ton of text to import, I strongly suggest that you consider one of the other methods too.
Let’s keep going.
I will next ask about the size of the context window.
- My entered prompt: “If I paste a lot of text into ChatGPT, how large can the amount of text be?”
- ChatGPT generated response: “The amount of text you can paste into the chat interface here is quite flexible, but there are practical limits. Generally, you can paste several paragraphs or even longer passages of text without issue. However, if you try to paste an excessively large amount of text all at once, you might encounter limitations in terms of the chat interface’s capacity to process and display the text efficiently.”
I would say that this response is not especially enlightening. Refer to my earlier discussion about the pricing chart of OpenAI and the existing stated context window sizes.
At this juncture, I will go into a mode of acting as though I am importing a bunch of text. I will use the copy-and-paste method. I am not going to show all the imported text because it is a lot of text in order to push things to the size limits of the context window and there is no reason to use up the space limitations in this herein discussion.
Don’t worry, I will tell you what I am doing as we proceed with the examples.
My aim is to use a simple example involving Abraham Lincoln. I will begin by asking ChatGPT for an obscure fact about Honest Abe. I will be using that provided fact later.
Things got underway this way:
- My entered prompt: “Tell me an obscure fact about Abraham Lincoln.”
- ChatGPT generated response: “An obscure fact about Abraham Lincoln is that he was a licensed bartender. In his early years, Lincoln co-owned a general store in New Salem, Illinois, where he also ran a tavern. He obtained a liquor license in 1833, allowing him to serve alcohol at the establishment. While he didn’t drink much himself and was known for his temperance, Lincoln was adept at mixing drinks and managing the tavern’s operations.”
Did you know that Lincoln had been a licensed bartender?
News to me.
You can amaze your friends and acquaintances with that bit of delightful trivia.
I am now going to make up something about Abraham Lincoln that has no basis in historical fact. I want to do this to ensure that the text I am importing is unlike any text that generative AI might have already been data trained on. You’ll see why this is important in a moment.
First, the thing I am going to make up about Lincoln is that he used to collect rubber ducks. I am confident that he didn’t do so, but it is worth checking with ChatGPT to see if perchance that is something that got mixed into the data training of the generative AI app.
Asking about rubber ducks is cool.
- My entered prompt: “Did Abraham Lincoln have a rubber duck?”
- ChatGPT generated response: “There is no historical evidence to suggest that Abraham Lincoln owned a rubber duck. However, it’s important to note that historical records might not always capture every aspect of a person’s life, so it’s theoretically possible that Lincoln could have encountered or owned a similar type of toy, though there’s no specific evidence to support this.”
As anticipated, it seems unlikely that Lincoln collected rubber ducks. ChatGPT indicates that this is something not within its data training. Maybe Lincoln did so secretly? Could be.
The next thing to do is import text about Lincoln and his love for rubber ducks (I’ve made that up).
I proceed to do so.
- My entered prompt: “I am going to provide you with some text about Abraham Lincoln and I will be asking you questions about him.”
- ChatGPT generated response: “Please go ahead and provide the text.”
- My entered prompt: “Abraham Lincoln loved to collect rubber ducks. He had an extensive collection of them. He began collecting rubber ducks as a child. Later on, as an adult, he continued doing so. It was a favorite hobby of his.”
You can obviously see that the text is very short. We are nowhere near the context window size limits. I will soon be pushing the window by dumping a bunch of other text into my ChatGPT conversation.
I want to make sure that my rubber ducks’ indication has been seen by ChatGPT.
Let’s ask.
- My entered prompt: “Using only the text that I provided to you, answer this question: Did Abraham Lincoln collect rubber ducks? Do not look elsewhere to find the answer. Look only at the text I provided.”
- ChatGPT generated response: “Based solely on the text you provided, it states that Abraham Lincoln loved to collect rubber ducks, beginning as a child and continuing into adulthood. Therefore, according to the information you provided, yes, Abraham Lincoln collected rubber ducks.”
I want you to look closely at my prompt.
Notice that I told generative AI to exclusively look at the text that I imported. This brings up my point about the three kinds of scope involved. For the time being, I wanted to have ChatGPT only consider the imported text. I didn’t want it to comingle my imported text with any other data training that it already has.
What would happen if I didn’t narrow the focus to my imported text?
Let me show you what would happen.
- My entered prompt: “I want you to answer the question again, but this time include anything else that you know about Abraham Lincoln in your response.”
- ChatGPT generated response: “Abraham Lincoln’s historical records do not mention anything about him collecting rubber ducks. The information provided about Lincoln collecting rubber ducks seems to be fictional or speculative. In reality, Lincoln is not known to have had such a hobby.”
Do you see how the generated response is now commingling my imported text and the data training that the generative AI already has?
You will need to judiciously decide when you start prompting related to your imported text the scope of what you want the generative AI to encompass. In some instances, you will want just the text itself to be the considered scope. In other instances, you might welcome a commingling. It all depends on what you are trying to achieve.
Remember that ChatGPT told us that Lincoln had been a bartender? If I focus the scope of my prompt exclusively on the text that I imported, presumably the bartender fact is no longer being considered since it is outside the scope of my statement about rubber ducks and Lincoln.
Take a look.
- My entered prompt: “According to the text that I gave you about Abraham Lincoln and his interest in rubber ducks, I want you to use that text and only that text to answer the next question that I have for you. Do not use anything else you know. Only use the text that I gave you. Was Lincoln ever a bartender?”
- ChatGPT generated prompt: “No, according to the text provided about Abraham Lincoln’s interest in rubber ducks, there is no mention of Lincoln ever being a bartender.”
Okay, in this last example, I am going to flood the context window. I took a bunch of Lincoln-related text and copied it into my existing ChatGPT conversation. There was so much text that it pushed the rubber duck text outside of the context window.
I will see if ChatGPT is able to still access the rubber duck’s newsworthy text.
- My entered prompt: “I am going to provide you with a lot of text about the life of Abraham Lincoln. Please be ready to answer questions based solely on the text that I provide to you.”
- ChatGPT generated response: “Absolutely, I’m ready to assist. Please provide the text.”
- My entered prompt: “Based on the text that I have now provided to you; did Abraham Lincoln collect rubber ducks?”
- ChatGPT generated prompt: “No, according to the text provided about Abraham Lincoln’s life, there is no mention of Lincoln ever collecting rubber ducks.”
You can see that the rubber ducks no longer are within the context window. Sad to see them go. I liked the rubber ducks and Lincoln conjecture.
Conclusion
I am sure that if you are an avid user of generative AI, you will at one point find yourself needing to import text from some external files. I’ve tried to lay out the general tips and suggestions about importing text into generative AI. I also provided an indication of recommended prompting practices once you’ve imported your text.
They say that the best way to get to Carnegie Hall consists of three words, practice, practice, practice.
If you have time to do so, please go ahead and find out the context window size limits of your generative AI app. Next, collect a slew of text that you can use to try to reach the boundaries of the context window. Clean the text before importing. Try one or more methods of importing. Etc.
I would caution you that before you import anything of a private or confidential nature (which you shouldn’t do anyway), make sure to find out about how your generative AI app handles the privacy of your data, which I’ve covered overall at the link here.
Once you’ve imported your text, play around with it. See what happens when you use prompts that exclusively examine the text versus commingle with other data that the generative AI was data trained on. Become conversant in prompting across a large body of imported text.
Do this as soon as you can, thus, preparing yourself for the day that your boss suddenly drops on your plate a task of this caliber. You don’t want to be in a position of having to scramble and do a half-baked job.
What can you do to avert a calamity?
Easy-peasy.
Practice, practice, practice.