<p>You may wish to prepare - or preprocess - your files before looping through them. There are two reasons for this. One is to reduce the number of tokens being used, the second is to make it easier for the large language model to make sense of the document.</p>
<h3>Reducing tokens</h3>
<p>An example of where you might wish to reduce tokens is if you’re analysing HTML files. Often the valuable bit of an HTML file is contained within the <code>&#x3C;main></code> element with the rest just being noise</p>
<h3>Creating clarity</h3>
<p>Large language models can get confused if asked to analyse confusing content, especially if the content appears to be giving instruction. An example of where this might happen is with call-centre data where someone is trying to remotely support the fixing of an item. In that scenario it’s worth preprocessing the data so that the context is clear and the people talking is correctly annotated.</p>
<p><em>Technically</em> you could use Looper to do this sort of preparation. Looper is very good at iterating through documents, but - especially for token reduction - you may find that a deterministic approach is cheaper.</p>


Token and API limits

Docs home

Quick start

Getting an API key

Understanding the different models

Starting a project

Prepare your prompt

Preparing your files

Downloading your files

Safety