First steps

AgentQL is a robust query language that identifies elements on a webpage using natural language with the help of AI. It can be used to automate tasks on the web, extract data, and interact with websites in real-time. AgentQL uses AI to infer which element or data you mean based on term names and query structure, so it will find what you're looking for even if the page's markup and layout change drastically.

AgentQL's SDK allows you to write scripts that identify elements and extract data from the web using the AgentQL query language. In this guide, you will learn how to use AgentQL queries and the SDK to automate page interactions and data extraction from the page.

Prerequisites

The AgentQL SDK

Instructions

The script below will open a browser and do the following:

Navigate to scrapeme.live/shop.
Input "fish" into the search field in header section.
Press "Enter" key to perform the search.
Close the the browser after 10 seconds.

Save the following script in a file named example_script.js then open a terminal in your project's folder and run the script with node example_script.js.

example_script.js

const { wrap, configure } = require('agentql');
const { chromium } = require('playwright');

configure({ apiKey: process.env.AGENTQL_API_KEY });

async function main() {
    const browser = await chromium.launch();
    const page = await wrap(await browser.newPage());
    await page.goto('https://scrapeme.live/shop');

    const QUERY = `
    {
        search_box
    }
    `;

    const response = await page.queryElements(QUERY);

    await response.search_box.fill('fish');
    await page.keyboard.press('Enter');

    // Used only for demo purposes. It allows you to see the effect of the script.
    await page.waitForTimeout(10000);

    await browser.close();
}

main();

Here's how you can create it step by step:

Step 0: Create a New JS Script

In your project folder, create a new JavaScript script and name it example_script.js.

Step 1: Import Required Libraries

Import needed functions and classes from playwright library and import the agentql library.

example_script.js

const { wrap, configure } = require('agentql');
const { chromium } = require('playwright');

Playwright is an end-to-end automation and testing tool that can be used for automation. In this example, it manages open the browser and interacting with the elements AgentQL returns.

Step 2: Launch the Browser and Open the Website

The last preparation step is launching the browser and navigating to the target website. This is done using usual Playwright's API. The only difference is the type of the page — instead of Playwright's Page class, it will be wrapped with wrap(), and you will get AgentQL's Page class that will be the main interface not only for interacting with the web page but also for executing AgentQL queries.

example_script.js

const browser = await chromium.launch({ headless: false });
const page = await wrap(await browser.newPage());
await page.goto('https://scrapeme.live/shop');

tip

Default AgentQL SDK implementation is built on top of Playwright and uses all of its functionality for interacting with browser, page and elements on the page.
By default Playwright launches the browser in headless mode. Here it's explicitly set to False for the sake of example.

Step 3: Define AgentQL Query

AgentQL queries are how you query elements from a web page. A query describes the elements you want to interact with or consume content from and defines your desired output structure.

example_script.js

const QUERY = `
{
    search_box
}
`;

In this query, we specify the element we want to interact with on "https://scrapeme.live/shop/":

search_box - search input field

info

To learn more about AgentQL query syntax, capabilities and best practices, check out our AgentQL Query documentation.
If you want to query just one element on the page, you can use simpler getByPrompt API that can identify an element by natural language description.

Step 4: Execute AgentQL Query

AgentQL's Page extends Playwright's Page class with querying capabilities:

example_script.js

const response = await page.queryElements(QUERY);

response variable will have the same structure as defined by the given AgentQL query, i.e. it will have 1 field: search_box. This field will either be None if described element was not found on the page, or an instance of Locator class that allows you to interact with the found element.

Step 5: Interact with Web Page

example_script.js

await response.search_box.fill('fish');

This line uses the fill method on the search_box element found in the previous step. It mimics typing "fish" into the search box.

example_script.js

await page.keyboard.press('Enter');

Here, the Enter method is called on the keyboard attribute of the page, simulating a press on the Enter key.

info

The fill method is coming from the Playwright's Locator class. To get a full list of methods available for interacting with web elements, please refer to this class's documentation.

Step 6: Pause the Script Execution

example_script.js

await page.waitForTimeout(10000);

Here, page.waitForTimeout() method is used to pause the execution for 10 seconds to see the effect of this script before closing the browser.

warning

page.waitForTimeout() is used only for demo purposes and will impact the performance. Don't use it in production!

Step 7: Stop the Browser

example_script.js

await browser.close();

Finally, the close method is called on the browser object, ending the web browsing session. This is important for properly releasing resources.

Step 8: Run the Script

Open a terminal in your project's folder and run the script:

terminal

node example_script.js

Prerequisites

Prerequisites

Instructions

Step 0: Create a New Python Script

Step 0: Create a New JS Script

Step 1: Import Required Libraries

Step 2: Launch the Browser and Open the Website

Step 3: Define AgentQL Query

Step 4: Execute AgentQL Query

Step 5: Interact with Web Page

Step 6: Pause the Script Execution

Step 7: Stop the Browser

Step 8: Run the Script