MariaDB: The “Regexp AND LIKE” optimisation

by Federico Razzoli | Jan 27, 2022 | MariaDB, MySQL

Need Help? Click Here for Expert Support

MariaDB, like most databases, allow to run queries that use regular expressions in the WHERE clause. While this is powerful, a problem with using a regexp in a database is that it can’t take advantage of an index. The reason is simple: the query optimiser can’t understand regular expressions, or make assumptions on what they will return. This article suggests a trivial but counter-intuitive optimisation that can sometimes workaround this limitation. I called it the Regexp AND LIKE optimisation.

Contents hide

1 The regexp

2 LIKE to the rescue

3 Why is this an optimisation?

The regexp

Consider the following regexp:

SELECT id, code FROM product WHERE code RLIKE '^AB[[:digit:]]+';

Just in case you don’t understand regexp syntax:

RLIKE performs a regular expression.
- REGEXP is a synonym.
^ is the beginning of the string.
AB is just a sequence of characters we’re looking for.
[[:digit:]]+ means one or more digits.

The whole regexp means: “a string that starts with AB, immediately followed by at least one digit”.

LIKE to the rescue

As mentioned at the beginning of this article, a regexp can never make use of an index. That is a pity, because we are looking for a precise substring at the beginning of a string, and this search could theoretically use an index (range search).

This condition should use an index:

WHERE code LIKE 'AB%'

Unfortunately, this won’t check if AB is followed by digits.

But what about this condition?

SELECT id, code
    FROM product 
    WHERE code LIKE 'AB%' AND code RLIKE '^AB[[:digit:]]+';

Here’s what will happen (in most cases):

MariaDB will check the LIKE expression over an index. This should filter out most of the rows.
It will check the regular expression against the rows that satisfy the LIKE condition.

Even in the worst case (ie, no index used) this won’t be sensibly slower than using the LIKE condition alone.

Use the EXPLAIN statement. Check that an index is usable (possible_keys column) and is actually used (key column, type=range). If so, the rows column should show that the number of rows examined by the second query is much lower.

Why is this an optimisation?

An index is an ordered data structure, just like a phone book. If you had to look Tom Baker on a British phone book, you’d find it in seconds: because of the alphabetical order, you could just skip all names except for a small number of them. If it wasn’t ordered, you’d have to read all the names one by one, and I don’t know how much it would take – probably months, or years.

A computer is obviously faster than you and me, but the principle is the same. Finding a set of rows by index normally takes much less than a second. Scanning a whole table may take a long time if the table is big.

I understand that this is counter-intuitive: looking at the optimised version of the query, you see that it has to run both a regular expression and a LIKE comparison. And the latter is redundant. So you’d probably think that the second version takes more time. But running RLIKE and LIKE against a small set of rows is much faster than running a regular expression on all the rows in a table.

To master more advanced optimisation techniques, consider our SQL optimisation training for teams.

Federico Razzoli

All content in this blog is distributed under the CreativeCommons Attribution-ShareAlike 4.0 International license. You can use it for your needs and even modify it, but please refer to Vettabase and the author of the original post. Read more about the terms and conditions: https://creativecommons.org/licenses/by-sa/4.0/

About Federico Razzoli

Federico Razzoli is a database professional, with a preference for open source databases, who has been working with DBMSs since year 2000. In the past 20+ years, he served in a number of companies as a DBA, Database Engineer, Database Consultant and Software Developer. In 2016, Federico summarized his extensive experience with MariaDB in the “Mastering MariaDB” book published by Packt. Being an experienced database events speaker, Federico speaks at professional conferences and meetups and conducts database trainings. He is also a supporter and advocate of open source software. As the Director of Vettabase, Federico does business worldwide but loves to do it from Scotland, where he lives. Follow Federico on his personal blog: Federico's Thoughts.

SQL Savepoints and When to Use Them

Jan 22, 2026

Not many developers know about savepoints in relational databases. Even less of them know when to use them. It's not their fault: I can't remember seeing a good explanation of this feature. Let's try to clarify this lesser-known functionality. In this article I'm...

Distributed, Multi-Database Transactions Involving MariaDB and PostgreSQL

Jan 19, 2026

In some situations, an application needs to run a single logical transaction that involves multiple database technologies: in our example, they'll be MariaDB and PostgreSQL. This is not an optimal scenario and I'm not recommending to design systems in this way. But...

MariaDB Underrated Features: Zero Dates and Partial Dates

Dec 30, 2025

How do you represent information like this in a database? This event happened in 2015/06, but we don't know in which day. This job is scheduled to happen on the first day of the month at 00:00:00, every month and every year. This never happened. There are many ways to...

Services



Email

Schedule Meeting

Phone

MariaDB: The “Regexp AND LIKE” optimisation

The regexp

LIKE to the rescue

Why is this an optimisation?

Recent Posts

SQL Savepoints and When to Use Them

Distributed, Multi-Database Transactions Involving MariaDB and PostgreSQL

MariaDB Underrated Features: Zero Dates and Partial Dates

Services

Database Automation

Database Training

Database Health Check

Monthly DBA Time

Database Upgrade

0 Comments

Submit a Comment Cancel reply

Email

Schedule Meeting

Phone

Quick Links

Recent Posts

SQL Savepoints and When to Use Them

Distributed, Multi-Database Transactions Involving MariaDB and PostgreSQL

MariaDB Underrated Features: Zero Dates and Partial Dates

Policies & Licenses

Follow Us on Social Media