About Care Plans - Victor Font Consulting Group, LLC

WordPress Database Modernization Blueprint

February 22, 2023 By Victor M. Font Jr.

Introduction

You may be asking yourself, "What is modernizing a WordPress database?", "Why would I modernize a WordPress database?", or "What benefit is there to modernizing my WordPress database? "

You're not alone. In fact, we never thought to even ask ourselves these questions until we encountered the error below filling up a client's php_error.log. Ultimately, this error means a WordPress database query failed to execute with a fatal error:

WordPress database error Illegal mix of collations (latin1_swedish_ci,IMPLICIT) and (utf8mb4_unicode_520_ci,COERCIBLE) for operation 'like' for query

Impromptu Survey

Before you continue reading, would you mind taking this 1-question impromptu anonymous survey?

Show Poll Results

If you viewed the survey, we choose the second option! Our minds flooded with questions:

What is an "Illegal mix of collations"?
Why does one of our WordPress tables have the latin1_swedish_ci collation? Isn't the WordPress default character set in wp-config "utf8"?
Why is the other table's collation utf8mb4_unicode_520_ci? Isn't the WordPress standard utf8_general_ci?
Why are the collations different? Doesn't WordPress control character set and collation in wp-config?

Whatever the answers are, we know our WordPress database just failed to execute a query and it doesn't appear to be the plugin's fault. It's a problem with the WordPress database itself and the core WordPress tables. This error deserves a "priority one" rapid response, but how do we start to understand what the problem really means?

Unless a web developer has a smidgen of database administrator experience with MySQL and MariaDB, and the fearlessness to match that's coupled with an understanding of character sets and collations, the answers to these questions may be foreign and a little self-education is time well spent.

Perhaps we should start by researching "What is a collation?", "What is a character set?", "What do they do?", "How do they relate to each other", and "How do I fix this fatal WordPress database error?".

Since the utf8mb4_unicode_520_ci is involved in this error, for clues to this conundrum let's begin with the Unicode Consortium and see what they have to say about the topics.

Unicode Consortium

A character set is a set of characters while a collation is the rules for comparing and sorting a particular character set.
https://mariadb.com/kb/en/character-set-and-collation-overview/

Portions of this text are copied from https://home.unicode.org/about-unicode/ and attributed to the Unicode Consortium. This content is presented as training material under the Fair Use doctrine as permitted by Section 107 of the Copyright Act.

Unicode establishes the foundational layers that make it possible to design code that handles the requirements of all languages and regions at the same time, while minimizing the need for lower-level details and idiosyncrasies to interfere with that design.

The Unicode Consortium is the premier standards organization for internationalization of software and services, including the encoding of text for all modern computing systems. The Unicode Consortium began as the standards body for character encoding and derives its name from three main goals:

universal (addressing the needs of world languages)
uniform (fixed-width codes for efficient access), and
unique (bit sequence has only one interpretation into character codes)

Since that time, it has expanded to be far more than character encoding. Its work now includes the character properties and algorithms, language and locale data for internationalization, and production software libraries to make everything accessible to programs.

Character Encoding Explained

The Unicode Consortium explains character encoding this way:

In general, computers just deal with numbers. They store letters and other characters by assigning a number for each one. Before the Unicode standard was developed, there were many different systems, called character encodings, for assigning these numbers. These earlier character encodings were limited and did not cover characters for all the worldâs languages. Even for a single language like English, no single encoding covered all the letters, punctuation, and technical symbols in common use. Pictographic languages, such as Japanese, were a challenge to support with these earlier encoding standards.

Early character encodings also conflicted with one another. That is, two encodings could use the same number for two different characters, or use different numbers for the same character. Any given computer might have to support many different encodings. However, when data is passed between computers and different encodings it increased the risk of data corruption or errors

Character encodings existed for a handful of "large" languages. But many languages lacked character support altogether.

Investigation

Investigating this issue starts with an examination of the database structure. This is a PHPMyAdmin screen capture image of the database structure that produces the "Illegal mix of collations" error.

PHPMyAdmin screen capture of WordPress database structure as viewed in PHPMyAdmin showing differences in storage engines and database collations

If your database looks anything like the above image showing a variety of storage engines and database collations, chances are you have a very old WordPress installation. The WordPress database has evolved quite a bit since its 1.0 release in September 2007. The evolution reinforces the global nature of internet content where languages and character sets differ greatly and database storage must be friendly to it all.

In this database, we see both MyISAM and InnoDB storage engines and three database collations: latin1_swedish_ci, utf8_general_ci, and utf8mb4_unicode_520_ci.

We also found different collation values at the field level in the wp_posts structure. This is the root cause of the illegal mix of collations error. Which database collation and storage engine are the current WordPress standards?

Another anomaly we found while inspecting the wp_post table structure

Finding documentation about the current default character set and collation was anything but easy, so we decided the best way to find out what WordPress expects is to create a new database with the WordPress built-in. A brand new WordPress installation in MySQL Version 5.7.39 creates the following database structure:

PHPMyAdmin screen capture of newly created WordPress database in pristine condition

In a newly created pristine database, the storage engine for all 12 WordPress tables is InnoDB and the database collation is utf8mb4_unicode_520_ci. InnoDB is is the default MySQL database engine since the release of Version 5.5, which changed the default storage engine from MyISAM.

If you're using MariaDB, the default storage engine changed to InnoDB from XtraDB in the version 10.2 release.

If your hosting plan provides you access to your site's database configuration file, you can change the default storage engine in MySQL versions below 5.5 and MariaDB version 10.1 and earlier to the currently preferred storage engine, InnoDB.

The character set and collation values are determined by the determine_charset() function in wp-includes/class-wpdb.php. The instantiated global $wpdb object controls WordPress database interactions. Since utf8mb4_unicode_520_ci is the default collation as defined in WordPress, the WordPress defined character set is utf8mb4.

Further complicating matters is the fact that text, longtext, and varchar fields have their own collation setting that must match the table collation as shown in this PHPMyAdmin screen capture of the wp_posts table:

PHPMyAdmin screen capture of wp_posts table structure showing field level collations

When all tables and fields are synchronized with the correct storage engine and collation, database performance improves and you will not receive the fatal "Illegal mix of collations" error.

The WordPress database structure is documented in this entity relationship diagram that was last updated for WordPress Version 4.4.2: It's out of date, but the only one available on WordPress.org:

WordPress Version 4.4.2 database entity relationship diagram

ERD Errata: The post_password field in the current wp_posts table is varchar(255). The ERD shows this field's old definition of varchar(20).

A Little WordPress Database History

WordPress Version 4.2 was released on April 23, 2015. Named "Powell" in honor of jazz pianist Bud Powell, Version 4.2 changed the world of WordPress databases. The release notes say it all:

Database character encoding has changed from utf8 to utf8mb4, which adds support for a whole range of new 4-byte characters.
https://wordpress.org/news/2015/04/powell/

Prior to release 4.2, depending on the default collation was utf8. If you download the latest version of WordPress today, the wp-config-sample still defines DB_CHARSET as 'utf8':

/** Database charset to use in creating database tables. */
define( 'DB_CHARSET', 'utf8' );
/** The database collate type. Don't change this if in doubt. */
define( 'DB_COLLATE', '' );

Even though DB_CHARSET is defined in wp-config as 'utf8', when we created our new pristine environment, the database was built as utf8mb4 and utf8mb4_unicode_520_ci, the defaults established in WordPress 4.2. We learned that WordPress overwrites the wp-config setting with this code from the WPDB class:

If you're not sure what this function does, let us explain. This function in the WPDB class automatically changes the character set from "utf8" to "utf8mb4". If your WordPress database supports utf8mb4_unicode_520_ci collation, then WordPress automatically assigns it as the db collation value, otherwise utf8mb4_unicode_ci is used instead; "utf8" and "utf8_general_ci" are the fallback character set and collation for older databases that don't support "utf8mb4" character sets and collations.

A Little MySQL History

Before WordPress starting using UTF-8 as the default character encoding, early databases deferred character encoding and collation to the MySQL defaults that are latin1 and latin1_swedish_ci.

Modernization Explained

Database character sets and collation are complicated. If you want to learn more in depth about Unicode, please visit the Unicode Consortium site. If you're up for an interesting opportunity, you may even adopt your own Unicode character. There are more than 136,000 characters that can be adopted by you or your organization for as little as $100 USD.

Incorrectly converting a database can lead to data corruption and loss. Before even considering anything else, backup your database so you can easily restore everything if things go awry.

Online documentation for changing the database collation is scant and sketchy at best. As an organization, Victor Font Consulting Group, LLC. knows how to research and we looked everywhere that came to mind no matter how much of a long shot it might have been. We found tons (metaphorically, not literally) of articles and recommendations about WordPress database optimization and learned nothing new that we don't already provide in our Care Plan Subscriptions.

As far as the primary "Illegal mix of collations" error, we found some information on WordPress.org that also had some conversion scripts. The question had been asked on Stack Overflow a few times and none of the answers were correct either. We also found references in several ISP/Host knowledge bases that proved more helpful than not.

Another interesting find is an old WordPress plugin in the repository named Database Collation Fix that sets the database back to utf8 from utf8mb4 if migrating your system to an older MySQL version.

What we could not find was the one resource that confirmed that "utf8mb4" is the WordPress default global character set and utf8mb4_unicide_520_ci should be the default database and field collation until we came across the Version 4.2 release notes.

Every suggested solution we found, even the crowd sourced scripts available from WordPress.org, failed with invalid character errors and the site would not come back up properly.

To further complicate matters, this client's database is so old that when we compared old and new table structures side-by-side, some of the varchar lengths were significantly shorter in the old tables than the current WordPress database schema defaults.

Why the WordPress update process didn't adjust these out of spec field lengths is beyond knowing. Something failed somewhere, but whatever was missed during WordPress updates is irrelevant at this point in the troubleshooting and repair process. With all other approaches failing, the only one that worked and produced the expected results without error is an in situ database reconstruction.

An "in situ database reconstruction" is usually something that is only recommended when all else fails. It is the plan of last resort but may be the only thing that can get the mishmash that an aged WordPress database can grow into fully operational again. It'll breathe new life into a WordPress system because the database will be more performant with large datasets and ensure ongoing consonance until the Unicode Consortium decides to modify the rules again.

About Plugins

The Process

Care Plan Essentials

March 28, 2019 By Victor M. Font Jr.

The right tools for the job

Upkeep

Dedicated support for your site includes backups, software updates, security and performance scans, spam and revision clean up, database optimization, and detailed monthly site upkeep report.

Backups

We'll make sure an up-to-date backup is always ready. Backups are stored on servers located in the United States, but if EU Data Protection compliance is required, we've got you covered, too. Let us know and we'll store your backups on servers in the EU.

Backups run daily and are kept for 90 days. If you cancel your plan with us, backups are stored for 7 days after cancellation before permanent deletion. If you want backups stored for a longer period or post cancellation, optional 3rd party storage (e.g., Dropbox, Google Drive, Amazon S3, OneDrive) is required.

Executive Dashboard & Reports

At the beginning of each month, except for the first subscription month, we'll send you a report detailing the upkeep, analytics, and keyword search activity that we performed on your site in the previous month.

Keyword search reporting is available if you've subscribed to our SEO Ranking add-on. Reports can optionally be scheduled to be sent weekly or bi-weekly. Click here to view an example of our monthly maintenance report.

The Safeguard plan includes a Basic Client Dashboard that displays 5 key metrics at the click of a button.

Our Nurture and Thrive care plans include an Executive Dashboard tailored to your business needs. Anytime you want to see how your online business is thriving, your most important metrics are just 1-click away. See example here: Executive Dashboard

Support

Your dedicated support time can be used to address a majority of your needs, such as: bug fixes, performance improvements, image optimization, content updates, design updates, newsletter curation, etc.

If you are on the Safeguard plan, you may use your 45-minute support time for a consulting call.

Training

Professional Training Videos: Unlimited access to step by step video tutorials on managing your content.

Monthly Newsletter: Our monthly newsletter includes targeted articles specific to website growth and digital marketing.

Webinars: Coming soon! Live and recorded video workshops where we explore topics such as email marketing, search engine optimization, blogging, lead capture, and more.

Consulting

Consulting Calls (included in the Advanced plan): Sessions designed to get hands-on with your website growth strategy, live editing, or any other type of training needs. We review website performance, talk about your business goals, and lay out the path to help you use your website to its fullest potential.

Letâs observe a few facts

March 26, 2019 By Victor M. Font Jr.

John is your hardest working employee.
John labors 24/7 every day of the year, never asks for time off, gets sick, or complains that heâs tired. Even You have to sleep once in a while!
John's work/life balance is all work and no play, and he's happy about it.
John continually generates leads, drives conversions, and produces revenue (or at least he should).
John is a primary channel through which customers, suppliers, potential new staff, and partners can find and engage with you.
When John does his job, he is a profit center, not a cost center; he earns you more than you spend on customer acquisition.
John is indispensable to your on-going success and growth, and it's critical that he remains available to your audience without disruption.
John is your website.

As a savvy and successful business owner, how far would you go to retain this valuable and key asset?

High potential employees (Hi-Pos) see quality as a priority in the workplace. They stand out due to their associative thinking skillsâwhich help solve problems and drive innovation. They focus on doing a good job to satisfy clients and customers. They concentrate on improving their skills and take initiative in decision making.

As a leader, you provide direction, clarity, and resources; you do your best to understand what drives your Hi-Pos and appeal to their motives. In other words, you nurture your Superstars. You develop strategies and focus on the traits that can help them flourish to continue to produce for you and thrive. You invest in their success.

Your website is a high performing employee, or at least has the potential to become one (remember John?). It requires just as much nurturing and care as its living, breathing Superstar counterparts, if not more. You've already made an investment to bring it to life, but once it's on the job, it needs nurturing to continue to produce for you. It isn't a set it and forget it proposition. It's the public persona of you and your company. It's what attracts customers to your business. In order to thrive, it has to remain healthy, vibrant, and relevant; you have to nurture it. This all takes work and yes, continued investment for its success.

See our Care Plans page to learn more.

Why Consider a Website Care Plan?

June 27, 2017 By Victor M. Font Jr.

Skip an Oil Change?

Successful websites continuously attract the right type of visitors and lead them down a path toward becoming a customer. This requires consistent content publishing and monitoring of the website to make sure it is achieving its objectives. It also necessitates routine upkeep to assure that your website operates at peak performance and maximum efficiency.

We maintain the homes we live in and the vehicles we drive. You change the filters in your HVAC units every three months and the oil in your car every 5,000 miles. Your smart connected thermostat and car both receive regular software updates to keep them running smoothly and effectively. You wouldn't even consider letting these routine maintenance tasks go for any length of time because you know the cost of fixing these essential tools can be budget-breaking. You also embrace the fact that preventive maintenance saves time and money over the long term. What's that old adage? "An ounce of prevention is worth a pound of cure." Having a few safeguards in place before a fire starts is preferable to a lot of fixing up afterward.

How Big is BIG?

Nowadays, the odds are about even that your website runs on one of the popular content management systems (CMS). Content management systems are web-based applications for creating and managing the content of a website. W3Techs is dedicated to collecting survey data to provide information about the usage of various types of technologies on the web. Their CMS statistics focus on the top 10 million websites in the world. The following table shows that WordPress is the most popular CMS in the world. As of this writing, it is used on 28.3% of the top 10 million sites, which W3Techs extrapolates to a 59.2% world-wide marketshare. That's BIG!

Year over year, WordPress has experienced explosive growth while the use of Drupal and Joomla have remained fairly consistent.

Myth Busted: "Set It and Forget It"

Who doesn't remember inventor Ron Popeil and his famous infomercials where he immortalized the phrase "Set It and Forget It" as he marketed his creation, the Ronco Showtime Rotisserie. The Ronco website still proclaims it to be the "#1 Selling Rotisserie in the World!" Well, maybe set it and forget is true for a kitchen appliance, but it is not true for WordPress, the statistically proven #1 CMS in the world. Set It and Forget It for a WordPress powered website is a myth busted to such a high degree that it would make the original MythBusters proud!

The WordPress developers maintain a regular release schedule. Major releases are sent out every 6 months. Minor releases are sent out much more frequently, especially when a new bug or security vulnerability is discovered that requires immediate fixing. Bug fixes and security vulnerabilities always leave your site open to compromise. You also have to consider the plugins running on your site. With more than 50,000 plugins in the WordPress plugin repository, not to mention the thousands that are available through 3rd party sources, any number of these are probably providing at least some functionality on your site. Bug fixes and security vulnerabilities in plugins could mean lots of updates required on a regular basis.

Your site's performance, load time, downtime, and uptime, are dependent on WordPress and its plugins functioning as they should in their latest versions. The site's growth, fresh content, analytics monitoring, and design changes, are dependent on making sure all site updates are done smoothly and error-free.

If you don't keep your website up-to-date, it's very likely costing you a lot more than you realize. Potential customers don't want to sit around waiting for websites to load. If someone visits your website and finds something wrong with it, they'll be out of there in a heartbeat; and you may not even be aware that your site is having a problem. How often do you check your site to make sure it's still working? If you are unaware of issues or downtime, you are losing potential customers.

Options: DIY or Professional?

If you're a fan of DIY projects, you've no doubt tackled some of the challenges of maintaining your home or car on your own. At sometime though, you may have started out doing one of those projects and then realized, quite quickly, that you were over your head and needed help from a professional.

Maintaining a website on your own can save you a little bit of money in the short term, but can end up costing you more than ever you imagined in the long term. Inexperience or neglecting critical updates can damage the look and functionality of your site. It can leave your site open to hacking attempts or code insertions that can make your site a carrier for viruses or malware. Your site can even be banned from appearing in search engine results.

If you just updated your site from the WordPress admin area, you have no record of which updates actually took place. Did you make a backup before the update? Plugin and theme updates are known to go bad at times, bringing a site to a screeching halt.

What happens if a problem happens a couple of days or even weeks after an update? Do you have any way of discovering which update caused the problem and how to fix it? At best, it would be difficult to know what changed and how to recover your site without restoring a very old backup. You could possibly end up losing days or weeks of content changes, business records, sales orders, customer contact details, payment information, and the like. Just name it and you can lose it.

Nobody knows your business better than you. Wouldn't it be nice to only deal with what you know best and let the professionals handle what we know best? Would you ever go to court and serve as your own attorney? Of course, you wouldn't! So why try to maintain your own website if youâre not a web professional? When you use an experienced web professional to properly update and maintain your site, you are saving potential lost sales, capturing new visitors, and gaining a competitive edge.

Why Us?

At Victor Font Consulting Group, LLC, we go to great lengths to care for your site as though it were our very own. Before every update, we first backup the site. After the updates are applied, we check the site for any issues. We provide 24/7/365 uptime monitoring and know within seconds whether your site is suffering from any issues. Our website care software keeps a record of updated plugins so if there is ever an issue, we can tell what plugin version was changed and revert back. Our monthly maintenance report includes performance and security scan details. If in the event we ever catch a plugin issue, we address it before you ever know thereâs a problem.

If youâre interested in securing a website care contract for your site, contact us to learn how we can keep your site in proper working order. In the meantime, you please take the time to review our care plan options. It's worth the read.