Optimizing Server Performance, Part 1

April 1, 2008

Part 1 of 3: Part 1 | Part 2 | Part 3

I recently compiled a list of tips for people interested in easy ways to optimize server performance. The tips were directed at WordPress users, but many carry over to traditional web applications as well. After all, a blog is a specific kind of web site, so performance improvement tips for a blog will also improve the performance of a web site. That said, you will certainly get more value out of this book if you’re running WordPress and / or a MySQL database server. There are quite a few tips, so I’ve broken this post into multiple parts (yes, another one of those!).

workmen-1.jpgMy goal with this series of posts is to give you some fast and easy ways to prepare for traffic spikes. Of course, a crash due to a traffic spike is a good problem to have, but it would be better still if we could avoid it altogether. So to get you started quickly, try to follow these tips in order; i.e. start at the top of post one and work you way through each post. The tips near the top will give you the most bang for your buck, according to my determination of the trade-offs among tip priority, tip complexity, and time required for implementation. And keep tuning until you feel you’ve done enough to ensure that your server is able to handle the traffic. As we move lower down in this list, we’ll reach a point of diminishing returns. Each additional tweak could take more and more time for less and less improvement.

These posts contain the tips that I’ve picked up over the past several years. Some are from experimentation. Some were learned during consulting engagements. More are from administrators who have kindly shared their experiences with others. Many are from RTFM (“delving into the product documentation”). I’m not going to provide an overly deep explanation for each tip. You’re busy. You’ve got a site to administer.

I hope you find these tips useful.

Knowing When to Upgrade

Although it might be a hard decision to make, we must each make a determination about the viability of our current hardware and network. Not all servers, networks, or hosting accounts are created equally.

Server: your server must be physically able to handle high traffic loads. This means an adequate processor, plenty of RAM, and the often over-looked network card.

Network: hosts will have network limiters on their servers because they must parcel out their limited bandwidth to many servers at the data center.

Hosting account: your hosting account will have stated provisions for bandwidth. Larger hosts are usually much more generous with bandwidth. Some hosts will also make bursting provisions, which permit periodic spikes in traffic that are not maintained over long periods.

Upgrade Path

So if you’re using a shared hosting account (good ones are Lunar Pages and BlueHost) and your host is forgiving, you might need to be moved to a newer, more powerful server. You might also be forced to upgrade to a VPS plan. I am not a fan of entry-level VPS plans, so if you’re forced to go this route, don’t choose the least expensive plan and hope to maintain the response times you have with a shared account. A good VPS provider is Spry. Another option is to try moving to a host that uses grid service or cloud hosting, like Mosso or Media Temple.

If you’re using a VPS plan, you might need to consider upgrading to a more expensive plan that gives you a larger share of the processor and RAM. Or at least negotiate for an increased burst rate. If you’ve outgrown VPS entirely, you’ll need to move to a dedicated server. Good providers are iWeb and Liquid Web.

If you’re already using a dedicated server, you might need to look into upgrading your network card, adding RAM, using more and multi-core processors, or upgrading your hard drives. Or, you might need to add additional servers and configure a load-balancing solution. Or move MySQL to a server or servers on its / their own.

Hard decisions, especially when each step up the ladder involves increasing amounts of money.

Enough with this, let’s get started with the tips!

Install WP-Cache

WP-Cache is a WordPress plug-in that can have a dramatic impact on the performance of your blog. By dramatic, I mean anywhere from a 1,000 to 10,000 percent performance improvement, reducing response times by a few tenths of a second in many cases.

By default, all pages requested from WordPress are built dynamically; with WP-Cache, when a visitor requests a page or post, a stored, static version of the requested page is presented instead.

  1. Download the WP-Cache plug-in.
  2. Upload and extract it in your plug-ins folder.
  3. Activate it in your WordPress administrative panel.
  4. In Options | Reading, make sure that gzip compression is not selected.
  5. Open the Options | WP-Cache subtab, and it will attempt to configure itself.

Deactivate Plug-ins

It is possible to use too many plug-ins. Each plug-in requires additional server resources and processing time, and some require the use of additional software.

In all situations, a good practice is to only use plug-ins that you need to help you administer your blog, or that enhance the experience of your reader. In high-traffic situations, you need to be ruthless about trimming the fat.

One of WordPress’s nice features is being able to quickly activate and deactivate plug-ins. Go through your list and see if any plug-ins could be removed temporarily while you meet a spike in traffic.

Use Your Visitors’ Browser Caches

workmen-2.jpgFirst-time visitors to your page need to download all of the page components (HTML, CSS, JavaScript, Flash, images, etc.) before the page can be viewed, and this can require a fair amount of bandwidth.

But you can use the “Expires” or “Cache-Control” headers to make sure that their browsers cache those components. This won’t help with the first page request, but all additional page requests will avoid downloading those same resources again.

There are quite a few considerations to look at here, but many can be avoided if we just keep things simple. It won’t be optimal, but it will give you the most bang for your buck.

  1. Go to Presentation | Theme Editor in your WordPress administrative panel.
  2. Click the “Header” link on the right.
  3. At the very top (this must come before any other output), enter:
   ?php
       Header("Cache-Control: max-age=172800, must-revalidate");
       $strExpires = "Expires: " . gmdate("D, d M Y H:i:s", time() + 172800) . " GMT";
       Header($strExpires);
   ?>

You can also set an expiration date far into the future, but you run the risk that the user never sees the content that you’ve updated unless the file name changes. Setting expiration to two days (172,800 seconds) is a good tradeoff to get you past any traffic surges. Adjust as necessary.

Optimize your Database

The larger your database, the more advantageous this tip will be.

  1. Open phpMyAdmin
  2. Open your WordPress database (not information_schema)
  3. Perform a backup just in case
    1. Click the “Export” tab
    2. check the “Add DROP TABLE / DROP VIEW” checkbox;
    3. check the “Complete inserts” checkbox;
    4. check the “Save as File” checkbox;
    5. Click the “Go” button and download your backup
  4. Optimize your tables
    1. Click the “Structure” tab
    2. Click the “Check all” link below the table list to select all tables
    3. In the “With selected” drop-down box, select the Optimize tables option.

Serve your Images from Somewhere Else

This tip is often a tough sell, because it does require more time than the others to implement; however, its effect can be large.

Images eat a lot of your bandwidth, especially in blogs, and result in additional requests to your server. If you find that you use images in your posts, consider serving them from another server.

A user’s browser can only open two resources from one address at any one time. Offload your images, and your viewers’ browsers can open four simultaneous connections.
So your server is using less bandwidth, is being hit with fewer requests for resources, and your viewer’s browser is receiving twice as much content simultaneously. Not bad.
A service I would wholeheartedly recommend is Amazon S3; to a lesser extent, Steady Offload and Flickr.

Amazon opens up their server farms for several uses, one being their S3 storage service. You can access Amazon’s rock solid reliability for a very small price: $0.15 per GB per month for storage, and $0.20 per GB of bandwidth.

Assuming your average blog page contains 100 KB worth of images, you currently have 100 posts in your blog, and you post at a rate of 100 posts per year, your image storage cost is literally less than $0.01 per year. Your bandwidth costs would work out to $2 per 100,000 visits.

Steady Offload is an interesting service that mirrors your static content on their site. It’s effortless to set up, and you only pay for bandwidth you use. For high-traffic situations, I still would prefer Amazon’s reliability.

Flickr is a great option if you have a personal or non-commercial blog. They have restrictions against professional and corporate use, which can limit its usefulness for some. Also, it’s an image-sharing site, which might be a concern. On one hand, your images get more traffic, on the other hand, your images could be “involuntarily shared” with others.

I’d recommend that you stay away from ImageShack and PhotoBucket, although use of both is popular. ImageShack’s bandwidth per hour limit makes it useless during traffic spikes and both ImageShack and PhotoBucket are frequently blocked by corporate firewalls, and it’s never a good idea to alienate portions of your audience.

Don’t Touch your MyISAM Tables!

This is not a to-do tip, but rather a not-to-do tip. It had to be mentioned, as it’s a very common source of questions with folks who are just digging into MySQL. Many people contend that converting your MySQL MyISAM tables to InnoDB tables will have a large impact on performance, and this is true in certain situations.

InnoDB tables feature row-level locking, meaning that when a row in a table is being edited, only that row must be locked. In MyISAM tables, the entire table must be locked to edit a row.

In blogs, the only time you would require table locking is when content is being added, e.g. a comment or a blog post. As readers outnumber commenters and posters by an overwhelming majority, using InnoDB tables for a WordPress blog would actually hurt per-formance, not improve it.

If you were considering it, don’t do it!

That’s it for today. Check back in tomorrow for part 2 of this post.

Part 1 of 3: Part 1 | Part 2 | Part 3

Choosing a Blog Platform

March 24, 2008

Never has it been easier to setup and maintain a blog, and a large part of the reason is the blogging platform. The blogging platform includes the blogging software itself, along with the hardware and infrastructure required to power the software.

The two main categories of platforms are hosted (often referred to as “services”) and self-hosted (often refereed to as “standalone”).

Hosted Blogs (Blog Services)

Hosted platforms refer to services whereby a service provider grants you access to their hardware, network connection, and software, and you create a blogging account to use on that platform. This would be very similar to signing up for an account at a site like Facebook or MySpace. As far as you’re concerned, there’s nothing more than a web page where you enter and view your information. Everything that goes on behind the scenes is con-trolled by someone else, and you have no stake other than being a user of the service.

Within the hosted blog space, you have a few subcategories of service, as well. There are pure blog plays—your account is used to create a blog, and the end result that gets traffic is a blog and a blog only.

There are also social networking services that build blogging into their platforms. MySpace has a very rudimentary blogging system, though I don’t believe that it meets the accepted requirements of a blog. Then there are services that fall somewhere in between, like Xanga and Vox, which offer a mix of media and social networking.

Some pure play examples include:

Advantages of Hosted Platforms

  • Inexpensive: many are free; others (like TypePad) are offered with multiple price points
  • Simple to setup: just create an account and get to blogging
  • Page rank: search engines might give more favorable weightings to hosted blogs, but note that this, if it is true, would only hold true for low-traffic blogs
  • Automatic updates: you don’t have to worry about keep-ing hosted software updated and patched

Disadvantages of Hosted Platforms

  • Inflexible: there are fewer configuration options with hosted platforms. This often results in applying workarounds or “hacks,” which make moving off of the platform cumber-some.
  • Fewer design choices: you typically only get to select from a limited number of average designs, and you often can’t make updates to the standard designs
  • Generic URL: some hosted platforms will allow you to use your own URL, but the process can be complex; others don’t offer the option at all
  • Less control: you are at the mercy of the platform
  • No commercial sites: some platforms will not let you serve ads on their platforms

Self-hosted Blogs (Standalones)

Self-hosted platforms consist of a web hosting account you establish with a data center, and the blogging software you install on the data center’s server.

Some self-hosted solutions include:

Advantages of Self-hosted Blogs

  • Design flexibility: you have full control over design and lay-out
  • Great software: the best software exists on self-hosted plat-forms
  • Custom URL: custom URL’s are ubiquitous on self-hosted platforms
  • Adaptability: your blog can change as your needs change

Disadvantages of Self-hosted Blogs

  • Difficult setup: setup can sometimes be difficult
  • Cost: while the blogging software is free, the hosting platform will cost you money
  • Hosting issues: unless you choose a good host, you might have to suffer from downtime or poor support.
  • Upgrades: you will need to keep your software patched

Blogs are really specialized versions of content management systems (CMS), which are systems used to manage the computer files, images, media and content of a Web site. CMS’s generally support workflow, which is a process for moving content from creation, through approval, to publishing with automated routing and notifications. CMS’s usually support a large number of authors and editors (sometimes collaboratively) in a distributed environment.

Blogs are simplified versions of CMS’s, though they retain many of the same characteristics, and I do know of some people who use WordPress in place of a full-blown CMS.

Popular CMS’s include:

Which Platform is Best?

It’s a personal decision. The majority of my blogs are self-hosted. I prefer the flexibility and design options available to me in self-hosted software. Self-hosted solutions are also much more professional. Blog readers can tell a difference.

Often, whichever blog platform you start on, you finish on. As  your blog gets larger and more backlinks are created, you start to become handcuffed to the platform. This blog, for example, was hosted on Typepad until two weeks ago. Not because I preferred Typepad, but because it started as a quick and dirty personal blog that continued to grow. That’s one reason I always recommend that peopled start blogging on a flexible platform–you always have room to grow.

In February 2007, I conducted a usability study with several readers, placing the same content on two different blogs, one hosted, and one self-hosted. Both were setup to take full advantage of the underlying platform, meaning I made the blogs look and behave as best as I could with the tools I had available. Universally, the self-hosted blog was perceived to be higher value, and the content was considered to be more credible.

Note that this was a small study, and usability tests are not what I would consider scientific by nature, but the anecdotal evidence is worth considering.

For my money, I will always go self-hosted. Your mileage may vary. If you just want to try this blogging thing out, try one of the hosted services. But if plan to maintain a professional blog, I think you’ll find yourself upgrading quickly.

I’ve used six different blogging services, hosted and self-hosted, and I’ve dug around the API’s of several. WordPress is hands-down the winner.

  • WordPress has an extensive theme directory. Themes are used to change the look and feel of your blog. You don’t need to know how to design or code a thing.
  • Thousands of themes are available around the web, and if you want a premium theme, you can find quite a few of them. Premium themes are great for sites that you want to monetize, or where you want to convey a professional presence.
  • WordPress has an extensive plug-in directory. Plug-ins enable you to add interesting functionality to your blog. How about a map of your recent readers? Adding a contact form to your about page? Showcas-ing affiliate products? Fighting spam? Improving your search engine rankings? Mobilizing your site? If you can think it, a plug-in is probably around to do it.
  • WordPress offers complete administrative control over your blog. There are very few things that you can’t do with your WordPress blog if you’re self-hosting.
  • WordPress has excellent user management, using a role-based security model that supports multiple levels of access. For example, authors can write articles, editors can publish those articles after approval, adminis-trators can change the design of the blog.
  • WordPress fights spam. A comment spam plug-in installs with Word-Press by default. Say goodbye to that annoying Blog Blaster spam.
  • WordPress has an active and enthusiastic contributing developer community. This means that software updates are frequent, bugs are fixed regularly, the software is mature, and requested features are actually added to the product by people who care about and use the product. The WordPress community is the reason why WordPress is such a powerful and easy-to-use product. It is a real gem.
  • WordPress has an easy-to-use page editor. With pages, you can really blur the lines between what is a web site and what is a blog. Pages allow you to store static content (i.e. content that isn’t regularly updated). Pages are especially common for people who use WordPress as a content management system. For a lite version of WordPress-as-CMS, check out the Thirty Days to Blog Traffic site.
  • There are a large number of SEO (Search Engine Optimization) reasons for choosing WordPress, but a discussion of them is outside the scope of this book, so I’ll just say that WordPress has paid atten-tion to how to help search engines find your blog.
  • There are also plenty of SEO and traffic-related plug-ins available for WordPress.
  • It’s free! There are other free blogging packages out there, but none offer the same combination of power and simplicity that WordPress offers.

Here’s an interesting exercise: Google “moving to WordPress.” People are moving for a reason!

In a nutshell, if you’re blogging, you need to be using WordPress. It’s that simple. WordPress has built a fanatical community and support base for a reason: they’re the best. If you find yourself on another platform like Blogger or TypePad and you want to move to WordPress, I’ve detailed the migration process step-by-step.