May 19, 2008

New Database: Topographical Dictionary of England 1831

I’ve added another database to the Databases for Sale site, it is the Topographical Dictionary of England 1831, by Samuel Lewis. Each of the 15,756 places mentioned includes a description, and is localized, usually giving the county, parish, and hundred or wapentake within which it is located.

This has some great historical information on English localities. It often mentions who the local ’squire’ was at that time (1831), and describes the condition of antiquities like ancient castles and prehistoric features. You can find out how much the local Church of England parson was paid, and just where the money came from. Historical events, such as battles or the construction of old monasteries and chapels are also noted. Some descriptions include information on the origin of the place name, and notes about the forms of local government, schedule of fairs and markets, and details on the primary industries connected with the locality.

As with our other databases, this one is 1/2 price for the first 30 days — get it now while it’s hot. This content it not available in its entirety anywhere on-line (so far as I know — exact match string searches do not get any ‘hits’), but of course you can expect it start appearing soon — get in fast to get indexed as an original source!

April 23, 2008

Free Databases

OK, you asked for it. Well, no you really didn’t — more like you failed to ask for anything. So I’m forced to up the stakes. I’m not getting enough sign-ups for my newsletter, so for the next two months (May 2008 /June 2008) I’m giving away a free database each month for all subscribers.

Now of course these are not going to be my premium database products, I need to sell those … these are databases that you can find online if you search deep enough and long enough. But you won’t need to search at all — by subscribing I’ll give you a direct link that you can use, no charges, no further up-sell, nothing except you need to subscribe to my newsletter, which will tell you of new premium databases as they become available.

Our first free database will be the first-names database, over 28,000 first names, their origins and meaning. I know, lots of you may already have this — but I also know only a small minority have noticed that over 8000 names have a tab character preceding and following the name entry in the original database. I have deleted those from our version of course, because they prevent ‘exact match’ queries, which is what most people would use for a first-name (or baby-name) query.

I looked at some of the names that had the tabs, and searched them on Google. Results returned were as low as 300-and-some matches. Names that didn’t have the tabs always return about 500,000 matches. Obviously, most people with baby-name or first-name websites have never noticed this problem with the database. You can have a database without that problem simply by subscribing to my newsletter (put your email in the box at upper right). If you have an existing baby-name or first-name meanings site with fewer than 28,000 names returned on query, I’d suggest you replace your database with this one.

If you don’t have a first-name or baby-name site, I don’t suggest you start one with this database, though you could. Much better, would be to use this database to enhance a site that focuses on other information. Maybe a biographical site, with each first-name linked to a pop-up that gives the meaning and origin for those names. Use your imagination.

No publicly available database should be the basis for an entire site! Use commercial databases to enhance sites you build around your own private database, to create ‘value added’ sites that will attract loyal visitors.

April 15, 2008

Mexico Place Names and Postal Codes

Added two new databases to our Web Content Center, one is a list of towns, cities and communities with their postal codes, the other is a sub-set of that with just the names of localities, the civil administration municipality and state they are found in. The postal code database has 44,947 records, while the locality database has 41,417 records. There are fewer locality records because some of the postal code records are for Universities, government organizations, etc.

The postal code database is $48 — (discounted to $24 for the first 30 days), and the locality sub-set is just $18 (or $9 at discount for the first 30 days). I have seen only one other source for the Mexican postal codes, and somehow they have 14,000 more records than I do, but I can not explain why. So far as I can tell, my database is complete — if anyone knows where they came up with those extra records, let me know! They also include longitude/latitude with their data, which is not available with mine. But then, they charge $350 for their database — more than seven times my base price, or 14 times the discount price.

I have zip code and locality information for the U.S. as well, and will add those databases soon. I also have some genealogical and biographical data that will be added. Put your email address in the yellow box at upper left to get notifications when new databases become available — remember, get them in the first 30 days for half-price!

March 12, 2008

Database Sale

Well this is my 48th post on this blog, and I bet you’ve been wondering when I would begin to monetize things. There were 47 posts without a single ad, affiliate link, or promotion … surely you knew the other shoe had to drop! I’ve made it clear that I’m in this business to make money … like you. I think my best bet (for this blog at least) is to help you make money by enriching your website content. So yes, I’m selling data.

I have lots of ideas and data already available, and plans to acquire more. Today, however, I added my first database to my Web Content site: a list of  13,000 news headlines from the last quarter of 2007. That database is just $7 for the first 30 days, half the regular price. I have plans to automate the sale and delivery using PayPal’s IPN, but have not yet actually written the code, so if you buy in the next few days the sale will be hand-processed and may take a while — that will be corrected as soon as I get the time to write code to handle it.

As with any database, I implore you not to use this information in bulk without either selection or modification. That is a sure road to duplicate content problems. You are not the only one with this data, so use it creatively. Select parts of the database, using keyword searches or other criteria, and use only those fields that will benefit the SEO characteristics of your site.

This blog is about Web Empire’s of course, so I assume you have, or will have, more sites. Use the same database in different ways on different sites. Select different data, use different fields, randomize the order — or sort on different values — to achieve unique results. Follow the link to the site, above, for an example of just one potential usage for some of this data.

December 7, 2007

Content in Databases

In order to have an SEO friendly site, and follow the modular design concepts I recommend, it is easiest to build your sites with content you have put into database format. Databases provide ease of searching, sorting and use, plus provide for convenient re-usability and more flexible updating.

If at all practical, put all of your content into a database. Use as many different fields and tables as possible when constructing the database, so that you have the option of using parts of the information in a different manner on another site. This makes the design of the site slightly more complex, but it is more than compensated for by the benefits we describe here.

For example, suppose you are building a site on how to make great pizza. You have forty or fifty pizza recipes. You could create a table called recipes with three fields: recipe number, title, text.

But recipes typically consist of two parts, the list of ingredients, and the procedural description. It is better to put those in two separate fields, in case you want to use them separately in the future. The description also usually includes a phrase such as ’serves 8′ or ‘8 servings’. You might want to put that in a separate field as well, where you simply save the number 8 in a field called servings.

Then, at some future date, you decide to put up a site on dieting. With a simple script, you can convert your list of ingredients and number of servings to show the number of calories per serving for these pizzas, which you can easily add to your dieting site.

If your network is about food, you might want to break it down even further and put each ingredient in a separate table, each list of ingredients (actually just references to the first table) plus quantities in a second table, then names and descriptions in a third with a counter field that gives the second table a reference for its list of ingredients. Then you will be able to allow users to search for recipes by ingredient as well as name.

This is an example of linked tables within a database — a very efficient and practical form of data storage. Table one (Ingredients) has just two fields, reference #, ingredient name. Table two [Ingredient-List] has three fields, recipe # [from table 3], ingredient # [from table 1] and quantity. Table 3 [Recipes] has four fields recipe #, name, description and servings.

Later you decide to get fancy, and add a pop-up description for each ingredient when the mouse hovers over it in the list of ingredients for a recipe. It is easily done by adding a ‘description’ field to the Ingredients table. Had you put recipes in a single table, that change would not be so easy.

When you build a Web Empire with inter-related topics, you never know when one part of a database might be useful on another site, or when you might want to add functionality to some part of the data, so split the data as fine as is practical.

Database information is always useful. Remember, you can use multiple databases (stored in different tables or retrieved dynamically from another of your sites). Your main content may be unique to the site, while another site can be cannibalized for data by extracting part of its database just to fill a side-bar or be included inside other content just as you would a photograph. By randomizing the retrieval of some of the database content, you can provide dynamically changing content within your page, which makes it more ‘fresh’ too, for both your readers and search engine spiders. In our next post we will show you how to control the update frequency of randomized elements using PHP.