Archive for January, 2010

Search Results on Your Website, Try The Feed

Saturday, January 30th, 2010

The feed I promised you last time is now ready and you can now get our search results directly on your own website, allowing you to make your own search engine or use the results as additional content on your pages.

You can test it by simply including this URL in you website as an iframe or directly form PHP or some other scripting language.

http://api.secretsearchenginelabs.com/html-feed.php?q=obama&start=2&num=3

In this example q is the query “obama”, start is 2 meaning we skip the first result and start from result 2 (not page 2), num is 3 meaning we show only three results, namely the results number 2, 3 and 4

A normal page one query for “hosting” would be

http://api.secretsearchenginelabs.com/html-feed.php?q=hosting&start=1&num=10

The results are plain HTML with CSS calsses that allows you to customize the look and feel of the results.

There’s a link back to Secret Search Engine Labs at the bottom of the results which you are required to display.

Please tell me you experience with this, as I’d be happy to improve on the interface if you find some issues!

Happy searching!

Simon
CEO and founder,
Secret Search Engine Labs

Big Server Upgrade Lets Us Increase Index Size and Paves Way for New Algorithms and Public API

Friday, January 22nd, 2010

Yesterday I completed the promised server upgrades, upgrading the main server from a Linode 720 to a Linode 1080 and adding a new Linode 360 server to host the API (I’ll tell you more about that later) and for doing special data processing that is outside the normal indexing cycle.

So what does this mean for you?

The main thing you’ll notice is that the index will get bigger. I can’t tell you how much though as this is impossible to predict, but I really hope we will be able to reach 300,000 pages with this setup.

There’s two things that will allow the undex to grow:

1. With more RAM, 1080MB instead of 720MB the updating of the index database will be quicket letting the robot index more pages in a month.

2. As there is less VPS nodes sharing the same server there should be more disk and CPU cycles available to Us. This is not cut in stone though as it depends a lot on what usage profile the other nodes have and I don’t know that yet.

The index will grow, great. What about that other server then?

The other server, the Linode 360, will be used to host the new API/Feed that I will announce soon. I will make a feed of the search results available for free letting you make your own search engine, use it as additional content for your directory or anywhere that you want to give your visitors some relevant websites to visit. But more about this later.

The other mission for the new server will be to do calculations of data sets that will support the main indexing.

Mission one will be to make a related keywords database to allow us to find sites about “New York City” when someone writes “NYC” or “Win XP ” when someone writes “Windows XP”.

I know I promised you an index of 300,000 pages the last time we upgraded servers but things changed and I implemented a few new algorithms that improv quality but slows down the indexing process.

The main slowdown this year has been the addition of the search cache, a separate database arranged to make search queries ultra fast. You can now expect to get your results in under a second, sometimes in two, while before the cache searches was anywhere between 4 and 40 seconds as they had to dig through the main index.

Now lets just wait and see how much the index grows.

Simon Byholm
CEO and founder,
Secret Search Engine Labs

P.S. If you order a Linode through the links in this post Linod will give us $20 in free server time.

PRESS RELEASE! We Are Joining The Independent Search Engine And Directory Network (ISEDN.org)

Wednesday, January 20th, 2010

I have the pleasure to announce that late yesterday the final steps was taken to integrate the advertising feeds from ISEDN.org into the search results pages. (It’s the green text ads to the far right)

On December 26th (between eating leftovers from Christmas) I signed the partner agreement with Jayde Online Inc to join the Independent Search Engine and Directory Network (ISEDN.org) also known as the ExactSeek featured listing program.

The feed is now live (for an example see the search for webmaster) and you can get a featured listing for your website targeting a specific keyword. Your ad will show up on Secret Search Engine Labs and on over 375 other search engines in the ISEDN network.  See the advertising page for details.

Joinig the ISEDN will allow us to increase revenue from the search result pages which in turn mean we will be able to get bigger and better servers to increase the index size and improve the search results.

The press release about this just went live on PRWeb and I’d like to say welcome to all new visitors, please try the search results and tell me what you think by leaving a comment below.

Simon
CEO and founder,
SecretSearchEngineLabs.com

Monster Crawler – Robot Spider Combines Google, Yahoo, Bing and Ask

Wednesday, January 13th, 2010

I found a new (actually old but I didn’t know about it before) search engine by some graduates from Southern Illinois University. Founded in 1999 it has had a recent facelift as looks nice providing meta search results combined from Google, Yahoo, Ask and Bing.

They include “Did you mena…” type suggestions and related searches and I like the clean and professional looking interface.

See monster crawler in action here. It’s now added to our big list of search engines.

Beta Testing Is Open, Did You See The Press Release Last Week?

Tuesday, January 12th, 2010

I forgot to blog about it but in case you didn’t notice we started the official beta testing period last week and we even had a press release distributed through PRWeb.

Thanks to the press release we had about 400 extra visitors on last Thursday and Friday,  most spending more than the average number of click on site.

The number if visitors isn’t exactly earth shattering but nevertheless it was great to have you all here and I hope many of you will be back for more searches.

The Beta testing period is planned to run all year and end in a public launch in January 2011 with the target of having over a million relevant pages indexed by then.

I also want to take the opportunity to apologize if you have stumbled upon some less than relevant results. We had a bit of a Cuil experience when in anticipation of a stampade of visitors from the press release we added a new cache database to speed up searches.

Yes the searches are now about 30 times faster with most searches ready in less than half a second.

The downside is it will take a whole month before the cache is fully populated with data and in the meantime the search results are based only on a subset of the whole index. For some search terms this means junk and garbage from the bottom of the barrel will show up.

Please enjoy the new super quick search results and check out the ping tool in the webmaster tools section.

What is CashRank? For Your Site?

Tuesday, January 12th, 2010

It’s amazing how much junk there is on the Internet. You let the spider loose and it just wanders so deep into a site that you never find it again.

I first made  a system with levels to prevent the spider to go more than x link jumps (3 or 4) from the domain homepage of  a site. This wasn’t enough though.

As I have a really small index I don’t want that many pages from every domain but as many different domains as possible, so I invented something I call CashRank which is a somewhat controversial way to rank pages.

Each page gets a dollar value indicating how much money has to be paid annually to keep that page online. This is domain registration fees, hosting fees, cost of IP address, advertising costs etz.

A page in only included in the index if it’s worth at least $1/year, this effectively limits auto generated content that has no value, because no one is paying for it’s upkeep.

The CashRank is also propagated through links, a page keeps $1 for it’s own
upkeep and then sends $1 through every link until it has used up all of it’s cash.

This means pages with many inbound links can have more pages in the index and only the first links on a page are counted.

Please give me your opinion on CashRank by leaving a comment below.