Google AdSense rejection and cross-domain duplicate content experience

Since the rejection replies you get from Google regarding applications to their AdSense program can be quite vague and even cryptic, I thought I would share my experience. First things first, you are not going to get to talk to Google directly — you have to use the forums. (Here is my thread.)

My rejection was simply:

Hello Jeremy,

Thank you for your interest in Google AdSense. Unfortunately, after
reviewing your application, we’re unable to accept you into Google AdSense
at this time.

We did not approve your application for the reasons listed below.

Issues:

- Unacceptable site content

Looking around their restrictions, I really could not find a solution, as I couldn’t find any rule that I explicitly violated.  I posted a request for insight on the forums, and a very insightful user (wasaweb) pointed out my extensive duplicate content on http://jeremytharp.com/ and http://www.coffeecuphalfmoons.com/.  And it’s true — I use my blog’s rss to feed much of the content on my music site.

Since I want to keep the sites separate, I saw no reasonable solution, but then I discovered that recently Google began allowing cross-domain canonical URLs.  Eureka!  I simply set the canonical URLs on the pages on jeremytharp.com that fed from coffeecuphalfmoons.com to the appropriate page that fed the same content on coffeecuphalfmoons.com.  When I resubmitted the AdSense application, it was approved!

Keep in mind that there are a LOT of reasons you might get rejected from AdSense, but given that this was very simply successful, I thought I would share.

I also noticed an immediate reorganization of my site:jeremytharp.com results on Google (for the better).  I’m hoping it has some positive effects on PR on both pages as well.

WORDPRESS HOW-TO: Ignore/Hide A Category On Your Home Page

Sometimes you don’t want a particular category to appear on your WordPress home page.  For me, I create weekly Twitter digests, and I’d rather they not be part of the page welcome.

The process of hiding them actually took me a little bit to figure out, so I thought I’d share.  It’s rather simple, so don’t blink!  :)

Simply edit “index.php” in the wp-content/themes/<your_theme_here>/ directory.

Between the lines:

<?php if (have_posts()) : ?>

and

<?php while (have_posts()) : the post(); ?>

Insert the line:

<?php if (is_home()) { query_posts("cat=-XXX"); } ?>

Be sure to replace “XXX” with the unique numeric ID of the category you want to ignore.  To find the ID, log into your admin, navigate to your categories, and select the one you want to ignore.  The number will then appear in the URL.

NOTE: Make sure you don’t delete the “-” before the “XXX” or else you will ONLY show entries from the category.

If you update your theme, you will probably have to repeat this procedure.

If you want to hide multiple categories simply append the other categories to the string with a comma: (Example: “cat=-12,-82,-4″)

It also follows, then, that this technique can also be used to hide tags, posts, etc.

Verizon Wireless USB760 USB Modem Ubuntu Howto

This is a repost of information located at: Verizon Wireless USB760 on the Ubuntu Forums.

I am putting it here because I found it very useful, and wanted people to be able to find it.  If you anyone takes issue with my publishing it here, please contact me.

At any rate, the Verizon USB760 does not work right away, so you have to modify two files.  This solution was posted by Databit. I added a few notes for clarification purposes and enclosed them in brackets.

File 1: /etc/udev/rules.d/70-persistent-cd.rules

Find the line that contains "Novatel_Mass_Storage" and append the following to it:

RUN+="/usr/bin/eject %k"

[You will probably need to use a comma before this part]

File 2: /usr/share/hal/fdi/information/10freedesktop/10-modem.fdi [Will require superuser privileges]

Add this in the USB section:

      <!-- Verizon USB760-->
      <match key="@info.parent:usb.vendor_id" int="0x1410">
        <match key="@info.parent:usb.product_id" int="0x6000">
          <match key="@info.parent:usb.interface.number" int="0">
            <append key="info.capabilities" type="strlist">modem</append>
            <append key="modem.command_sets" type="strlist">IS-707-A</append>
          </match>
        </match>
      </match>

I have tested this and it worked without issue in Jaunty Jackalope 9.04.

Linux BASH Script: Convert mp3 to avi with static image (command line)

This is a script I made to take advantage of the ffmpeg package in linux to quickly convert an mp3 to avi using a static image. I personally use this technique for uploading my songs to YouTube. I originally found the conversion command here.

All you need is a linux distro with ffmpeg installed, a jpeg or png image, and an mp3. Note: It is highly likely other image formats, audio formats, and output video formats will work, but I have only used jpeg/png+mp3+avi and so cannot attest to results otherwise.

Usage: bash mp32avi.sh <image_file> <mp3_file> <output_file.avi>

Code (mp32avi.sh):

#!/bin/bash
FFMPEG=`which ffmpeg`
if [ "$FFMPEG" = "" ] ; then
	echo "Please install ffmpeg.";
	exit 0;
fi
if [ $# != 3 ] ; then
	echo "Usage: $0 <image_file> <mp3_file> <output_file.avi>";
	exit 0;
fi
if [ ! -f $1 ] ; then
	echo "Source image '$1' not found.";
	exit 0;
fi
if [ ! -f $2 ] ; then
	echo "Source mp3 '$2' not found.";
	exit 0;
fi
if [ -f $3 ] ; then
	echo "Output file '$3' exists.  Overwrite? (y/n)";
	read CONFIRM
	if [ "$CONFIRM" == "y" ] ; then
		echo "Overwriting '$3'"
	else
		if [ "$CONFIRM" == "Y" ] ; then
			echo "Overwriting '$3'"
		else
			echo "Operation canceled.";
			exit 0;
		fi
	fi
fi
TIME=`$FFMPEG -i $2 |& grep 'Duration' | awk '{ print $2; }' | sed -e 's/,//g'`
$FFMPEG -loop_input -i $1 -i $2 -acodec copy -y -t $TIME $3

IE6 is not yet dead, although it should be … though I wonder

There’s not much contention that IE6 is an outdated browser and is not suited for the modern web.  If you don’t buy that just on sheer merit, there’s no point to read the rest of this article.

Most informed articles addressing the issue have boiled “the reason why” down to the fact that those users can’t upgrade (restricted by work, slow computer, etc.)

However, I wonder how it would be to build an HTML/JavaScript browser (which is effectively what Firefox currently is) that embeds directly into IE6 by visiting a webpage.

A user loads IE6, sets his/her home page to “awesome-js-browser.com” and then navigates from within that page.

I would venture that this would more of a nuisance and a resource hog than anything, else I think Mozilla would have tried it …

or have they? …

Coffee Cup Half Moons now has an official URL Shortener!

http://cchm.us/

Based on the same engine as http://abv8.me/ and http://ihrt.it/ (developed by my web company, geeXmedia).

I plan soon to launch URL shorteners on http://tharp.me/ and http://tharp.us/

GoDaddy Webmail eMail Notifier Under WINE in Linux :)

I use GoDaddy email for my work email, and it is a pain having to remember to check every so often to see if I got mail, especially when I only have new mail about 1 in 20 checks.

I have a notifier for my personal gmail account, and that is very handy — saves me a lot of time checking to see what has come in.

GoDaddy only offers a prepackaged notifier for Windows, however (although they supply a Mac executable if you try to download the notifier under Linux).

I checked the FireFox plugins for a generic webmail notifier, but the best one, WebMail Notifier does not have native support for GoDaddy (email.secureserver.net), and I didn’t feel like spending the time writing a custom script, especially since there is forum activity on this exact subject dating back 2 years, with no easily findable script.

Lo and behold?  The windows notifier works under WINE.  :)

Just download the Windows installer (it was wbensetup_PL.msi for me). Then in a terminal run msiexec /i <path_to_file>/wbensetup_PL.msi

Follow the instructions, and, at least in my Gnome toolbar, a notifier appeared.

Nice.

I should note that my current setup is Ubuntu 8.04 with OpenBox as my window manager, running the gnome toolbar.

CIE Surplus — new, richer, e-commerce ready website!

After a year and a half or so of lots of business at geeXmedia, we finally had some time to build a more robust web front-end for CIE Surplus, whose commissioning of ListApp is really what brought about our company.

Anyway, I wanted to highlight the new features.

First of all, the most important enhancement is the ability to buy items directly from the website!  None of the other optimizations we’ve done are worth anything without that functionality.  That means you can buy an awesome steal of a deal from a CIE Surplus eBay auction, proceed to check out using the much-upgraded CIE cart, and pile on a bunch of additional blowout deals right from the site, and pay right there.

You can even specify customer pickup.  What does that mean to people in the northern Virginia, Winchester and Berryville Virginia, and West Virginia eastern panhandle area?  Super deals with no shipping charges!

And for the savvy user, there are RSS feeds to track new inventory from wherever you choose.  They are as follows:

Check it out today!  They can ship to anyone, so even if you’re not stopping by to pick up, you can get some seriously awesome deals.

Why GoDaddy can’t get it right, and why you should not use their Grid Hosting

Grid hosting is a somewhat recent technology that essentially emulates a single server, but whose processing power is distributed over multiple servers.  There are several technologies used to accomplish this, but essentially it can be thought of as a cluster or as a self-scaling server arrangement.

The concept of it is to get the best of both worlds — you upload your website to a single server, or at least a single IP address, but depending on how much traffic your site is getting, more servers are added or removed to maintain stability.  In other words, easy, 100%, infinite scalability.

It is very attractive.  However, after spending the last few months using GoDaddy’s Grid Hosting BETA, I simply have to conclude that GoDaddy has it all wrong.  I believe from their perspective, they consider it one of their lower end solutions, and indeed, during the beta, it is only $4.99 a month.  However, infinite scalability is not something of value to someone whose websites will never scale.  Scalability is a point of interest for large sites, especially large sites that could become gargantuan.

So, the truth is, the service works very well with the exception of one important factor:  You can’t directly connect to any hosted file.  That’s right — if you attempt to post data to a script hosted on their server, you will, at least once ever so often, be forced through a 302 File Temporarily Moved redirect.  And, according to the W3C:

The requested resource resides temporarily under a different URI. Since the redirection might be altered on occasion, the client SHOULD continue to use the Request-URI for future requests. This response is only cacheable if indicated by a Cache-Control or Expires header field.

The temporary URI SHOULD be given by the Location field in the response. Unless the request method was HEAD, the entity of the response SHOULD contain a short hypertext note with a hyperlink to the new URI(s).

If the 302 status code is received in response to a request other than GET or HEAD, the user agent MUST NOT automatically redirect the request unless it can be confirmed by the user, since this might change the conditions under which the request was issued.

      Note: RFC 1945 and RFC 2068 specify that the client is not allowed
      to change the method on the redirected request.  However, most
      existing user agent implementations treat 302 as if it were a 303
      response, performing a GET on the Location field-value regardless
      of the original request method. The status codes 303 and 307 have
      been added for servers that wish to make unambiguously clear which
      kind of reaction is expected of the client.

Summary: Any posted data cannot be forwarded through the redirect.  Again, I’ll reiterate that this only happens on occasion, but it happens 100% of the time for a service I was trying to implement:  PayPal IPN.

That’s right, every time PayPal tries to send an IPN (Instant Payment Notification) they are sent a 302 File Temporarily Moved header.  Given that PayPal sends payment data via POST, and that PayPal is so fixated with security, and that the W3C expressly prohibits the forwarding of posted data through a 302 redirect, well it just doesn’t work.

And ultimately, anyone who uses any form on their website cannot reliably expect to get any results, as this redirect will prohibit the form from working.

The fact of the matter is, it really makes no sense anyway.  A logical conclusion would be, well why not have PayPal submit the data to the destination of the temporary redirect?  Short answer, the redirect goes to the same file.

Let’s say my IPN handler is at http://mysite.com/handler.php.  PayPal sends POST data to that URL, only to receive a 302 reply to redirect to the location http://mysite.com/handler.php?3ecvxYara (3ecvxYara is just a random string of characters, and it can change).  Ok so PayPal is then redirected to http://mysite.com/handler.php?3ecvxYara, where it can’t resubmit the POST data, but that’s where GoDaddy sends them.  Once they hit that URL, they receive yet ANOTHER 302 which directs them back to the original URL, only this time it works.  However, 2 steps ago, we were prohibited from resubmitting the POST data.

Let me summarize, GoDaddy, this SUCKS.

I have spent almost 6 months back and forth with their technical support.  I would send an email, only to get the reply “Hey we got your question.”  Then, a few hours later, I would get a reply “Your request has been forwarded to the high level tech guys because we low-level guys barely know how to turn a computer on.”  Finally, a day or so later, I would get “We think it is a problem with your scripting.  Check your script and make sure you’re not screwing up.”

At this point, all I knew was that PayPal couldn’t connect because they were getting a 302 reply.  I had theorized that there was some kind error in the Grid Hosting redundancy system.  Ultimately, GoDaddy admitted that it was something happening on their end.  Then, finally, someone on the PayPal boards triggered me to look up the protocol for a 302, and I realized the problem.  Then, I used FireBug to track the headers, and sure I enough, I found what I needed to know.  So, I questioned GoDaddy as to why the 302 is necessary.

The reply:

Dear Sir/Madam,

Thank you for contacting Hosting Support.

Your issues with communicating with Paypal are related to the methods which we use
to protect our network. For security reasons, we cannot get into the technical
explanations as to why this configuration will not work.

Please contact us if you have any further issues.

Regards,

Aaron R.

Hosting Support

In other words, they can’t come up with a solution that makes the Grid Hosting system behave like a normal server, and, as I mentioned before, that is supposed to be the appeal of this technology.  Give me a break … temporarily redirecting every file somehow protects their network?  Wow!  I guess I know now how to try to bring down their other hosting services right??

Needless to say, I will not be using Grid Hosting from GoDaddy anymore.  I will next try Media Temple, as it is more matured and I have not read about any such issues.  I doubt GoDaddy cares about losing my $4.99 a month, but I will deter as many people as possible from using their system at all.  Their slow, inefficient, vague technical support is a frustration in itself, and the fact that they can’t get their system to work properly, well that’s just bad business.

Days 2 & 3, eBay DevCon 09

So days 2 and 3 of ebaydevcon09 seemed very uneventful compared to previous DevCons (Chicago and Boston).  eBay played it very low key, and mainly just used it to press the selling manager applications platform to those who aren’t already knee-deep in it (such as me).

A lot of the talks focused on SMApps and how to monetize your applications there, so that was helpful, but, frankly, I got more out of networking with other developers and eBayers than the talks and presentations themselves.

We cut day 2 of the conference short (The first day wasn’t actually part of the conference) and did a little tour of San Fran by car — saw quite a bit for just a few short hours.  We drove over the Golden Gate, then drove all the shore-roads all around the bay.

There were some pretty cool communities and views — definitely worth the drive.

I have uploaded pictures here:  http://ihrt.it/3

and here:  http://ihrt.it/2

Sorry it is so short, but it’s taken me forever just to write this anyway.  Maybe the pictures will be more descriptive.  :-D