Welcome to Your Weekly Paradigm Shift: Very Large "Free" Web E-Mail Accounts
This week I asked J'e St Sauver to give us his take on the implications
of Google's Gmail for higher education institutions. He's done an excellent
job of summarizing the pros and cons and suggesting some interesting twists
to the implications.
----------------------------------------------------
J'e St Sauver, Ph.D. (j'[email protected])
The folks at Google have turned their eye to free Web e-mail accounts, and
in typical Google fashion, they're making quite a splash: each Gmail user gets
a 1,000MB worth of disk space. To put that in perspective, if you're at a typical
college or university, your university e-mail account probably has a quota of
50 to 100MB.
Under some circumstances, a "mere" 50 to 100MB quota may be more
than enough. For example, if you routinely download your e-mail to a PC using
POP3, 50 to 100MB will probably be more than enough space to act as a buffer
in the interval between downloads-you can do the bulk of your long term e-mail
storage on your PC's large hard drive.
Unfortunately, most people don't read their e-mail using POP3. They use Web
e-mail (and leave their mail on the server), or IMAP (and leave their mail on
the server), or they may even a command line e-mail client such as pine (which
also leaves the mail on the server). When that's the case, 50 to 100MB to hold
all your e-mail may feel like a Dixie cup of water at high noon in the middle
of the Sonoran desert in August.
Google, on the other hand, will give you virtually all you'd want to drink,
with plenty left over for a shower, for watering some needy cactus, and for
making mud pies, too.
Moreover, Google gives users not just raw disk space, cool as that may be in
and of itself, Google also gives you private indexing of your e-mail-the ability
to privately "Google" a gigabyte of stored e-mail is obviously quite
helpful, and not something that non-Google folks will be able to deliver (by
definition).
Notwithstanding that, I predict that others *will* try to keep up with Google.
For example, Yahoo! Mail Plus will be offering 2,000MB of storage, POP access,
mail forwarding, and other features for a nominal yearly fee, and Hotmail Plus
will be doing much the same thing.
For this audience, however, the interesting question is "what will colleges
and universities do?"
Will colleges and universities join the keep-up-with-Google club, or will they
try to just muddle along with their good old 50-to-100MB quotas?
There's risk either way. Let's begin by considering what happens if colleges
and universities try to "keep up with the Googles."
Going from 50-to-100MB quotas to 1,000-to-2,000MB quotas means that suddenly
I/O throughput will become key, and unfortunately many university e-mail systems
skate right on the thin edge of I/O congestion at the best of times, largely
due to our friend Mr. Spam. Increasing the size of your faculty/staff/student
e-mail disk quota puts you at risk of saturating your disk farm, either in terms
of operations/second or delivered throughput.
At least for some POP3 and IMAP based architectures, the key equation quickly
becomes:
(number of users)*(inbox size in MB/user)*(access frequency)
Looking at that equation, and assuming everything else remains constant, a
10 or 20 times bigger disk quota implies a proportionately larger I/O load.
Is your disk farm ready to handle 10 or 20 times its current I/O load?
Other factors to keep in mind:
- With larger disk quotas, users want and expect larger per message quotas
(at 1GB, 10MB/message is no longer a big deal)
- Choice of mailbox format and file system may also become quite important,
if you think about performance issues associated with UFS-derived file systems
attempting to cope with inserting and deleting inodes in a directory with
lots and lots of individual files.You'll also want to think about how you
back up and handle restoration of files in large disk array.
- There are network implications to accessing that volume of storage (deployed
disk space is an excellent proxy for/correlates strongly with bandwidth requirements-are
you ready for the rise of private e-mail music trading, for example?).
But let's assume that you decide that you *do* want to stand up to the "Google
challenge," and offer your users not just 1,000MB or 2,000MB, but 5,000MB.
In terms of funding something like a 5GB quota for 20K hypothetical users,
the raw disk (assuming you use high-density serial ATA drives) would potentially
be quite doable:
(20,000 users)*(5GB/user)
------------------------------------- * 1.25 (RAID overhead, spares, etc.) ==>
(250 GB/drive)*(8:1 oversubscription)
That's roughly 63 drives @ ~$185/drive ==> $11,562 for just the raw disk
itself.
Of course, if you're buying five dozen drives at once, you can expect to see
a somewhat better price than the typical onesie-twosie price mentioned; that
8:1 oversubscription may be unduly pessimistic, too.
3ware and other vendors offer 8 and 12 port SATA controllers
(see for example http://www.3ware.com/products/serial_ata9000.asp), with street
prices well under $1K/controller. You'd obviously also need systems to hold
and drive the disks, plus a backup solution, but my point is that the raw ingredients
aren't that expensive any more. (And if you don't want to screw around buying
and integrating hardware, doing just a cursory search, I'm seeing 4TB SATA NAS
building blocks with 16 drives from commercial vendors for $13K or so; multiply
as required to get the capacity desired).
Before anyone mentions it, yes, there's no doubt that these boxes aren't the
same as the top-of-the-line fiber channel-based boxes that folks would prefer
if money was not an issue, but money is an issue at most schools. I think you
might be surprised at the capacity and performance you can get from SATA-based
storage solutions.
What if you decide not to try to compete with Google?
If you decide not to try to compete with Google, you should expect your users
to migrate from your local e-mail accounts to one of the new huge Google accounts,
some probably forwarding their institutional accounts to their new accounts
in the process, some probably not much caring that they have an institutional
e-mail address at all.
That has some interesting implications.
If students move to Google and don't forward their mail, you sure better hope
that they remember to check their old, sad, sorry, puny institutional account,
since that's probably where the university is going to be mailing Important
Official Notices That They Should Read.
If students move to Google and do forward their mail, it will be a miracle
if you don't end up having your school's e-mail blocked. Remember that when
you forward e-mail, from the point of view of the host that's receiving the
forwarded e-mail, the e-mail appears to be "coming from" the forwarding
host. Thus, when a spammer sends e-mail to your students' institutional e-mail
account, and the students in turn automatically forward it to their new Google
accounts, if Google happens to looks at that spam, it is going to "look
like" it "came from" *your* institutional mail server. We all
know what happens to spammy looking hosts, right? They get blocked, yeperoo.
There's also the issue of wide area bandwidth. For the first time students
have a serious remote storage resource available to them. Are you ready for
the Internet traffic that will be associated with all that storage?
But don't despair: Gmail isn't perfect (even though it sure seems pretty dang
good based on the Gmail account I've been testing). Specifically:
- Google Gmail accounts are still in test deployment, and are available by
invitation from an existing Gmail user only (then again, if you ask around,
you can probably find a friend or acquaintance who has a Gmail account, and
more likely than not they'll probably have a spare invitation which they can
share with you).
- When you pick a Gmail username, it has to be at least six characters long.
J'e, Tom, Sue and Sally simply aren't available as Gmail usernames, sorry.
- Gmail d'esn't work with all Web browsers. As of July 13th, Gmail supports
IE 5.5 and later, Netscape 7.1 and later, Mozilla 1.4 and later, Mozilla Firefox
0.8 and later, and Safari 1.2.1 and later. Opera users, sorry, you're not
on the officially supported list.
- You need to have Javascript and cookies enabled, which can be rather suboptimal
from a security point of view.
- When you use Gmail, Google d'es show you advertising, and the advertising
is automatically targeted based on the content of your mail messages. Thus,
if you're talking about going to Hawaii with a correspondent, don't be surprised
if you see ads for airline airfare specials to Hawaii, or Hawaiian hotel advertising.
(On the other hand, the ads are low key and unobtrusive, rather than obnoxious
popups, popupunders, or banner ads).
- Some folks worry about Google being able tie their identity to their Web
searching and browsing habits via the cookies it uses. (But Gmail d'es have
a privacy policy, which you can review to see if it is sufficiently strong
to address privacy concerns you may have).
Bottom line, one way or the other, Gmail is here and here to stay. You probably
are going to want to start thinking about how you and your institution will
cope with it, if you haven't already done so.
Wow. What an excellent summary of Gmail and its implications for higher
education institutions.
My own opinion . . . Gmail is what we should be doing for our students,
faculty, staff, and alumni. My bet is that this is going to evolve into users
having a "magic purse" on the Internet that they can put everything
digital that ever might need into and access it any time from anywhere. Wh'ever
provides that magic purse has guaranteed eyeball time from the user, and this
could be a way for institutions to lock up their academic communities through
entire user lifetimes. Maybe an institution could license its own "brand"
of Gmail from Google?
P.S. If anyone has a spare Gmail invitation, I'm looking to try it out myself:
[email protected]. I've got 300,000 e-mail messages stored on my hard drive
and the thought of being able to Google that equivalent in Gmail-once more have
accumulated-is very attractive.