Google (穀歌)

Google’s ‘old dog’ taught Chinese tricks (from Ars). Apparently Google in Chinese is written as 古狗 [gu gou] (literally translates to old/ancient dog). Not wanting to sound old and unhip, they renamed themselves to 穀歌 [gu ge]. They say this means ‘harvest song’, but literally, it translates to grain song, which I guess is close enough.

Speaking about Chinese characters and the Cantonese Dialect, I had an interesting discussion with someone through email recently:

I had lunch over at Chan’s café on 152nd the other day, and I noticed that both the sign on the front of the building and the menu had the characters:

食好D

Now the first two I know from studying Japanese, so I asked the waitress what the significance of the D is. She seemed to have trouble trying to explain it in English, but as far as I could understand, the way the D would be pronounced was a homonym for some word specifically in Cantonese. So can anyone explain what word the D is supposed to represent, what character it would be, and are there any other connotations of using a roman letter there?

Thanks,
Matt

D is used because there isn’t really an official standard written Chinese character for this Cantonese word, but some people write it as 啲. The meaning of 啲 corresponds to the standard written Chinese word 些. Also, Cantonese uses 食 instead of 吃 for the verb ‘to eat’. So 食好D essentially means 吃好(一)些 (meaning: eat a little better) in standard written Chinese.

-Clyde

Another way of saying it is: 食好一D. You can also see it as 吃好一點 (chi hao yi dian). I think that’s where the D sound comes from.

//Toland (^_^x)

Actually, there is a slight difference in meaning between 點 and 些. The usage D corresponds more to 些 (in all cases where D is used, 些 can be substituted, but the same cannot be said for 點). Also, linguistically the D sound is more likely derived from 的 rather than 點.

-Clyde

I was wondering in which cases you were thinking about where 些 would work but not 點. And as you noted, 啲 isn’t actually a real character. The way that word was created was because it sounded like 的, and hence adding the mouth in front of it. Most words that have that aspect have nothing in common with the word they use. This is the case where the character was derived from the sound, and not the meaning.

Although I agree, that 些 can be used in place of 點.

//Toland (^_^x)

Here’s one example where the use of 些 is much more preferable over 點:

Cantonese: 嗰啲人 (meaning: those people)

Standard written Chinese: 那些人 (那點人 is only marginally acceptable by some speakers)

One piece of evidence indicating that 啲 [di] is more likely derived from 的 [dik] rather than 點 [dim] is the use of this Cantonese phrase: 的咁多 [dik gam do] (meaning: only a bit so much), which corresponds to the standard Chinese 一些那麼多. 的咁多 can also be spoken as 啲咁多 [di gam do]. If 啲 was instead derived from點, then *點咁多 [*dim gam do] would be possible, but it’s not.

Note that the usage of the character 的 itself as a particle is only a recent invention. The written character 的 was co-opted during the language reforms of the early 20th century to represent this particle. So when I say that 啲 is derived from 的, what I really mean is that it’s derived from the particle that 的 now represents. In Mandarin, this particle evolved into the possessive particle that is now familiar in standard written Chinese; in Cantonese, it evolved into the indefinite plural measure word 啲. Another thing that strengthens this connection is the fact that all measure words in Cantonese can also be used as possessive particles.

-Clyde

You are right 些 would work better in those cases, but I don’t think your argument about D coming from 的咁多 exactly works, since the 的 in there doesn’t exactly carry the original meaning. Instead, just like 啲, it appears to just be phonetic.

I was looking up origin information for the word, but didn’t find much:

啲 – Wiktionary
Cantonese (linguistics)

Many characters used in colloquial Cantonese writings are made up by putting a mouth radical (口) on the left hand side of another more well known character to indicate that the character is read like the right hand side, but it is only used phonetically in the Cantonese context. The characters [2] 㗎, 叻, 吓, 吔, 呃, 咁, 咗, 咩, 哂, 哋, 唔, 唥, 唧, 啱, 啲, 喐, 喥, 喺, 嗰, 嘅, 嘜, 嘞, 嘢, 嘥, 嚟, 嚡, 嚿, 囖 etc. are commonly used in Cantonese writing. As not all Cantonese words can be found in current encoding system, or the users simply don’t know how to enter such characters on the computer, in very informal speech, Cantonese tends to use extremely simple romanization (e.g. use D as 啲), symbols (add an English letter “o” in front of another Chinese character; e.g. 㗎 is defined in Unicode, but will not display in Microsoft Internet Explorer 6.0. hence the proxy o架 is often used), homophones (e.g. use 果 as 嗰), and Chinese character of different Mandarin meaning (e.g. 乜, 係, 俾 etc.) to compose a message. For example, “你喺嗰喥好喇, 千祈咪搞佢啲嘢。” is often written in easier form as “你o係果度好喇, 千祈咪搞佢D野。” (character-by-character, approximately ‘you, being, there (two characters), good, (final particle), thousand, pray, don’t, mess with, him/her, (genitive particle), things’, translation ‘You’d better stay there, and please don’t mess with his/her stuff.’)

Language is a funny thing and definitely a living thing. 的 (meaning wise) would never work in any of the examples listed. Even in 的咁多, as I mentioned, it’s only used as a sound and not the meaning. If you’ve ever tried to teach your Chinese speaking parent English, you’ll notice for words they can’t enunciate, they’ll write in a string of Chinese characters that make similar sounds (just like 的咁). If I had to make a guess on the origin of 啲, it’d still be 點, but after its usage became common in everyday speech, it just took off on its own and got inserted into other areas where the original character didn’t. 些 just happened to better match its old and new usage better than the original.

Another point I’d like to make is that D is half of 點 [dim], which is really pronounced more like di-mm. People get lazy and drop off the mm and what we end up with is D. The same doesn’t work with 的 [dik], dropping of the k sound doesn’t leave you with D.

//Toland (^_^x)

since the 的 in there doesn’t exactly carry the original meaning

However, the particle that 的 represents in 的咁多 does carry what I assert to be the original meaning, which is an indefinite plural measure word (basically the same as the modern 些), not to be confused with the meaning that 的 currently has in modern standard written Chinese, which is not the original meaning.

的 (meaning wise) would never work in any of the examples listed

I think you’re again confusing the meaning of 的 in modern Chinese with the meaning of the historical particle that I’m asserting. The historical meaning, which is the indefinite plural possessive (since all measure words in Cantonese, and probably the historical dialect(s), can be used as possessive particles), would work in that example (though actually the modern meaning of的 would work there too, since a general possessive sense would work too).

Here’s the relationship I’m asserting for this theorized historical particle, which I will call D:

  • Particle D: pronounced with a -k coda; used as indefinite plural measure word or indefinite plural possessive particle (because all measure words can be possessive particles)
    • Mandarin/Standard Chinese: lost -k coda; used as general possessive particle only (because no measure words can be possessive particles); represented as 的
    • Cantonese: retains -k coda in limited contexts; used as indefinite plural measure word or indefinite plural possessive particle (because all measure words can be possessive particles); represented as 啲

There is another word that has a similar history, which is the character written as 個 in modern Chinese. For this particle, we have another data point, which comes from Shanghainese. I will call its theorized historical equivalent particle G.

  • Particle G: used as definite measure word or definite possessive particle (because all measure words can be possessive particles)
    • Mandarin/Standard Chinese: used as measure word only (because no measure words can be possessive particles); represented as 個
    • Shanghainese: used as general possessive particle only (because no measure words can be possessive particles); represented as 個
    • Cantonese: used as definite measure word or definite/general possessive particle (because all measure words can be possessive particles); represented as 個 when pronounced as [go3] and used as measure word or definite possessive; represented using 嘅 when pronounced as [ge3] and used as a general possessive

The general possessive meaning of 的 in Mandarin and 個/嘅 in Shanghainese/Cantonese were likely spawned off from the plural and definite possessive uses in historical particles D and G, respectively.

they’ll write in a string of Chinese characters that make similar sounds (just like 的咁)

That is true for many cases, but not for all. Take for instance, the Cantonese word 卒之 (meaning “finally”), which does not exist in standard written Chinese, but is etymologically accurate since it uses the Classical Chinese meaning of 卒, which is “final”. The 的 in 的咁多 is most likely not arbitrary either, especially since it matches up exactly with the indefinite plural measure word usage.

The same doesn’t work with 的 [dik], dropping of the k sound doesn’t leave you with D.

However, I gave an example where such a dropping of the -k coda does give the D sound. Namely 的咁多, which can be pronounced [dik gam do] or [di gam do]. In addition, the tone of 啲 matches 的 (both are high level) better than the tone of 點 (mid-to-high rising).

Anyway, this is of course all conjecture since we can’t necessarily go back in time and see how people spoke. In addition, the various dialects that have been around during Chinese history were never really written down until recent history when Mandarin became the basis of standard written Chinese. Before then, everything was written in Classical Chinese, which is even older than the hypothetical historical dialect(s) I base particles D and G on.

Although it seems like 點 is an obvious derivation of 啲, the other pieces I cited (closer meaning to 些; pronunciation of 的咁多; parallelisms in the evolution of particles D and G) make be tend to believe that 的 is the better candidate.

-Clyde

Interesting… I didn’t know that 的 had an ancient usage that’s no longer in today’s language.

//Toland (^_^x)

I’m still not convinced that 啲 originated from 的, besides phonetically wise. I believe both 啲 and 的 both originated from another word that is either no longer in use or the origin has been forgotten. I still think 點 has a high probability of being the origin. If anyone has any comments or insight to this, I’d be very interested in knowing.

Inline Ads

You can thank Derek for advising me to add in-line google ads to the posts. They won’t appear on the front page, but individual posts will now contain them.

Of course, you still have the option of registering to remove all the ads. I’ve created seperate channels to actually see which one does better, so hopefully I can provide some statistics when this month ends.

Transferring Domain

I’m transferring my domain from GoDaddy to NameCheap, so there might be some downtime.

What Grinds My Gears: Many of today’s service companies are whacked. New customers are getting better rates and discounts than old customers. Whatever happened to company loyalty? What many of today’s companies are forcing us to do is swtich, switch, SWITCH! Easiest one to think of are cell phone services. New customers get free or cheap new phones and great plans. Old customers, you have to threaten to cancel before they start wagging their tail, and even then, the deal you’re getting won’t be as good as what new customers would. This is all backwards. Now isntead of trying to keep the customers they already have, they’re trying to steal customers from others while others steal from them. I personally think it’d be a lot easier to keep customers, and if I had a company, that’d be my priority over getting new customers. Contracts help, but after a year, they’re gone.

I’d like to point out another such company, INGDirect, an online savings bank. Just 2 months ago, I had all my savings money in there, despite there being higher interest rates at EmigrantDirect and HSBCDirect (I just realized all these online savings bank has the word Direct in them). Anyway, I got so pissed at their latest promotion, I finally switched out. If it wasn’t for this promo, my money would’ve still been at INGDirect. The promotion was, any new accounts or balance over a certain date would get a higher interest rate of 4.75% while the old balance would get 3.75%. Being a loyal customer, I already had all my savings in INGDirect, and the few extra salary payments I was receiving for the 3 month promo would only amount to a few dollars difference vs $50 difference if all my money would’ve gotten the higher interest rate. So I transferred to HSBCDirect when they started their promo of 4.80% for ALL FUNDS, not just new funds after a certain date. I agree, INGDirect support is fast and quick and money actually transfers from ING to my checking account in Bank of America in only 2 days, I hope by transferring all my money out would make a statement on how they treat their current customers. HSBCDirect isn’t that bad either. The interface does need some work and it only takes 3 days to get my mony from and to my checking account.

I don’t really mind promotions that treat both new and existing customers equally, but whe promotions start treating new customers better than existing customers, especially customers that’s been loyal to you for many years, it breaks my heart when all they see are new customers in their eyes.

So why is this on a transferring domain blog? The reason once again for transferring my domain from GoDaddy to NameCheap is the fact that GoDaddy treats new customers better than old. I had wanted to renew my domain registration for 10 years. I even called them to ask if I could get the promo rate for new transfers and registrations. The difference wasn’t big either. It was $85 vs $89.50. Guess what, the guy on the phone even talked to his supervisor and they said this rate was only for new registrations and transfers and couldn’t give me that $4.50 discount. Welp, guess what. They lost a customer. I was first going to switch over to Yahoo! Domains because they were having a deal for $2.95/yr registrations, but at that time, my domain still had another 2 months before it expired, so I thought I’d wait to maximize. Opportunity lost when I tried a few weeks ago to transfer my domain and the promo had ended and it returned back to the regular $9.95 pricing. I already owned krunk4ever.org on NameCheap (don’t bother spamming that domain, because all email gets bounced), so I went to check what deals they were having. $8.88/yr + there was a coupon for $7.99/yr for the 1st 2 years. I thought I’d bite. Still can’t believe GoDaddy would rather lose a customer than give me that $4.50 discount. I had even sent subsequent emails regarding this issue and even telling them I’d be switching over to NameCheap, but their reply was the same every time.

ピタゴラスイッチ – Pythagora Switch

Saw series of these videos on eBaum (from Deadlock), but since we’re both anti-eBaum, Deadlock suggested that I try to find another source. Searching on Google took a bit, but I finally found what I believe to be the source of the videos, or at least a somewhat higher quality version of the videos: 箸おきと靴みがき | ピタゴラ装置動画.

The original show is hosted on NHK (in Japan): 幼稚園・保育所番組のひろば|ピタゴラスイッチ. It appears to be a children’s program, which makes sense.

This is a set of videos following the likes of Rube Goldberg’s contraptions. Some are short; some are long; some are simple; some are rather complex. Anyone that like to follow Rube Goldberg’s contraptions would love to see these videos. Do note, the video sizes range from 500KB to 9MB, and before the video is show, it requires downloading the entire film. If you want, you can also download directly using the “download movie” link provided on each page. One of the things they could’ve done better was to actually show how the contraptions work step by step. Some of the times, I had to guess what was exactly happening, though most of the time, it was pretty easy to guess.

Instead of direct liking to their videos, I’ve uploaded it onto my gallery:

ピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch videoピタゴラスイッチ - Pita Gora Switch video

After searching online trying to figure out what Pita Gora stood for, it turns out to be “Pythagora Switch”. Pythagoras links directly to the mathematician. I don’t think Pythagoras (founder of Pythagorean theorem) made any device or switch similar, but the TV show just borrowed the name because it’s education related.

Side Note: I was originally hitting issues with the movie’s being played on my site and for some reason, the caching was broken where the file I downloaded didn’t match the original. I went ahead and updated Gallery to 2.1 and it’s been working fine so far. Of course, with every upgrade, there comes the question if I broke anything else, since I do have a few customizations of my own. If you see any problems with Gallery, please do let me know and I’ll fix it asap.

Several Updated Posts

I uploaded a short clip of the baby Saber Marionettes J to X and added a link from my latest Saber Marionette J to X post. Just scroll to the bottom to see the link of the video.

I found someone with a better quality of the Expression Crew – Marionette Show clip and had him upload to rapidshare. I posted the link in the original Expression Crew post. I’ve also listed which soundtrack they used for their music.

Enjoy!

Comments without Registration

Anyone is now allowed to leave comments without registering to my site. However, ads will still only be removed for those who register. Had gotten some feedback from Tekman and said how he was discouraged from leaving comments because I forced registration and email validation. You’re still required to put in a name and email address, which of course you can fake. However, unregistered comments will still be put into the moderation queue. Another benefit of registering is the fact that after I validate your 1st comment, you’ll be able to post comments without waiting in the moderation queue. I’ve gotten way too many spam comments in the past and I need some way block that. Everyday when I check my access logs, I still see about 10 sites trying to post messages to my old guestbook location.

Ninjai

Been awhile since I visited the Ninjai: The Little Ninja homepage, since they stopped producing more chapters, but apparently 2 new chapters have been released (since my last visit). I’ve always wanted to see his face. It gets close, but you something is always covering it.

Looking on Wikipedia, apparently these 2 episodes were released in Januaray and March of 2005 respectively. Can’t believe it’s been that long since I visited the site. I was browsing through the forums, trying to see when the next chapter would be release, but no one seems to know.

For those who haven’t seen Ninjai, you can watch the tralier here, though it doesn’t really do it justice on how much nicer the graphics and animation has gotten in chapters 11 and 12.

Private Posts

I just noticed that several of my posts was marked as Private unintentionally. That has been fixed. Enjoy!

FolderShare

FolderShare (from AT) – Keep important files at your fingertips – anywhere. All file changes are automatically synchronized between linked computers, so you are always accessing the latest documents, photos, and files.

I haven’t used this yet, but it sound interesting.

How It Works
FolderShareTM is a service that allows you to securely keep files synchronized between your devices, share files with friends or colleagues, and remotely download your files from any web browser. FolderShare consists of two components – My FolderShare and the FolderShare Satellite.

My FolderShare: Configure and manage your account from here.

Go to www.foldershare.com from any web browser and login to access your “My FolderShare” page.

From here you can do any of the following:

  • Setup devices to sync or share with
  • Invite others (via email address) to share your files
  • Manage your account – upgrade your subscription, change your email address or password
  • Get help
  • Download the latest software

FolderShare Satellite: The software you need to install on the device(s) you want to sync or share files with.

  • The FolderShare Satellite will run in the background when you are online.
  • As you update shared or synced files, it will automatically update them on the other devices connected.

I did a search on file size limit and came back with this:

Does FolderShare have a file size limit?
FolderShare can not transfer files larger than 2GB. There is a maximum of 10,000 files per library, and you may create up to 10 libraries.

Interesting…

Too bad I don’t trust to have my documents stored online on a sever other than mine.

Referer Parser

So I spent most of yesterday working on relearning how to parse Apache logs and how to get it automated and onto a working online user interface, which was rather fun. Haven’t really had much time to do web development after I started working, hence why I decided to use an existing framework and slowly build off of that.

So what I ended up with is a simple parsing of my logs directory. It’s a bit unpolished, but for now, it’ll do.

* links do nothing in this preview

When you click on a link, it calls the tool I wrote called and passes in the log file I want (i.e. parse.php?log=access.log.2006-03-10.gz). The tool can distinguish between compressed files (.gz) and raw files (no extension). If it detects it’s a gzip file, it’ll decompress it and then rerun the tool with the location of the extracted file.

What the tool does is generate the results in XML format, similar to the following:

<domainlist>
<domain name="search.yahoo.com">
<referer from="http://search.yahoo.com/" to="GET /blog/ HTTP/1.0" date="2006-03-11 04:01:56" ip="125.247.105.242" />
</domain>
</domainlist>

I actually group the entries by domain. There’s no sorting yet, but I didn’t find that necessary in my case, since all I really wanted was a easy way to see who’s linked my site and be able to collapse all the google ones if they’re not interesting.

After it generated this file, it would automatically zip that file (reduces my ~2MB xml files to ~100KB) and then email it to me. I’m trying to figure out if I’m allowed to schedule to run this once a day instead of having to do this on demand.

The next thing I needed was an XML viewer. I was contemplating at first to use IE or Firefox, but decided against it. IE was finicky about expanding and collapsing items. Firefox just hang when trying to load a 2MB xml file. So I searched online and the first couple of links in my search result ended up with either: MindFusion XML Viewer or IBM XML Viewer. The IBM one is no longer available, so I went ahead and got the MindFusion one. It was free after all and the screenshot looked decent. At first it didn’t show me all the information I wanted as fast as possible, but after enabling Name and Value in tree and Expand On Load, it was a lot better.

The final thing I wanted to do was to require authentication in order to load this page. Turns out there was a easy way just by creating a users file (htpasswd -c ~/users username) followed by a .htaccess point to that file. You can find more information on how to do this if you’re running Apache by clicking on the link under my reference list below.

Here’s an example: access.log.2006-03-11.gz

Finally, to give credit for references I used:
Apache log file parser class
PHPMailer
Using User Authentication