There are waaaaay too many aliases for this page. Can't we get rid of the ones not used elsewhere on SL, please?
The whole point of the aliases is as a "safety net" of sorts. Who knows if some guy creates a Wikilink to his or her favorite spelling, then says "Hey, there's no article! Let's create one right now!" and then uses it. Maybe it doesn't happen so often on SL, as SL isn't used that much. But it potentially can happen.
Also, getting rid of all of the ones not used on SL is misleading, because I have set up some links so even though they are labelled with one romanization, it actually redirects to another. It's very easy to do.
For instance: [Toya Meijin|Kouyou Touya] becomes Kouyou Touya
Frankly, I have to create less aliases for these articles than I have to for Wikipedia - SL does not differentiate spacing, dashes, and apostrophes - And it does not allow article names with diacritical marks (macrons and circumflexes, which are often used in romanization of Japanese), while Wikipedia does.
As for why Toya Meijin has all of the aliases:
We don't need aliases where the "h" Romanizations substitute for "u" Romanizations because relatively nobody who reads Hikaru no Go will encounter them. Even if we lived in the fantasy world in which "h" Romanizations were valid, the "u" or "(nothing)" Romanizations are so common and popular that such a "safety net" is useless. Even nets have holes in them (to let the little fish escape capture). Aliases are not threading for a "safety bag".
It's fairly common to see "h" romanizations (like "Oh", or "Tohya"). They are called "Passport Hepburn", and the Japanese government allows them on passports - http://www.seikatubunka.metro.tokyo.jp/hebon/
And German people who read their versions of Hikaru no Go DO encounter them. True, the said romanizations don't occur in the official English version, nor do they happen in most of the scanlations that were produced before HnG got licensed. Even so, one romanization used in Roman script can be used in many Indo-European languages. So a German person could use "Akira Tohya".
I also believe that this page has too many aliases, we could do without a lot of them. ViciousMan, if someone were to create a page in one of the alternative romanizations, it's easy to merge the contents of that page into the main page, and turn the new page into an alias at the time. Besides, that would also indicate that the alias is needed. Just my 2 cents.
What is the problem with having many aliases?
I completely agree with you, Phelan.
I asked this on one of the more technical pages as well, but wouldn't be possible to have a more intelligent search function? More google-like? This might do away with most of the aliases. On top of that it would make searching a lot easier. For me this one of the biggest weaknesses of Wikipedia, the search funtion there is quite crappy.
Yes, it is possible to have such a thing. It is called "Google." ;)
three points without capturing site:senseis.xmp.net
I sometimes miss a 'search with google' in the search page. The one we have works fine, but sometimes it's hard to find what you really want.
That works fine, but it is not 'internal'. Also google tend to exagerate sometimes.
is it that hard to get thi set up internally? So if you misspel soemthing, you still come up with the correct page for example (good for aliases)?
work around:
I believe it is possible for SL to include a google search text box on SL pages that will do the steps above for you.
Enjoy
--jfc
P.S. this is what erislover did to make his example link above.
My results for a two searches with no results
A successful search: http://www.google.com/search?hl=en&lr=&safe=off&as_qdr=all&q=Kouyo+Toya+site%3Ahttp%3A%2F%2Fsenseis.xmp.net&btnG=Search
Yes, what I say I missed was that, some box that did all those steps for us (like wikipedia when their search is offline): I think is quite hard to handle misspelling, a few times I've been thinking about it and could'nt find a good way in a program to do it easily. But I'm not an expert on this.
And about no results... usually you have to write one correct word, like only 'kobayashi'. The way google handles misspelling is quite statistical... It won't say you misspelled... let me think, 'eignevector' because it isn't misspelled usually (and BTW there are pages with it :D), and it's not a common word.
Herman Hiddema: I think that this problem could be solved by having the search function also use the mysql soundex of words. This would be implemented somewhat like this:
SELECT pagename FROM pages WHERE pagename SOUNDS LIKE 'Toya Koyo'
For a description of the soundex algorithm, see http://en.wikipedia.org/wiki/Soundex
For MySQL docs on this, see: http://dev.mysql.com/doc/refman/5.0/en/string-functions.html
Since the algorithm will also drop 'h', all the folliwing words will have the same soundex: koyo, kouyou, kooyoh, kouyoo kohyo, etc.
Here are a few functions to program in the algorithm:
Hepburn is defined as the modified Hepburn romanization system used by the United States Library of Congress. There are many variants out there.
Standard and wāpuro romanizations
Also, sokuon+
Non-standard romanizations
Also, n followed by some consonants can be rendered "m" in original Hepburn but is "n" in modified Hepburn - e.g. tenpura = tempura
Also, "E" had been changed to "ye" historically - names like Ieyasu and Inoue have been seen as "Iyeyasu" and "Inouye"
In addition, sometimes Ka is rendered "Ca" and Ko as "Co", etc.
Only alias the Romanizations used on Sensei's Library!!!!!!!!!!!!!!! That is their purpose!!!!!!! Aliases are names that pages are "also known by" within the SL world. They are not the complete set of possibilities for the name of a page!!!
"Only alias the Romanizations used on Sensei's Library!!!!!!!!!!!!!!! That is their purpose!!!!!!! Aliases are names that pages are "also known by" within the SL world. They are not the complete set of possibilities for the name of a page!!! "
As long as SL remains an open wiki, visitors will use romanizations other than what SL uses (due to lack of knowledge of SL preferences, or any other reason).
In fact, the whole purpose of these redirects is NOT to reinforce other romanizations; on the contrary - It serves to reinforce what SL uses. When alternate usages are found, unless the link is a part of a personal comment, the usage is changed to fit SL's convention.
After all, even though Wikipedia has name order conventions, the article for the naming convention specifically says to redirect from the OTHER order.
However, I do not use every single romanization possible for every article - Instead I use romanizations that I encounter on the internet. I Google test them to see if someone else uses them. If not, I do not place a redirect.
In fact, in that post back there, I wasn't telling everyone that all of the conventions must be used in redirects at all times. I was telling a person how to program an SL search engine MySQL index - He ought to throw all of these variants into the search engine programming.