Author |
Message |
Registered: May 26, 2007 | Reputation: | Posts: 2,879 |
| Posted: | | | | With the recent (semi-recent?) discussion of capital letters and diacriticals it might be easy to forget that we do not have all options for entering diacriticals.
For instance, I reviewed the credits recently for the movie Kolja, a Czech film, and while the diacriticals that are entered are correct they are not complete. But I was unable to enter the credits as they appear on screen with their diacriticals, as the program does not support the caron.
So, what I would like to see is that the program be expanded to support all diacritical marks. | | | If more of us valued food and cheer and song above hoarded gold, it would be a merrier world. -- Thorin Oakenshield |
|
Registered: March 14, 2007 | Posts: 1,328 |
| Posted: | | | | What I would love to see as an English speaker that do not use diacriticals is a change to the search parameter on the program so when I search Penelope Cruz, it will also find Penélope Cruz and vice versa. | | | My Home Theater |
|
Registered: March 15, 2007 | Reputation: | Posts: 5,459 |
| Posted: | | | | I have a few Polish films I've been unable to properly profile because I can't enter the information correctly, so yes I'd love to see Profiler updated to cater for all languages! And not only do I agree that diacriticals should be ignored for searches, I think it would be very helpful if they were ignored for name linking too. |
|
Registered: March 14, 2007 | Posts: 1,328 |
| Posted: | | | | Quoting northbloke: Quote: I have a few Polish films I've been unable to properly profile because I can't enter the information correctly, so yes I'd love to see Profiler updated to cater for all languages! And not only do I agree that diacriticals should be ignored for searches, I think it would be very helpful if they were ignored for name linking too. +1 | | | My Home Theater |
|
Registered: March 13, 2007 | Reputation: | Posts: 6,635 |
| Posted: | | | | Definitely needed. | | | Hal |
|
Registered: March 14, 2007 | Reputation: | Posts: 6,744 |
| Posted: | | | | I'm not that experienced in that area but I think that is very hard to implement, especially when you don't have the support of the underlying database.
You'd basically need a mapping table of all letters that could be another letter. It's not just e, é, è and ë, there are ç, c, c and all other kind of stuff even this forum can't handle. And that's just the central-european languages that use an extension of the latin alphabet.
So, while not dismissing the idea, I wouldn't get my hopes up. | | | Karsten DVD Collectors Online
| | | Last edited: by DJ Doena |
|
Registered: March 19, 2007 | Reputation: | Posts: 6,018 |
| Posted: | | | | Quoting xradman: Quote: Quoting northbloke:
Quote: I have a few Polish films I've been unable to properly profile because I can't enter the information correctly, so yes I'd love to see Profiler updated to cater for all languages! And not only do I agree that diacriticals should be ignored for searches, I think it would be very helpful if they were ignored for name linking too. +1 +1 - with all due respect for the difficulties my fellow DJ outlined. DJ |
|
Registered: March 15, 2007 | Reputation: | Posts: 5,459 |
| Posted: | | | | Does anyone know how it works in other programs? Google seems to ignore diacriticals in its searches, I wonder how they get it to work. |
|
Registered: March 14, 2007 | Reputation: | Posts: 6,744 |
| Posted: | | | | Quoting northbloke: Quote: Does anyone know how it works in other programs? Google seems to ignore diacriticals in its searches, I wonder how they get it to work. I think they have a huge database of aliases somehwere because it even finds results using the corresponding singular/plural case. And since this an entire company devoted to finding things I think they know a thing or two about these issues. | | | Karsten DVD Collectors Online
|
|
| Blair | Resistance is Futile! |
Registered: October 30, 2008 | Posts: 1,249 |
| Posted: | | | | Quoting northbloke: Quote: Does anyone know how it works in other programs? Google seems to ignore diacriticals in its searches, I wonder how they get it to work. Diacritics are usually associated within the program's "character map" (so-to-speak), the method for which can vary depending on the purpose of the terms being used. An example of character association are "case sensitive" letters. Ever notice how some websites say that your username and password are case sensitive while others don't? That is a result of (or lack of) associating the letter "A" with the letter "a" and so on through the alphabet. A site (eBay for instance) registers my username "Blair" in their database, exactly how I typed it. Associating capital and lowercase letters has a two-fold benefit. It means I can go back and [ may be able to] type my username in as "bLaiR" if I want, and I would still end up logged in as "Blair"... exactly how I typed it in during registration. It also prevents anyone else from coming along and being able to create the username "blAir" or a similar variant. This association is also why/how when you type using a word processor or often in texting that the software automatically capitalizes the first letter of each sentence. ("Let's see, this is the first letter and he typed 'a', so I need to make it an "A.'") For searches, diacritic association works in a similar but usually more complicated way. In the most basic style (Google is a very bad example because their system is far more complex) you type in the word "fiancee" (without the accent) and all at once the system looks for every diacritical variation of the word that it can [find in the database], returning them all. "Fiancee" will likely end up at the top of the list but further down you will eventually find "fiancée" (with the accent). The same works in reverse. Type "fiancée" and those results come up with "fiancee" somewhere in the list. Dealing with diacritical variations in this manner means more complex searches and therefore more time to perform them. | | | If at first you don't succeed, skydiving isn't for you.
He who MUST get the last word in on a pointless, endless argument doesn't win. It makes him the bigger jerk. | | | Last edited: by Blair |
|