I built a provisional dictionary for Tamil Scrabble, which is in the zip file Lexicon.zip (maximum of upto 12 MB of file size format can be attached here just simply, and let me know if whether this is combined for all of the attached set of file sizes or if whether it is for individual file size). I am not considering other Indian languages like Hindi, Kannada, Telugu and Malayalam right now because there are too many letters. There are 4 varieties of each hard consonant and unlike Tamil (and Malayalam) the vowel symbols for ா, ெ, ே, ை, ொ, ோ and ௌ are not separated. Native name for Tamil is தமிழ். Administrators might consider uploading Tamil dictionary to server if they would want to.
In my version, the tiles for letters ஔ, ா, ெ, ே and ை are kept separate. But the vowels ி, ீ, ு, ூ and ் are combined with their respective consonants and single tile exist for these vowels as consonant - vowel pairs. If anyone wants, they can modify the dictionary to keep the consonants and vowels ா, ி, ீ, ு, ூ, ெ, ே, ை, ொ, ோ, ௌ and ் separately. And ஔ joined if they would want to. This can be done by a computer program using unicode characters, by referring to below what each character in the dictionary corresponds to what Tamil letter.
I got all Tamil words with meanings from the Tamil lexicon, the digital South Asia library published by the University of Madras, by searching with option 'Words beginning with' and to 'Search entry words only (not definitions)'. https://dsal.uchicago.edu/dictionaries/tamil-lex/ Searching for all of 36 Tamil letters from which any word can possibly start with, namely, அ, ஆ, இ, ஈ, உ, ஊ, எ, ஏ, ஐ, ஒ, ஓ, ஔ, ஃ, க, ங, ச, ஞ, ட, ண, த, ந, ப, ம, ய, ர, ல, வ, ழ, ள, ற, ன, ஸ, ஜ, ஷ, ஹ and ஶ. Words starting with க்ஷ also start with க and words starting with ஶ்ரீ also start with ஶ.
That would produce a XML file of all search results with words, definitions and other attributes by viewing raw page source of the results page in the web browser, which can be easily copied and pasted into a text editor. My final Tamil dictionary file in Lexicon.zip zip file contains 115914 words. My Tamil.zip zip file contains 4 pass python scripts, tamil.py, read.py, write.py and lexicon.py in order to do with these following tasks:
Assuming that the files stored from above 36 search results are named as 0.txt, ..., 9.txt, a.txt, ..., z.txt, tamil.py converts XML raw page source to word=meaning format. read.py: To drop off needless characters from words like superscript notation numbers, < and process words with space, -, (, ), [ and ] as follows: After replacing a(b)c with a(b)-c, Input word: a, output word: a Input word: a-b, output words: a, b, ab Input word: a(b)-c, output words: a, abc, if space in a then left, right, left right concatenation are considered. If b = ளு then last letter in a or last 2 unicode characters in a are not dropped if and only if they end with ி, ெ and ொ. For b = லு: if and only if they end with ெ and ொ. For b = ணு, if and only if they end with ு and உ. For b = னு, if and only if they end with ி and ெ. For all other values of b except when there are 2 Tamil letters or 3 unicode characters between brackets, last letter in a or last 2 unicode characters in a are not dropped. Input word: a(b)-c(d)e, output words: a, abde with the same conditions as above. Input word: a-b (c), output words: a, c with the same conditions as above. If space in a then left, right, left right concatenation are considered. If space in c then left, right, left right concatenation are considered. Input word: a(b)-c (d), output words: a, abc, d with the same conditions as above. Input word: a(b)-c(d)e (f), output words: a, abde, f with the same conditions as above. Input word: a b, output words: a, b, ab after dropping off ending letters like ச் and த் in a. A decision was also made to avoid joining ab in case that b began with a vowel for 3 words, but eventually later on I dropped off with that idea. Input word a b-c, output words: a, b, ab, bc, abc after dropping off ending letters like ச் and த் in a. If letters like தா end in b, then it is changed to தரு on merging it with c. There are small number of changes that have been made under some exceptional cases like replacing -- with - and trimming off last letter - each of which might occur for only one single word in the dictionary. One letter words are dropped until they have a meaning themselves and not simply defined as a compound of consonant + vowel. write.py converts all of Tamil words in unicode format to Valluvan font format. One letter words in Valluvan font format are dropped in this process and words containing ஶ but not ஶ்ரீ are dropped off since ஶ character is not used in Valluvan font format. lexicon.py removes 4 duplicate words that are being starting off with கௌ, replaces $, %, & for ப், ல் and ன் respectively since their uppercase and lowercase equivalents of L, P and U had been confused by Scrabble3D application, drops off 2 words starting with க் and 1 word starting with ர், replaces 2 words with dropped off ஶ letter with ஶ்ரீ and drops off 23 words with missing ஶ. Such words can be filtered by their pronunciation containing š that is given in the second word separated by space in all of the word meanings. š is not necessarily only used for ஶ but also for ச, ஸ and ஷ and so it requires human intervention to separate the actual ஶ missing words. In read.py and lexicon.py, resulting duplicate words with different meaning are comma separated and merged with each other. During all the 4 phases, small corrections are done to some of the dictionary words. ஶ letter is dropped in a large number of words in Tamil lexicon dictionary that is available at the above mentioned website page only.
Tamil.zip zip file also includes 3 other non-passing of dictionary words related python scripts, but they are used to do other tasks when related to the Tamil Scrabble played game. frequency.py lists frequency of occurrence of each Tamil letter per thousand letters and total count is given in brackets. Because letters are added to python dictionary and are listed directly in order from that hash set, they are not sorted. Sorting can be done by some online tool although some human intervention is needed to sort some special characters ஃ, ஶ்ரீ, ா, ெ, ே and ை and digraphs from 0 - 9 are used but that single letter is not used in Valluvan font format at all for ஶூ, ஸு, ஜு, ஷு, ஹு, ஶு, ஸூ, ஜூ, ஷூ and ஹூ respectively. tamizh.py removes all of the word meanings in the tamil.dic file and creates new file format without all of the word meanings: tamizh.dic. Either tamil.dic or tamizh.dic can be used for loaded dictionary in Scrabble3D application. tamil.dic is attached in Lexicon.zip zip file format but tamizh.dic is not attached, it can be generated by using computer program tamizh.py that is attached in Tamil.zip zip file format. If there is ; in definition then while hovering mouse over played word in Scrabble board of Scrabble3D application, tooltip text definition upto that point is shown. If any player wants to see entire long meaning as defined in dictionary file, then it is good to change ; to some other character either by using computer program or by using replace function of text editor. I am not sure if whether the tooltip text definition in Scrabble3D application is limited to some number of characters or not. I am not sure if whether that definition shown is trimmed if it exceeds a certain number of characters or not. find.py is used as a Tamil Scrabble helper to find a word after parsing through all words in the dictionary and prints the word only and not the meaning. It can be in command prompt with not correctly showing all of the non-ASCII characters or text editor with correctly showing all of the non-ASCII characters. The python script can be edited to output the meaning also although that is needless and just space consuming if it were outputted to a file. Meaning is only used for stylish tooltip text definition and not important for Scrabble played game. Input should be given as a set of characters and my python script searches for words which are a strict subset of these set of letters. These letters may be entirely the set of tiles from the rack with zero or one tile extra with or without the one placed on the Scrabble board in the played game or a player's own desired creation and search query. For python 2, hexagonal escape sequence byte value pairs has to be given for accented and non-ASCII characters for the input set of letters and characters, to be searched for a subset after parsing through all of the available and possible words in the dictionary, but with python 3 it works out properly although some syntax changes to the code have to be made like and such as those as division returning quotient value only should be replaced from / to //. Print statement should be replaced from print x to print(x). If ord(c) value of character is between 128 and 191, then in python 2, replacement value of first byte is 194 and that of second byte is ord(c). If ord(c) value of character is between 192 and 255, then in python 2, replacement value of first byte is 195 and that of second byte is ord(c) - 64.
For the demo game which is shown in screenshot above 20 tiles are used in rack. There are 152 distinct Tamil characters which are used in the dictionary, so greater number of tiles in rack offers greater flexibility. Using 20 tiles in rack offers lot of permutations of letters, but it did not slow down the computer opponent players' performance, so I bet that trie data structure is used for storing the dictionary and the search function for all of the defined words in it as well. I started a played game but uppercase L, P, U were confused with their lowercase equivalents, so replacing them with $, % and & I ran a demo to check out if whether everything is in the right place. In my home made version of the played game of Tamil Scrabble, in which board and tiles were made on chart paper pasted with gum on cardboard, cut by scissors, I used front and back tile of each tile, although this feature is not supported by Scrabble3D application. It is in line with choice of questions in Tamil assignment and examination in school days. Instead of having 20 tiles in rack with 20 letters, only 10 tiles in rack were used with 20 letters front and back, so words greater than 10 letters cannot be formed but front and back letters do not combine in a word so easily and so offers more flexibility in forming words from these Tamil tiles for all of the Tamil letters. Even it had a hard time in forming Tamil words, so we decided to play later on with 50 tiles per player front and back drawn at the start of a game, randomly chosen from 464 tiles. It encouraged playing long Tamil words instead of shorter ones and was best played on 21×21 Scrabble board with maximum of 200 tiles. First 441 tiles, on a 21×21 square were printed on a chart paper, then I added 23 more tiles when I printed the tiles needed for Scrabble with mathematical equations on a chart paper. Plenty of short words are available in English Scrabble and English phonetic Scrabble, so that played game of Scrabble board can be densely filled accommodating upto 300 tiles, but it is not the case for Tamil Scrabble and Scrabble with mathematical equations, not much closely packed tiles can be formed from them and so even 200 tiles saturate with that played game of Scrabble board.
With 20 one sided tiles in the rack, a high scoring tile must be played or penalty for that tile if left stagnated in rack at end of played game, if any player finishes and not if all people pass. With 10 tiles in the rack printed front and back, a high scoring tile can be played or used for its back side cheaper one and frequently used letter. If passed away by the latter way, then no penalty for that tile if left stagnated in rack at end of played game, if any player finishes and not if all people pass.
Probability of tiles getting left stagnated in rack at the end of played game is low for English Scrabble and English phonetic Scrabble but it is high for Scrabble with mathematical equations. But for Tamil Scrabble, players could cleverly pass away if they first use high scoring, rarely used tiles and ் combined from other low scoring and frequently used letters.
To play Tamil Scrabble games successfully and for Tamil letters to be displayed properly in Scrabble tiles, Valluvan font format needs to be downloaded and used from Font.zip zip file. Font.zip zip file also contains other Tamil fonts, 23 of them, of which 3 of them are in Unicode format (Akshar, Latha and Vijaya), that is keyboard characters use Latin Script character set, non-ASCII character set are used for all of the accented characters and Tamil letters are only used in Unicode character set, 2 of them (Valluvan font format and Anangu Valluvar font format) uses all combined letters for ி, ீ, ு, ூ and ் for all of the accented and non-ASCII character set except for ஸு, ஜு, ஷு, ஹு, ஶு, ஸூ, ஜூ, ஷூ, ஹூ and ஶூ, while the rest 13 of them (Boopalam, Mohanam, Saraswathy, Baamini, Kanchi, Kovai, Madurai, Salem, Thanjavur, Tirunelveli, Tiruvarur, Trichy and Vellore) uses limited number of characters only that are available in the keyboard and so consonants and even vowels for ி, ீ, ு, ூ and ் are separately included them until they are heavily modified in writing form that a separate character cannot be for them at all. It would be good if in Scrabble3D application, for played moves shown in Messages menu bar and for move validation by the current player and all of the human opponent players that a certain word is not in dictionary, accept anyway, the same font is used for those words that is used for tile set just in case that non-Latin script letters are used for the directly keyboard typed characters.
This is the frequency of occurrence of each Tamil letter in Tamil dictionary per thousand letters and total count is given in brackets. Players can choose their own letter distribution in Tamil Scrabble at their own wish based upon this data. I have also included what each 8-bit character corresponds to what Tamil letter in Valluvan font format in brackets. My desired points of each Tamil tile for non-Sanskrit characters in my own make of Tamil Scrabble that had been printed on chart paper front and back side pasted with gum on cardboard and cut with scissors are also given here in subscript notation in brackets. The same points for each Tamil tile can also be used in Scrabble3D application. My own make of Tamil Scrabble had no use of Sanskrit characters. Sanskrit characters are ஸ, ஜ, ஷ, ஹ and ஶ. ஸ, ஜ, ஷ and ஹ but not ஶ are used in tamil.dic dictionary file that is being attached in Lexicon.zip zip file. tamizh.dic without meanings of words can be generated from tamil.dic with meanings of words by using my own python script tamizh.py as a computer program that is being attached in Tamil.zip zip file.
Random tiles with any desired letter frequencies for different Scrabble board games to be played can be always generated by using a computer program and not necessarily depending upon Scrabble3D application at all. For different board sizes and for different total count of number of tiles used in game that is played. Scrabble3D application has got a feature to generate not more than 100 number of random tiles to use in a game but that would always be generating all of the different letters and characters that is used in the letter set with equal probability only. It will not respect the frequency of occurrence of each letter and character in the loaded dictionary and used dictionary at all, isn't it? A dictionary need not necessarily always be for a language but also Scrabble with mathematical equations or a special subset of numbers like and such as those as prime numbers, perfect square numbers, perfect cube numbers, triangular numbers, etc. and so on.
Creating any user's own forum is possible in this bulletin board and this feature is not available in many of the other bulletin boards. Usually it would be only possible for the website page owner but usually it would not be available for non-administrators and non-supermods at the very least, but request could be made and suggestion could be given to the website page owner to create new forums as required. I think that here any user can be moderator of the forum which that very specific and particular user had actually created with, exactly and precisely.
Tamil0.png image is attached, it is a demo game only. Due to confusion between uppercase letters L, P, U with their lowercase equivalents by the computer, I had to abort the human played game. Computer demo was very useful to check out and find out if everything had been in the right place. Because it is just a demo game, the game file in .ssg format has not been saved at the end of the played game and only board image is available and not saved game played in .ssg format. So, there is no modification in Game.zip zip file that was attached from its own last time, let alone, but only for Board.zip zip file. Game.zip zip file is small enough to be attached here every time that it has been changed up away with, let alone, but only Board.zip zip file is large enough to be attached here every time that it has been changed up away with.|addpics|ubb-21-06cc.png-invaddpicsinvv|/addpics|
1000 tiles, minimum number of tiles needed in Tamil to represent every possible letter that is rarely used, is best fit for 45×45 square board or 15×15×15 cube board.
https://twitter.com/UntangleIndia/status...3001326592?s=08 Tamil Scrabble has been designed and developed by Santhana (kris_rs on Twitter) in a different way. He kept consonants separate and all of the vowels separate, ா, ி, ீ, ு, ூ, ெ, ே, ை, ொ, ோ, ௌ and ். And joining ஔ into one single tile, rather than keeping it separate as two different tiles ஒ and ள. And not merging these frequently used vowels with their consonants together: ி, ீ, ு, ூ and ். And not keeping ஔ tile separate as two different tiles ஒ and ள. Under the name of சொற்களம் (Sorkalam), it is available for play at Untangle India, Teynampet, Chennai. Although under the current COVID-19 situations and curfews and lockdowns throughout the city, it is not possible to move around anywhere easily in the city and many places are closed right now. It would be worth a try once after the COVID-19 situation is over, last patient recovers. And by either containing the virus or by developing a vaccine for it.
Unlike other bulletin boards, it is surprising that even ordinary users who are not administrators can create their own forum and be a moderator for it in this bulletin board. However, many features that are offered by other bulletin boards are missing in this bulletin board. For example, it is impossible to see what users are currently viewing a specific thread. Number of views by other users can be seen only for images that are attached inline, but not for the images that are not attached inline. So not for the other attached file formats which cannot be placed inline in any way. Users can be classified into guest users (who cannot be shown to other registered users as viewing any thread, but that their view count for viewed inline images that are attached in any post are also included) and registered users (who can be shown to other registered users as viewing any thread, but that their view count for viewed inline images that are attached in any post are also included).
Information about editing a post later on, once after that it has been posted is also not shown in this bulletin board, and that feature would be commonly available in other bulletin boards. What bulletin board software does this forum make use of? My most experienced forum that I have used for years is Mersenne Forum, software used from vBulletin, and that is used for discussing about prime numbers, discrete logarithms and other factoring projects, etc. and so on.
Last but not the least, I wonder if whether there is any option to delete any posts from this forum by any of the non-moderators within the allotted editable time frame of 1 hour.
Attached Tamizh.zip zip file in this current post contains tamizh.dic Tamil dictionary file with words only and without their own meanings. Attached Lexicon.zip zip file in the previous post contains tamil.dic Tamil dictionary file with words and also with their own meanings.
Zitat von Raman im Beitrag #2Last but not the least, I wonder if whether there is any option to delete any posts from this forum by any of the non-moderators within the allotted editable time frame of 1 hour.
I have made a few corrections to Tamil dictionary. You may consider uploading Tamil dictionary to the server if you want to. My Tamil dictionary contains 115914 words. Most, if not all words are there in Tamil dictionary. There are multiple inflections for each verb in Tamil language and not everything of that type is there. This is only to limit the performance of the computer opponent players but human players playing online in different computers or offline in same computer can validate by poll if word played is not in Tamil dictionary.
I thought that uppercase versus lowercase confusion happened for computer opponent players only for letters L, P and U and that was what was reflected in my previous dictionary. But in 3D Tamil Scrabble, 15×15×15 board with 1000 tiles, uppercase E was confused with lowercase e. I changed the dictionary to replace all of lowercase letters with other symbols. Accented non-ASCII characters such as é and è will not get converted to their uppercase characters while loading the dictionary, isn't it? There are at least 152 different characters used in Tamil dictionary.
Valluvan Font from Font.zip zip file attached in post 1 in this thread is needed to play Tamil Scrabble properly. Or it is available for download somewhere online, people can search for it on Google and get it.
Tamil.dic from Tamil.zip zip file attached in this post in this thread contains all of words and their meanings, for slow loading and file size is big enough already. Tamizh.dic contains only words and without their meanings, for fast loading and file size is small enough already.
Zitat von Scotty im Beitrag #6Do you want me to upload the Tamil dictionary to SF.net and make it available to everyone?
Please give me a day's time. I will add a letter set to the dictionary and fix a few errors (more details about it I will post tomorrow). I will attach a corrected version of the Tamil dictionary here tomorrow, which you can upload on Source Forge and make it available to everyone.
Zitat von Scotty im Beitrag #6Do you want me to upload the Tamil dictionary to SF.net and make it available to everyone?
Yes, please go ahead with Tamil.zip zip file that I have attached with this post. I have added a letter set to the dictionary and fixed a few errors. You might be interested in running consistency checks before considering them, isn't it? Tamil.zip zip file contains two different dictionary files, tamil.dic with definitions of words and tamizh.dic without definitions of words. For best results, it would be better to change semicolon in tamil.dic to some other character, say colon to display the entire meaning. But if it is long, tamil.dic is fine as it is, as short meaning will be displayed upto first semicolon whenever mouse hovers over any placed word on the Scrabble board. I included all of the words that are present in the Tamil Lexicon published by the University of Madras within my own prepared Tamil dictionary and dropping off the words which contain ஶ which is not even part of extended Tamil alphabet but including all of those other words whatever that contain other Sanskrit characters which are ஸ, ஜ, ஷ and ஹ. Not all of the inflected forms of each verb are included. It is not an official dictionary, so not all of the words are included in it but if human players want to play online on different computers over a network connection or offline on same computer, then they can validate by poll if a played word is not present in the Tamil dictionary. But if a player wants to play only against any of the computer opponent players, then however strong the performance of the computer may be, not including all of the inflected forms of each verb can be used to limit the performance of the computer opponent players only.
I have included a letter set with the dictionary by considering the frequency of occurrence of each letter in the Tamil dictionary and that drops off all of the Sanskrit characters. It includes only Tamil native alphabet. Sanskrit characters ஸ, ஜ, ஷ and ஹ but not ஶ are present in the Tamil dictionary and they can be customized in the letter set as per player's wishes by looking here about their frequencies of occurrence of each native character and inflected character in the dictionary. Please note that the letter set contains at least 1000 characters and something that is best fit for 45×45 square board or 15×15×15 cube board. It can be changed for smaller boards and smaller letter sets as per player's wishes. But note that Tamil letter set contains too many rarely used characters and it is impossible to represent each letter and balance the letter set for smaller board and fewer tiles. If Scrabble3D application contains a feature that generates random tiles depending upon frequency of occurrence of each character in the linked dictionary, then it is possible to generate random letter set for smaller board and fewer tiles.
My version of dictionary is more readable and flexible. I am joining the frequently used vowels ி, ீ, ு, ூ and ் with their consonants, but some players might prefer a version that separates all of vowels ா, ி, ீ, ு, ூ, ெ, ே, ை, ொ, ோ, ௌ and ் from their consonants. They can easily modify the dictionary at their own wish by using computer program and using unicode font for their own used tiles. People can refer to here what Tamil letter that each character maps to in the Tamil dictionary. Valluvan font, attached in Font.zip zip file in my post 1 in this thread, or that it is available for download from Google Search engine is required to play Tamil Scrabble with my prepared dictionary. Can the Tamil dictionary be inbuilt to load that Tamil font whenever the Tamil dictionary is being loaded, if it is available in the computer already downloaded and installed, or could it be made possible to be downloaded, installed and then made use of that Tamil font whenever the Tamil dictionary is being loaded, if it is not available in the computer already downloaded and installed?
In my post 5 in this thread, (1) all of the lowercase ASCII characters are converted into symbols and other non-ASCII characters which were not used by the Tamil dictionary. I am running out of 8 bit characters for sure when I am trying to avoid those with ASCII value of 0 to 31 and those with non-ASCII value of 128 to 159. (2) I raised points for some of the Tamil consonant letters because computer ignores playing them frequently and they stagnate in computer's rack at the end of the played game. I am sure enough that raising points for some of the Tamil consonant letters would be effective for human players also and no two letter words can be formed out of them. In this post 7 of mine in this thread, in my newly attached version of Tamil dictionary in Tamil.zip zip file, it seems that comma character is also not allowed in words. In my previously prepared version of Tamil dictionary, I used comma character for lowercase ASCII character of x. (1) Saved files use comma character for separators of letters and history and that can cause confusion to the program while loading them. (2) When I checked the letters that are remaining in the bag, the comma separator got changed to x separator. (3) Also in the messages and the game course, the comma separated list of played words became modified to x separated list of played words. As a result of it, I changed the comma character in the Tamil dictionary to ý, a non-ASCII character which was not used by the Tamil dictionary even after all of the lowercase ASCII characters are converted into symbols and other non-ASCII characters.
It seems that random seed that is generated for any of game that is played in Scrabble3D application gets only known to players after that game is saved and saved file is opened for viewing the random seed that is generated by the program, isn't it?
I have once again attached set of programs I have used to prepare the Tamil dictionary from the Tamil Lexicon published by the University of Madras under the name of Code.zip zip file. It is the same as Tamil.zip zip file that I attached in my post 1 in this thread, except for a few more new programs that I have added to it. symbol.py python script replaces ($, %, &) by (l, p, u) in my previous version of prepared Tamil dictionary that is also attached here. To read all of the original data from symbol.dic dictionary file and to write all of the changes to convert.dic dictionary file. convert.py python script replaces (a, b, c, d, e, f, g, h, i, j, k, l, m, n, o, p, q, r, s, t, u, v, w, x, y, z) by ({, |, }, ~, ó, õ, ö, ü, þ, !, #, $, (, ), *, %, <, >, /, -, &, :, +, ,, ., ?) and thereby retaining the same set of symbols ($, %, &) that were previously used for (l, p, u). To read all of the original data from convert.dic dictionary file and to write all of the changes to replace.dic dictionary file. replace.py python script replaces comma by ý and adds a letter set to the dictionary that is at least 1000 units big enough and something that is best fit for 45×45 square board or 15×15×15 cube board. To read all of the original data from replace.dic dictionary file and to write all of the changes to tamil.dic dictionary file. In my previous version of my prepared Tamil dictionary, I thought that only lowercase ASCII letters of l, p, u were getting confused with their uppercase equivalents whenever Tamil dictionary was loaded by the Scrabble3D application. Later I observed that e also had the same behaviour, so that I felt that it would be best enough to change all of the lowercase ASCII characters are converted into symbols and other non-ASCII characters which were not used by the Tamil dictionary. You can also test out that flaw by yourself with the convert.dic dictionary file with meanings of words and convert_lexicon.dic dictionary file without meanings of words that I attached under Defective.zip zip file in my post 49 of thread 'Unfixed bugs in Scrabble3D application' in 'Bug reports and suggestions' forum in 'English' group, if you think that lowercase characters do not converted to their uppercase equivalents on loading the dictionary. Uppercase ASCII letters and lowercase ASCII letters both of them are used in those 2 dictionaries without converting all of the lowercase ASCII characters are converted into symbols and other non-ASCII characters which were not used by the Tamil dictionary. More information about case conversion of all of the ASCII letters and accented non-ASCII letters in my post 49 of that thread only.
If you are considering uploading Tamil dictionary to sourceforge for use by everyone, may be only after running some of the consistency checks from your own side, then you can also consider updating the other dictionaries that have not been updated for ages. I have created my own dictionaries for the same and put them in my own dictionary folder, and that is good enough for playing against other computer opponent players and other human opponent players offline in my same computer. And certainly not at all for playing with other human opponent players online in different computers over a network connection. (1) The English dictionary, UK or International version, not the default category which is USA and Canada version could be made to point to CSW2019 dictionary. Same can also be done also for the default category which is USA and Canada version, but I suggest that you search for it by yourself on the web to find out the latest version of that by yourself. Actually that CSW2019 dictionary is made use of for all of Scrabble tournaments in India and I am just fine enough to keep the latest version being updated of that dictionary only. Here is the link to the CSW2019 dictionary which is the latest version of the English dictionary, UK or International version, not the default category which is USA and Canada version. CSW2019 words without definitions: https://drive.google.com/file/d/1oGDf1wj...IWMh5uJs8s/view. CSW2019 words with definitions: https://drive.google.com/file/d/1XIFdZuk...CrrgJbLBxV/view. Whenever definitions of words are included in language dictionary, then it is very nice for tooltip text for words meaning are displayed whenever mouse hovers over the placed word in the Scrabble word in played game in Scrabble3D application. (2) Can maths dictionary be modified to include more equations, and all of equations that are less than, say, 13 characters in length. For best results, it would be good enough to parse over each maths equation that is played by any of the human players or that is to be played by any of the computer opponent players to check for validity. And for playing a move by making use of computer opponent player, it would be good enough to parse over each maths equation. Do lexical analysis, syntax analysis and then semantic analysis by making use of context free grammar and a trie data structure. If human players want to play online on different computers over a network connection or offline on same computer, then they can validate by poll if a played equation is not present in the Maths dictionary. But if a player wants to play only against any of the computer opponent players, then however strong the performance of the computer may be, not including all of the maths equations can be used to limit the performance of the computer opponent players only. (3) English phonetic dictionary could be modified to include within the support for the following types of the diphthongs: aɪ, eɪ, ɔɪ, aʊ, oʊ, tʃ and dʒ. It could be made as player's choice whether they want to play any of the types of the games of the English phonetic Scrabble by making use of diphthongs or not and then also whether they want the option of allowing diphthongs to be combined from 2 monophthongs or not as well. In English Scrabble, at least a small fraction of words within that used dictionary containing Q exist without being followed by U. But in English phonetic Scrabble, not any of one single word within that used dictionary containing a exist without being followed by ɪ or ʊ. Not any of one single word within that used dictionary containing e exist without being followed by ɪ. Not any of one single word within that used dictionary containing o exist without being followed by ʊ.
ý and ÿ accented characters with non-ASCII character values of 253 and 255 respectively are not used by Valluvan Tamil font but loading them by using Scrabble3D application if they are replaced by using some other character that is used in Valluvan Tamil font, namely lowercase ASCII letter of x which has ASCII value of 120 should not present any of problem either at all. I am assuming at least those accented non-ASCII characters are not converted into their uppercase equivalents while loading the dictionary by using Scrabble3D application, if not at least for the lowercase ASCII letters which have ASCII value of inbetween 97 and then 122 enough only actually.
ÿ, ', " are not used by Tamil dictionary. ÷ and ø are used only as combined with Sanskrit letters of ^, _, `, a and b. I felt that those characters would be left for use of digraphs alone and not get confused as individual letters in dictionary words and not in dictionary meanings. All of the other symbols and accented non-ASCII characters which are even not used in digraphs alone but are present in different unused Tamil letters in Valluvan font are used as individual letters in dictionary words and not in dictionary meanings.
Zitat von Raman im Beitrag Unfixed bugs in Scrabble3D application.I think that Bussinchen has actively subscribed to all of the threads that I have made a post in. He actively logged in into this forum after I think that he had received an e-mail notification to my own subscribed thread and thanked my made post in the thread of Tamil Scrabble, in other languages forum, where all of other languages which are not listed separately above in International group are discussed with, in any further. Or the time that I made the post would be the active period of time that he would actually and usually log in to the forum, it seems so. But he should have thanked that post of mine without actively reading all of the long material and without understanding all of them. After all, he should have trusted me. I believe that none of the German people here know Tamil at all, isn't it?
I have thanked you because you have made the effort to create a dictionary file in Tamil language for Scrabble3D, which is great. You deserve a "Thank you" in general, even though I cannot speak Tamil or play Scrabble in Tamil language. I have not activated any e-mail notification for this forum, because normally I look in this forum every day. By the way: I am female.
Zitat von Bussinchen im Beitrag #10I have thanked you because you have made the effort to create a dictionary file in Tamil language for Scrabble3D, which is great. You deserve a "Thank you" in general, even though I cannot speak Tamil or play Scrabble in Tamil language. I have not activated any e-mail notification for this forum, because normally I look in this forum every day. By the way: I am female.
Helper can be built and programmed in any language very easily from the dictionary especially if it were not protected by their authors. Is Scrabble3D application the only program where our own dictionaries and letter sets can be customized, for Tamil dictionary to be incorporated into them.
It did not take too much of effort, just a few days of work from Tamil Lexicon published by using University of Madras, when I was curious to try out Tamil Scrabble by myself in Scrabble3D application as a program which I longed in for many years put together as a combined enough measure only, actually. I would love to try out different types of things, playing the same variant again and again repeatedly is not spicy enough.
In 2014, I prepared Tamil Scrabble myself, prepared with printed tiles on 2 chart papers pasted with gum on cardboard on front side and back side and cut with scissors, to try out to play with my own family, friends and then relatives. Just remembering simply that there is choice for questions and answers most of the time in Tamil assignments and examinations, I put front side and back side choice of each tile, usually front side and back side would be not combining letters to provide more flexibility on 113 different letters used in Tamil language, excluding all of Sanskrit letters and played with 10 tiles in rack. In Scrabble3D application as a program, there is no front side and back side choice of each tile and is played with 20 tiles in rack. Sanskrit letters of such as those like ஸ, ஜ, ஷ and ஹ but not for ஶ have been included in the Tamil dictionary that is prepared by me from the Tamil Lexicon published by using University of Madras. In my humble opinion, it would be best to replace ஶ by ச.
It would be good enough to see how different variant of Scrabble goes. Demo is useful to test out for flexibility of dictionary, letter set and number of tiles within the rack. In my own make of Tamil Scrabble, whenever making use of front side or back side choice of each tile, then high scoring rarely used tile may not be always played, it can be pushed for a low scoring frequently used alternative another side tile. Usually that high scoring rarely used tiles have low scoring frequently used letters printed on alternative another side of tile. Also then, similarly that low scoring frequently used tile may not be always played, it can be pushed for a high scoring rarely used alternative another side tile. However, in Scrabble3D application as a program, each tile must be played as it is, no matter if whether it is high scoring or low scoring or rarely used or frequently used, since no making use of front side or back side choice of each tile is being supported. May be that feature could be considered in future set of releasing version within the Scrabble3D application as a program, just simply, if and only if the author would ever be willing enough to work out upon it later onwards, only actually then. Along within that, not only making use of front side or back side choice of each tile to be supported, but also from among multiple choices of letters for each tile to be chosen from, of within the set.
In my own humble opinion, it would be good enough to make use of, for 6 face cube tiles for Hindi, Kannada, Telugu & Malayalam Scrabble. I am not considering other Indian languages like Hindi, Kannada, Telugu and Malayalam right now because there are too many letters. There are 4 varieties of each hard consonant and unlike Tamil (and Malayalam) the vowel symbols for ா, ெ, ே, ை, ொ, ோ and ௌ are not separated.
Exact number of choices of letters for each tile to be chosen from, of within the set, can be made in commercial sets by making use of n-sided prism or n-sided pyramid. In computer platforms as a software, they can be easily done and then easily customized by making use of computer program, over thereby the precise measurements and then precise makes for the n-sided prism or n-sided pyramid does not matter at all over thereby.
Zitat von Raman im Beitrag #8In this post 7 of mine in this thread, in my newly attached version of Tamil dictionary in Tamil.zip zip file, it seems that comma character is also not allowed in words. In my previously prepared version of Tamil dictionary, I used comma character for lowercase ASCII character of x. (1) Saved files use comma character for separators of letters and history and that can cause confusion to the program while loading them. (2) When I checked the letters that are remaining in the bag, the comma separator got changed to x separator. (3) Also in the messages and the game course, the comma separated list of played words became modified to x separated list of played words. As a result of it, I changed the comma character in the Tamil dictionary to ý, a non-ASCII character which was not used by the Tamil dictionary even after all of the lowercase ASCII characters are converted into symbols and other non-ASCII characters.
This should be corrected to:
In this post 8 of mine in this thread.
Please make the correction in the earlier post, post 8 of mine in this thread and delete the second half of this post if you would ever want to. I was, then, not able to edit my post after the follow up post was made even by me, and only then that I realized my mistake after that period of time. It was not possible by me to edit my post then that, even if 1 hour period of time has not elapsed after my posting. It was not possible for me to delete the follow up post of me by myself, edit the first post, and then re-post the follow up post once again. Also then, only that supermods or administrators to all of the forums and moderators to a particular (specific) forum can only edit non-last post in a thread no matter if whether that it is older enough than 1 hour period of time or not or a last post in a thread if whether that it is older enough than 1 hour period of time, then, isn't it?
Don't find obvious issues with the dictionary. But since you replace every character my sanity checks don't bite. Some minor issues: I'm not happy with 1k tiles. It's easier to multiply the number (there is a control on top of the list) than shrinking. Maybe you have numerous rarely used characters that need a frequency of zero, which would multiply to n in that case. But even then, isn't it easier to adjust these zeros to ones? Perhaps I'm wrong and you cannot play classic Scrabble at all and need super large boards.
Besides this, may I use your name and email address in the header? Credits and questions belong to the author.
Zitat von Scotty im Beitrag #12Besides this, may I use your name and email address in the header? Credits and questions belong to the author.
Yes, you can please go ahead with it. My name is Raman Viswanathan and my e-mail address is raman22feb1988@gmail.com.
But please note that it is Tamil Lexicon, the digital South Asia library. The work belongs to University of Madras, it is open source and freely available to everyone and it is not copyrighted material at all.
Zitat von Scotty im Beitrag #13And maybe I missed the point but what's the difference between Tamil and TamilZH? Which one should be on SF.net?
Tamil.dic contains all of the words with meanings and Tamizh.dic contains all of the words without meanings, both of them can be parsed by using Scrabble3D application as a program. I would suggest to consider the Tamil.dic, the one single dictionary file containing all of the words with meanings. If you do not have an issue with attaching large files, then you can consider Tamil.dic. If you want shorter files, then you can consider Tamizh.dic. Larger files should take long time only while loading, and Tamil.dic is not too big enough file. Computer opponent player's performance, especially for smaller enough 3D boards and then not even for larger enough 2D boards should not depend upon size of dictionary, but only let alone number of words that are being present within the dictionary, isn't it? Incidentally that all of types of file formats are not supported to be attached here, and they need to be zipped, so that multiple files of them can be combined together within one single zip file.
Tamizh is the native and then local pronounciation of Tamil. The l that is being present in the word of Tamil is voiced retroflex approximant, and then not for voiced alveolar lateral approximant at all.
--
Within this forum, there exists no feature of fewer enough minute gap inbetween any of types of 2 successive posted messages at all, isn't it? Usually some forums, in order to avoid with spam, might ask for unregistered guest users and newly registered users having within fewer enough number of posted messages to verify for a captcha code to check out it is really human posting messages manually and then not automated computer programs can be easily programmed to read them at all, isn't it? Captcha is something that is being easily for humans manually and then not automated computer programs, unlike most of the another tasks.
Zitat von Raman im Beitrag #11It did not take too much of effort, just a few days of work from Tamil Lexicon published by using University of Madras
You deserve my thanks anyway.
Zitat von Raman im Beitrag #11This should be corrected to:
In this post 8 of mine in this thread.
Please make the correction in the earlier post, post 8 of mine in this thread and delete the second half of this post if you would ever want to. I was, then, not able to edit my post after the follow up post was made even by me, and only then that I realized my mistake after that period of time. It was not possible by me to edit my post then that, even if 1 hour period of time has not elapsed after my posting. It was not possible for me to delete the follow up post of me by myself, edit the first post, and then re-post the follow up post once again. Also then, only that supermods or administrators to all of the forums and moderators to a particular (specific) forum can only edit non-last post in a thread no matter if whether that it is older enough than 1 hour period of time or not or a last post in a thread if whether that it is older enough than 1 hour period of time, then, isn\'t it?
Because of your serious and useful work for Tamil Scrabble3D I created a new Tamil section in this forum and moved this thread into this new section. At the same time I gave you the right to edit your posts in all the threads you have started and gave you moderator rights in the Tamil section. I hope this is helpful for you, dear Raman, so that you can work more effectively with your posts.
Please make yourself the above mentioned corrections you wish to be made, and all other corrections you wish to do as well.