Difference between revisions of "Syntop wsd"
From CSWiki
Line 1: | Line 1: | ||
− | |||
== April 28 == | == April 28 == | ||
Line 15: | Line 14: | ||
* In a tree parser, you can use parent node to define the distribution | * In a tree parser, you can use parent node to define the distribution | ||
* Embed whole thing as graphical model (Jonathan's idea) | * Embed whole thing as graphical model (Jonathan's idea) | ||
+ | |||
+ | == Random Notes (on desktop in office) == | ||
+ | |||
+ | Need to vary hyperparameters ... | ||
+ | |||
+ | == May 15 == | ||
+ | |||
+ | To generate input files: | ||
+ | * run makeVocabDat.py (note: you may have to modify EXTENSION variable depending on what the files look like ... it could be either ".bnc.parsed" or ".parsed") | ||
+ | * That generates "role-count.dat" and "word-count.dat" ... if you want to change the vocab files, you'll need to sort these and rename them to "sorted-word.dat" and "sorted-role.dat" | ||
+ | * Then run |
Revision as of 21:41, 15 May 2007
April 28
Began writing code, prepared presentation.
April 29
Tested Multinomial sampler and Dirichlet prior classes; still need to test conditioned sampling.
April 30
Started processing parsed files. Presented to WN group, got following suggestions:
- For parsing using some order, switch to tree parser
- In a tree parser, you can use parent node to define the distribution
- Embed whole thing as graphical model (Jonathan's idea)
Random Notes (on desktop in office)
Need to vary hyperparameters ...
May 15
To generate input files:
- run makeVocabDat.py (note: you may have to modify EXTENSION variable depending on what the files look like ... it could be either ".bnc.parsed" or ".parsed")
- That generates "role-count.dat" and "word-count.dat" ... if you want to change the vocab files, you'll need to sort these and rename them to "sorted-word.dat" and "sorted-role.dat"
- Then run