Generate state-action-reward triples en masse and directly model the q-value function. Add Monte Carlo Tree Search.
AlphaGo cost google around $35 million, it may be hard to reproduce that success. But, could I make it do something reasonable?
Generate state-action-reward triples en masse and directly model the q-value function. Add Monte Carlo Tree Search.
AlphaGo cost google around $35 million, it may be hard to reproduce that success. But, could I make it do something reasonable?
A vector space representing linguistic phonotactics across languages.
Given a set of phonemes in a language’s phonemic inventory, but with N removed, predict which N phonemes are missing.
Turn the weights used in the prediction into a vector.
Then generate random vectors and convert them back into phonemic inventories somehow.
Flattening a vocabulary to a single number (word ID) such that the distance between ID’s best approximates the semantic distance between the words.
Initialize:
Refine:
In addition to the features outlined for Concept: Maximum Basic Goodness 2:
A hierarchy of jurisdictions to which humans can subscribe or belong. For example: United States -> Washington -> King County -> Seattle -> District 6
Taxation: jurisdictions declare tax policy which is implemented by the system. For example: on a monthly basis, all human accounts are taxed .1% on amounts over $20000. All other accounts are taxed .1% on all amounts. All tax revenues accrue instantly to the jurisdiction’s treasury.
Privacy policy: for example, for all account pairs for which net daily transfers exceed some threshold, all transactions pertaining to the pair will be public; all other transactions will remain private.
Campaign finance: with jurisdictions, elections occur according to a schedule. For each election, there is a registration period for campaigns to declare themselves. After the campaign season commences, donations to campaigns are accepted from human accounts only. Donations above a certain limit are ignored. Donations above a certain limit are made public.
Wealth distribution: make explicit policies for wealth redistribution instead of implying it through various programs.
Maximum code size: prohibit “law” above a certain complexity. (But how do you set the limit appropriately?)