I have a question regarding the technique/technology which could be applied for the issue:
Suppose I have a rule-based tree or decision tree which predicts a variable Y based on variables A,B,C. This tree is not trained on any data but is build up because it models the 'real' system (see it as a physiologically inspired tree).
NODE 1: Is A > 10?
/ \
/ \
YES / \ NO
/ \
NODE 2: Is B > 5? NODE 5: Is C < 8?
/ \ / \
/ \ / \
YES / \ NO YES / \ NO
/ \ / \
NODE 3: Y = 4 NODE 4: Y = 2 NODE 6: Y = 9 NODE 7: Y = 6
So this is a 'generalized' tree from which I want to optimize according to data. F.e. using a table with new data points:
| A | B | C | Y |
|---|---|---|---|
| 5 | 9 | 8 | 10|
| 4 | 7 | 7 | 7 |
etc.
So, basically I want the NUMBERS (or parameters) in my generalized decision tree to be optimized according to the new datapoints and decide on how much these new numbers of the parameters can deviate from the original ones.
Is this a clear question?
Thank you! Regards