Subscribe to Vincent Granville's Weekly Digest:

high dimensional feature sets and the Data Dictionary

The Data Dictionary for a PMML model requires quite a bit of metadata for each field.  With sparse, high dimensional data the Data Dictionary could be many times larger than either the training data or the trained model.   Has anyone developed a standard extension to the PMML syntax that, for instance, just says that all fields have the same metadata?

Tags: data-dictionary, extensions, metadata

Views: 52

Follow us

© 2013   AnalyticBridge.com is a subsidiary and dedicated channel of Data Science Central LLC

Badges  |  Report an Issue  |  Terms of Service