项目作者: hayesall

项目描述 :
💾 Source code and 📓 TeX for AL Hayes, M Das, P Odom, and S Natarajan, "User Friendly Automatic Construction of Background Knowledge: Mode Construction from ER Diagrams." Knowledge Capture Conference 2017
高级语言: TeX
项目地址: git://github.com/hayesall/Walk-ER.git
创建时间: 2017-07-31T18:20:15Z
项目社区:https://github.com/hayesall/Walk-ER

开源协议:GNU General Public License v3.0

下载






Source code and TeX for “User Friendly Automatic Construction of Background Knowledge: Mode Construction from ER Diagrams.” K-CAP 2017

Paper

Explore further on hayesall.com.

Updates

Citation

If you build on this code or the ideas of the paper, please consider citing.

@inproceedings{hayes2017userfriendly, author = {Hayes, Alexander L. and Das, Mayukh and Odom, Phillip and Natarajan, Sriraam}, title = {User Friendly Automatic Construction of Background Knowledge: Mode Construction from ER Diagrams}, booktitle = {Proceedings of the Knowledge Capture Conference}, series = {K-CAP 2017}, year = {2017}, isbn = {978-1-4503-5553-7}, location = {Austin, TX, USA}, pages = {30:1--30:8}, articleno = {30}, numpages = {8}, url = {http://doi.acm.org/10.1145/3148011.3148027}, doi = {10.1145/3148011.3148027}, acmid = {3148027}, publisher = {ACM}, address = {New York, NY, USA}, }


Getting Started

Modes are used to restrict/guide the search space and are a powerful tool in getting relational algorithms such as BoostSRL to work. If your algorithm does not learn anything useful, then the first debug point would be the modes (in the background.txt file).

Walk-ER is a system for defining background knowledge for use in relational learning algorithms by exploring entity/attribute/relationships in Entity-Relational Diagrams. Refer to the BoostSRL Basic Modes Guide for more information about modes.

Prerequisites

  • Java 1.8
  • Python (2.7, 3.5)

Installation

Download the latest version from the GitHub repository. This includes five datasets:

  1. $ git clone https://github.com/hayesall/Walk-ER.git

Basic Usage

  • Files representing the ER-Diagrams are in the diagrams/ directory.
  • Datasets used in the experiments are in the datasets/ directory.

Walk-ER can either be invoked from a terminal.

  • Options overview (output of python walker.py -h):

    1. usage: WalkER_rewrite.py [-h] [-v] [--number NUMBER] [-w | -s | -e | -r | -rw] diagram_file
    2. positional arguments:
    3. diagram_file
    4. optional arguments:
    5. -h, --help show this help message and exit
    6. -v, --verbose Increase verbosity to help with debugging.
    7. --number NUMBER Select number of features to walk to (assumes that
    8. Important features are ordered from most important to
    9. least important). Defaults to number_attributes +
    10. number_relations if chosen number is greater than both.
    11. -w, --walk [Default] Walk graph from target to features.
    12. -s, --shortest Walk the graph from target to features. If there are
    13. multiple paths, take the shortest. If the shortest are
    14. equal lengths, walk both.
    15. -e, --exhaustive Walk graph from every feature to every feature.
    16. -r, --random Ignore features the user selected and walk (-w) from the
    17. target to random features.
    18. -rw, --randomwalk Walk a random path from the target until reaching a depth
    19. limit (specified with --number).

Examples

  • $ python walker.py -w diagrams/imdb.mayukh

    1. //target is workedunder
    2. mode: actor(+personid).
    3. mode: female_gender(+personid).
    4. mode: genre(+personid,-genreid).
    5. mode: movie(-movieid,+personid).
    6. mode: workedunder(+personid,+personid).
  • $ python walker.py -rw --number 10 diagrams/imdb.mayukh

    1. //target is workedunder
    2. mode: actor(+personid).
    3. mode: female_gender(+personid).
    4. mode: genre(+personid,-genreid).
    5. mode: movie(+movieid,+personid).
    6. mode: workedunder(+personid,+personid).
    7. mode: workedunder(+personid,-personid).
    8. mode: workedunder(-personid,+personid).

Acknowledgements

  • Mayukh Das and Sriraam Natarajan gratefully acknowledge the support of the CwC Program Contract W911NF-15-1-0461 with the US Defense Advanced Research Projects Agency (DARPA) and the Army Research Office (ARO).
  • Phillip Odom and Sriraam Natarajan acknowledge the support of the Army Research Office (ARO) grant number W911NF-13-1-0432 under the Young Investigator Program.
  • Icon in the logo is “Trail” by Martina Krasnayová from the Noun Project, used under a Creative Commons (CC) Attribution 3.0 United States License.