项目作者: gmaldona

项目描述 :
Implementation of FP-Growth Data Mining Algorithm
高级语言: Java
项目地址: git://github.com/gmaldona/FP_Growth-Data_Mining.git
创建时间: 2020-10-14T03:55:45Z
项目社区:https://github.com/gmaldona/FP_Growth-Data_Mining

开源协议:

下载


Frequent Pattern Growth Algorithm for Data Mining

Data Structures and Algorithms Project

Project Overview: The csv file Groceries_dataset contains customer ID and an item purchased on a date by that ID. The data can be sorted by each customer ID and then by data of purchase to obtain a receipt of the customer’s order. The organized customer data is shown in the XML file Transactions. With all the receipts organized, we can apply date mining techniques to find relationships within the data.


Important Note:

Compilation and running of the source code can all be done by executing run.sh or in the terminal executing the command ./run.sh

For organization purposes, cleanup.sh removes *.class files generated by run.sh


The Algorithm breakdown and explanation can be found here.

Example Results:

  1. Programming Assignment 2 gregory$ ./run.sh
  2. [rolls/buns]->[whole milk]
  3. Count: 209 | Support: 1.3967787208447504 | Confidence: 12.697448359659782
  4. [whole milk]->[rolls/buns]
  5. Count: 209 | Support: 1.3967787208447504 | Confidence: 8.844688954718578
  6. [soda]->[whole milk]
  7. Count: 174 | Support: 1.1628684087415626 | Confidence: 11.97522367515485
  8. [whole milk]->[soda]
  9. Count: 174 | Support: 1.1628684087415626 | Confidence: 7.363520947947524
  10. [whole milk]->[yogurt]
  11. Count: 167 | Support: 1.116086346320925 | Confidence: 7.0672873465933135
  12. [yogurt]->[whole milk]
  13. Count: 167 | Support: 1.116086346320925 | Confidence: 12.996108949416344
  14. [other vegetables]->[whole milk]
  15. Count: 222 | Support: 1.4836596939116486 | Confidence: 12.151067323481117
  16. [whole milk]->[other vegetables]
  17. Count: 222 | Support: 1.4836596939116486 | Confidence: 9.394837071519255
  18. [other vegetables]->[rolls/buns]
  19. Count: 158 | Support: 1.0559379803515339 | Confidence: 8.648056923918993
  20. [rolls/buns]->[other vegetables]
  21. Count: 158 | Support: 1.0559379803515339 | Confidence: 9.59902794653706
  22. [whole milk]->[rolls/buns]
  23. Count: 209 | Support: 1.3967787208447504 | Confidence: 8.844688954718578
  24. [rolls/buns]->[whole milk]
  25. Count: 209 | Support: 1.3967787208447504 | Confidence: 12.697448359659782
  26. [other vegetables]->[rolls/buns]
  27. Count: 158 | Support: 1.0559379803515339 | Confidence: 8.648056923918993
  28. [rolls/buns]->[other vegetables]
  29. Count: 158 | Support: 1.0559379803515339 | Confidence: 9.59902794653706
  30. [other vegetables]->[whole milk]
  31. Count: 222 | Support: 1.4836596939116486 | Confidence: 12.151067323481117
  32. [whole milk]->[other vegetables]
  33. Count: 222 | Support: 1.4836596939116486 | Confidence: 9.394837071519255
  34. [whole milk]->[soda]
  35. Count: 174 | Support: 1.1628684087415626 | Confidence: 7.363520947947524
  36. [soda]->[whole milk]
  37. Count: 174 | Support: 1.1628684087415626 | Confidence: 11.97522367515485