Association Rule Mining Tool

Discover hidden relationships in your transaction data with Apriori and Eclat algorithms

Association Rule Generator
CSV format with one transaction per line, items separated by commas
Minimum frequency of itemset in the dataset (0.1 = 10%)
Minimum probability that rule is correct (0.5 = 50%)
Strength of association (lift > 1 indicates meaningful rule)
Association Network
Visual representation of strong item associations
Top Rules by Metric

Key Features

Two Powerful Algorithms

Choose between Apriori (breadth-first search) and Eclat (depth-first search) algorithms based on your dataset size and characteristics.

Adjustable Parameters

Fine-tune your analysis with intuitive sliders for support, confidence, and lift thresholds to find the most meaningful rules.

Interactive Visualizations

Explore your results through network graphs and metric charts that make complex relationships easy to understand.

Multiple Export Options

Download your results in CSV, JSON, or image formats for further analysis or reporting.

Fully Responsive

Works perfectly on all devices from desktops to smartphones, with an interface optimized for each screen size.

Fast Processing

Optimized JavaScript implementation handles even medium-sized datasets quickly in your browser.

How Association Rule Mining Works

What is Association Rule Mining?

Association rule mining is a technique to discover interesting relationships between variables in large databases. It's widely used in market basket analysis to find products that are frequently purchased together.

For example, a rule like {bread, butter} → {jam} with high confidence indicates that customers who buy bread and butter are likely to also buy jam.

Key Metrics Explained

  • Support: The frequency of the itemset in the dataset.
    Example: If {bread, butter} appears in 30% of transactions, support = 0.3
  • Confidence: The probability that the rule is true.
    Example: If 80% of transactions with {bread, butter} also contain {jam}, confidence = 0.8
  • Lift: Measures how much more likely the consequent is given the antecedent.
    Lift > 1 indicates a meaningful rule, lift = 1 means no association, lift < 1 means negative association

Apriori Algorithm

The Apriori algorithm works in a breadth-first manner, generating candidate itemsets and pruning those that don't meet the minimum support threshold.

Key properties:

  • Uses the "downward closure" property: If an itemset is infrequent, all its supersets will also be infrequent
  • Works well for sparse datasets with many items
  • Can be slower for dense datasets with many frequent itemsets

Eclat Algorithm

The Eclat algorithm uses a depth-first search approach and works with a vertical data format (item → transaction IDs) for faster intersection operations.

Key properties:

  • Generally faster than Apriori for dense datasets
  • Uses less memory for datasets with many transactions
  • Performs well when minimum support is relatively high

Frequently Asked Questions

What format should my data be in?

Your data should be in CSV format with one transaction per line. Items within each transaction should be separated by commas. For example:

bread,milk,butter
eggs,bacon,milk
bread,butter,jam

The tool will automatically detect and handle your delimiter. You can also use sample datasets provided.

Which algorithm should I choose - Apriori or Eclat?

Apriori is generally better for:

  • Sparse datasets (few items per transaction)
  • When you need to generate rules with low minimum support
  • When memory is a concern (Eclat can use more memory)

Eclat is generally better for:

  • Dense datasets (many items per transaction)
  • When you can use higher minimum support values
  • When speed is critical (Eclat is often faster)

You can try both algorithms and compare results!

How do I interpret the lift metric?

Lift is a measure of how much more likely the consequent is given the antecedent, compared to its general likelihood.

  • Lift > 1: Positive association - the items appear together more often than expected by chance
  • Lift = 1: No association - the items appear together as often as expected
  • Lift < 1: Negative association - the items appear together less often than expected

Generally, you'll want to focus on rules with lift significantly greater than 1, as these represent meaningful associations.

What's the maximum dataset size this tool can handle?

This tool runs entirely in your browser, so performance depends on your device's capabilities. As a general guideline:

  • Up to 1,000 transactions: Should work quickly on most devices
  • 1,000-5,000 transactions: May take some time to process
  • Over 5,000 transactions: May cause browser slowdowns

For very large datasets, consider using specialized software or programming libraries like Python's mlxtend.

Can I save my analysis results?

Yes! You can export your results in multiple formats:

  • CSV: For spreadsheet analysis or further processing
  • JSON: For programmatic use or integration with other tools
  • Image: Screenshot of visualizations for reports

Just click the export buttons in the results section after generating rules.

Is my data secure when using this tool?

Absolutely! This tool runs entirely in your browser - your data never leaves your computer. We don't send your data to any servers, and we don't store any of your information.

For sensitive data, we recommend:

  • Using the tool offline (save the page and open it locally)
  • Removing personally identifiable information before analysis
  • Using generic item names instead of specific identifiers