Skip to content

Hypothesis: "Fiction books are cheaper than Non Fiction books." We perform a z-test on book price data from Kaggle to confirm or reject this hypothesis using real-world data.

Notifications You must be signed in to change notification settings

meruyert4/HypothesisTestDataAnalysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“Š Hypothesis Test Data Analysis (Go)

This project tests a real-life hypothesis using basic statistical analysis in Go:

Hypothesis: "Fiction books are cheaper than Non Fiction books."

We perform a z-test on book price data from Kaggle to confirm or reject this hypothesis using real-world data.


πŸ—‚ Project Structure

HypothesisTestDataAnalysis/
β”œβ”€β”€ cmd/
β”‚   └── main.go       
β”œβ”€β”€ internal/
β”‚   └── analysis.go 
β”œβ”€β”€ bestsellers with categories.csv
β”œβ”€β”€ go.mod
└── README.md


πŸ“¦ Prerequisites


πŸ”§ Setup

  1. Clone the project (or download ZIP):
git clone https://github.com/meruyert4/HypothesisTestDataAnalysis.git
cd HypothesisTestDataAnalysis

πŸš€ Run the Project

go run cmd/main.go

You should see output like:

πŸ“š Fiction Avg Price: $10.85 (240 books)
πŸ“˜ Non Fiction Avg Price: $14.84 (310 books)
Z-score: -4.55 β†’ Reject Hβ‚€. Fiction books are significantly cheaper. βœ…

πŸ“š Dataset Citation

Sootla, S. (2020). Amazon Top 50 Bestselling Books 2009–2019 [Data set]. Kaggle. πŸ”— https://www.kaggle.com/datasets/sootersaalu/amazon-top-50-bestselling-books-2009-2019


πŸ“Œ Notes

  • This project uses population standard deviation.
  • Z-score interpretation uses a one-tailed test at 95% confidence (threshold: -1.645).

About

Hypothesis: "Fiction books are cheaper than Non Fiction books." We perform a z-test on book price data from Kaggle to confirm or reject this hypothesis using real-world data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages