๋ฌธ์ œ

 

Brazilian E-Commerce Public Dataset by Olist ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค๋Š” ๋ธŒ๋ผ์งˆ์˜ ์ด์ปค๋จธ์Šค ์›น์‚ฌ์ดํŠธ์ธ Olist Store์˜ ํŒ๋งค ๋ฐ์ดํ„ฐ ์ž…๋‹ˆ๋‹ค. ๊ทธ ์ค‘ `olist_orders_dataset` ํ…Œ์ด๋ธ”์—๋Š” ์ฃผ๋ฌธ ID, ๊ณ ๊ฐ ID, ์ฃผ๋ฌธ ์ƒํƒœ, ๊ตฌ๋งค ์‹œ๊ฐ ๋“ฑ ์ฃผ๋ฌธ ๋‚ด์—ญ ๋ฐ์ดํ„ฐ๊ฐ€ ๋“ค์–ด์žˆ์Šต๋‹ˆ๋‹ค. Olist์˜ ์ฃผ๋ฌธ๋ถ€ํ„ฐ ๋ฐฐ์†ก๊นŒ์ง€ ํ”„๋กœ์„ธ์Šค๋Š” ๋‹ค์Œ ๋‹จ๊ณ„๋ฅผ ํ†ตํ•ด ์ด๋ฃจ์–ด์ง€๊ณ , ๊ฐ ๋‹จ๊ณ„๋งˆ๋‹ค ์‹œ๊ฐ์„ ๊ธฐ๋กํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

  1. ๊ณ ๊ฐ์˜ ๊ตฌ๋งค
    • `order_purchase_timestamp` ์ปฌ๋Ÿผ์— ๊ตฌ๋งค ์‹œ์ ์ด ์ €์žฅ๋จ
  2. ํŒ๋งค์ž๊ฐ€ ์ฃผ๋ฌธ์„ ์Šน์ธ
    • `order_approved_at` ์ปฌ๋Ÿผ์— ์Šน์ธ ์‹œ์ ์ด ์ €์žฅ๋จ
  3. ํƒ๋ฐฐ์‚ฌ์— ๋„์ฐฉํ•˜์—ฌ ๋ฐฐ์†ก ์‹œ์ž‘
    • `order_delivered_carrier_date` ์ปฌ๋Ÿผ์— ๋ฐฐ์†ก ์‹œ์ž‘ ์‹œ์ ์ด ์ €์žฅ๋จ
  4. ๋ฐฐ์†ก ์™„๋ฃŒ
    • `order_delivered_customer_date` ์ปฌ๋Ÿผ์— ๋ฐฐ์†ก ์™„๋ฃŒ ์‹œ์ ์ด ์ €์žฅ๋จ

์ถ”๊ฐ€๋กœ `order_estimated_delivery_date` ์ปฌ๋Ÿผ์—๋Š” ์ฃผ๋ฌธ ์‹œ์ ์— ๊ณ„์‚ฐํ•œ ๋ฐฐ์†ก ์˜ˆ์ • ์‹œ๊ฐ์ด ์ €์žฅ๋˜์–ด ์žˆ์Šต๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, ์ปฌ๋Ÿผ์— ๊ฐ’์ด ‘2017-02-24 00:00:00’๋กœ ๋“ค์–ด์žˆ๋Š” ๊ฒฝ์šฐ, ๋ฐฐ์†ก์„ 2017๋…„ 2์›” 24์ผ ์ž์ •๊นŒ์ง€ ์™„๋ฃŒํ•˜๊ฒ ๋‹ค๋Š” ์˜๋ฏธ๋ฅผ ๋‹ด๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

 

2017๋…„ 1์›” ํ•œ ๋‹ฌ ๋™์•ˆ ๋ฐœ์ƒํ•œ ์ฃผ๋ฌธ์˜ ๋ฐฐ์†ก ์˜ˆ์ธก์ด ์ •ํ™•ํ–ˆ๋Š”์ง€ ๋ถ„์„์„ ํ•˜๋ ค๊ณ  ํ•ฉ๋‹ˆ๋‹ค. ๊ณ ๊ฐ์˜ ๊ตฌ๋งค ์ผ์ž๋ณ„๋กœ ๋ฐฐ์†ก ์˜ˆ์ • ์‹œ๊ฐ ์•ˆ์— ๊ณ ๊ฐ์—๊ฒŒ ๋„์ฐฉํ•œ ์ฃผ๋ฌธ๊ณผ, ๋ฐฐ์†ก ์˜ˆ์ • ์‹œ๊ฐ์ด ์ง€๋‚˜์„œ ๊ณ ๊ฐ์—๊ฒŒ ๋„์ฐฉํ•œ ์ฃผ๋ฌธ์„ ๊ฐ๊ฐ ์ง‘๊ณ„ํ•˜๋Š” ์ฟผ๋ฆฌ๋ฅผ ์ž‘์„ฑํ•ด์ฃผ์„ธ์š”. ๋ฐฐ์†ก ์™„๋ฃŒ ๋˜๋Š” ๋ฐฐ์†ก ์˜ˆ์ • ์‹œ๊ฐ ๋ฐ์ดํ„ฐ๊ฐ€ ์—†๋Š” ๊ฒฝ์šฐ๋Š” ๊ณ„์‚ฐ์—์„œ ์ œ์™ธํ•ฉ๋‹ˆ๋‹ค. ๊ณ„์‚ฐ ๊ฒฐ๊ณผ๋Š” ๊ตฌ๋งค ๋‚ ์งœ๋ฅผ ๊ธฐ์ค€์œผ๋กœ ์˜ค๋ฆ„์ฐจ์ˆœ ์ •๋ ฌ๋˜์–ด์•ผ ํ•˜๊ณ , ์•„๋ž˜ ์ปฌ๋Ÿผ์„ ํฌํ•จํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.

  • `purchase_date` - ๊ตฌ๋งค ๋‚ ์งœ (์˜ˆ: 2017-01-01)
  • `success` - ๋ฐฐ์†ก ์˜ˆ์ • ์‹œ๊ฐ ์•ˆ์— ๊ณ ๊ฐ์—๊ฒŒ ๋„์ฐฉํ•œ ์ฃผ๋ฌธ ์ˆ˜
  • `fail` - ๋ฐฐ์†ก ์˜ˆ์ • ์‹œ๊ฐ์ด ์ง€๋‚˜ ๊ณ ๊ฐ์—๊ฒŒ ๋„์ฐฉํ•œ ์ฃผ๋ฌธ ์ˆ˜

๊ฒฐ๊ณผ ๋ฐ์ดํ„ฐ ์˜ˆ์‹œ

purchase_date success fail
2017-01-06 4 0
2017-01-07 3 1
  • 2017๋…„ 1์›” 6์ผ ๊ตฌ๋งค๋œ ์ฃผ๋ฌธ์€ ์ด 4๊ฑด์ด๊ณ  ๋ชจ๋‘ ๋ฐฐ์†ก ์˜ˆ์ •์ผ ์•ˆ์— ๋ฐฐ์†ก ์™„๋ฃŒ๋จ
  • 2017๋…„ 1์›” 7์ผ ๊ตฌ๋งค๋œ ์ฃผ๋ฌธ์€ ์ด 4๊ฑด์ด๊ณ  ๊ทธ ์ค‘ 3๊ฑด์€ ๋ฐฐ์†ก ์˜ˆ์ •์ผ ์•ˆ์— ๋ฐฐ์†ก ์™„๋ฃŒ๋˜์—ˆ์œผ๋‚˜, 1๊ฑด์€ ๋ฐฐ์†ก ์˜ˆ์ •์ผ ์ดํ›„์— ๋ฐฐ์†ก ์™„๋ฃŒ๋จ

 

์ œ์ถœ
SELECT DATE(order_purchase_timestamp) AS 'purchase_date'
     , SUM(IIF(order_estimated_delivery_date >= order_delivered_customer_date, 1, 0)) AS 'success'
     , SUM(IIF(order_estimated_delivery_date < order_delivered_customer_date, 1, 0)) AS 'fail'
FROM olist_orders_dataset
WHERE order_delivered_customer_date IS NOT NULL
  AND order_estimated_delivery_date IS NOT NULL
  AND purchase_date LIKE '2017-01%'
GROUP BY purchase_date
ORDER BY purchase_date
;

+ Recent posts