Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 86 |
Missing cells | 12 |
Missing cells (%) | 2.0% |
Duplicate rows | 1 |
Duplicate rows (%) | 1.2% |
Total size in memory | 5.0 KiB |
Average record size in memory | 59.5 B |
Variable types
Categorical | 1 |
---|---|
Text | 3 |
Numeric | 2 |
DateTime | 1 |
Dataset
Description | 대형마트 소비자물가정보 84건에 대하여 경상북도 및 포항시의 품목별 최저가와 평균가를 산출내어 민원인에게 제공하고자 합니다. |
---|---|
URL | https://www.data.go.kr/data/15048475/fileData.do |
데이터기준일자 has constant value "" | Constant |
Dataset has 1 (1.2%) duplicate rows | Duplicates |
경상북도평균가 is highly overall correlated with 포항시평균가 | High correlation |
포항시평균가 is highly overall correlated with 경상북도평균가 | High correlation |
품명 has 2 (2.3%) missing values | Missing |
경상북도평균가 has 2 (2.3%) missing values | Missing |
경상북도최저가 has 2 (2.3%) missing values | Missing |
포항시평균가 has 2 (2.3%) missing values | Missing |
포항시최저가 has 2 (2.3%) missing values | Missing |
데이터기준일자 has 2 (2.3%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 10:07:28.319265 |
---|---|
Analysis finished | 2023-12-12 10:07:29.987634 |
Duration | 1.67 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
구분
Categorical
Distinct | 15 |
---|---|
Distinct (%) | 17.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 820.0 B |
채소류 | |
---|---|
어류 | |
유지·조미료 | |
곡류 | |
기타잡비 | |
Other values (10) |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 3.244186 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 곡류 |
---|---|
2nd row | 곡류 |
3rd row | 곡류 |
4th row | 곡류 |
5th row | 곡류 |
Common Values
Value | Count | Frequency (%) |
채소류 | 15 | |
어류 | 10 | |
유지·조미료 | 9 | |
곡류 | 8 | |
기타잡비 | 8 | |
과실류 | 7 | |
빵및과자 | 6 | 7.0% |
육류 | 4 | 4.7% |
외식 | 4 | 4.7% |
낙농품 | 3 | 3.5% |
Other values (5) | 12 |
Length
Value | Count | Frequency (%) |
채소류 | 15 | |
어류 | 10 | |
유지·조미료 | 9 | |
곡류 | 8 | |
기타잡비 | 8 | |
과실류 | 7 | |
빵및과자 | 6 | 7.0% |
육류 | 4 | 4.7% |
외식 | 4 | 4.7% |
낙농품 | 3 | 3.5% |
Other values (5) | 12 |
품명
Text
MISSING
 
Distinct | 84 |
---|---|
Distinct (%) | 100.0% |
Missing | 2 |
Missing (%) | 2.3% |
Memory size | 820.0 B |
Value | Count | Frequency (%) |
미역 | 1 | 1.2% |
배 | 1 | 1.2% |
초코파이 | 1 | 1.2% |
스낵과자 | 1 | 1.2% |
식빵 | 1 | 1.2% |
빵 | 1 | 1.2% |
떡 | 1 | 1.2% |
된장 | 1 | 1.2% |
간장 | 1 | 1.2% |
고추장 | 1 | 1.2% |
Other values (76) | 76 |
Most occurring characters
Value | Count | Frequency (%) |
고 | 8 | 3.7% |
기 | 6 | 2.8% |
장 | 6 | 2.8% |
치 | 5 | 2.3% |
마 | 5 | 2.3% |
추 | 4 | 1.9% |
유 | 4 | 1.9% |
용 | 4 | 1.9% |
과 | 4 | 1.9% |
조 | 3 | 1.4% |
Other values (120) | 167 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 212 | |
Space Separator | 2 | 0.9% |
Open Punctuation | 1 | 0.5% |
Close Punctuation | 1 | 0.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
고 | 8 | 3.8% |
기 | 6 | 2.8% |
장 | 6 | 2.8% |
치 | 5 | 2.4% |
마 | 5 | 2.4% |
추 | 4 | 1.9% |
유 | 4 | 1.9% |
용 | 4 | 1.9% |
과 | 4 | 1.9% |
조 | 3 | 1.4% |
Other values (117) | 163 |
Space Separator
Value | Count | Frequency (%) |
2 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 212 | |
Common | 4 | 1.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
고 | 8 | 3.8% |
기 | 6 | 2.8% |
장 | 6 | 2.8% |
치 | 5 | 2.4% |
마 | 5 | 2.4% |
추 | 4 | 1.9% |
유 | 4 | 1.9% |
용 | 4 | 1.9% |
과 | 4 | 1.9% |
조 | 3 | 1.4% |
Other values (117) | 163 |
Common
Value | Count | Frequency (%) |
2 | ||
( | 1 | |
) | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 212 | |
ASCII | 4 | 1.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
고 | 8 | 3.8% |
기 | 6 | 2.8% |
장 | 6 | 2.8% |
치 | 5 | 2.4% |
마 | 5 | 2.4% |
추 | 4 | 1.9% |
유 | 4 | 1.9% |
용 | 4 | 1.9% |
과 | 4 | 1.9% |
조 | 3 | 1.4% |
Other values (117) | 163 |
ASCII
Value | Count | Frequency (%) |
2 | ||
( | 1 | |
) | 1 |
경상북도평균가
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 84 |
---|---|
Distinct (%) | 100.0% |
Missing | 2 |
Missing (%) | 2.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8504.1548 |
Minimum | 255 |
---|---|
Maximum | 54784 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 906.0 B |
Quantile statistics
Minimum | 255 |
---|---|
5-th percentile | 1063.7 |
Q1 | 2644.5 |
median | 5066.5 |
Q3 | 9936 |
95-th percentile | 32074.95 |
Maximum | 54784 |
Range | 54529 |
Interquartile range (IQR) | 7291.5 |
Descriptive statistics
Standard deviation | 10328.04 |
---|---|
Coefficient of variation (CV) | 1.2144699 |
Kurtosis | 9.034031 |
Mean | 8504.1548 |
Median Absolute Deviation (MAD) | 3312.5 |
Skewness | 2.8407367 |
Sum | 714349 |
Variance | 1.0666842 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1905 | 1 | 1.2% |
4207 | 1 | 1.2% |
2169 | 1 | 1.2% |
3595 | 1 | 1.2% |
1402 | 1 | 1.2% |
12354 | 1 | 1.2% |
3906 | 1 | 1.2% |
5594 | 1 | 1.2% |
9903 | 1 | 1.2% |
2586 | 1 | 1.2% |
Other values (74) | 74 | |
(Missing) | 2 | 2.3% |
Value | Count | Frequency (%) |
255 | 1 | |
608 | 1 | |
801 | 1 | |
911 | 1 | |
1031 | 1 | |
1249 | 1 | |
1355 | 1 | |
1364 | 1 | |
1402 | 1 | |
1463 | 1 |
Value | Count | Frequency (%) |
54784 | 1 | |
53726 | 1 | |
37270 | 1 | |
36746 | 1 | |
33336 | 1 | |
24929 | 1 | |
23044 | 1 | |
17500 | 1 | |
17475 | 1 | |
16146 | 1 |
경상북도최저가
Text
MISSING
 
Distinct | 73 |
---|---|
Distinct (%) | 86.9% |
Missing | 2 |
Missing (%) | 2.3% |
Memory size | 820.0 B |
Value | Count | Frequency (%) |
3,500 | 2 | 2.4% |
700 | 2 | 2.4% |
3,000 | 2 | 2.4% |
990 | 2 | 2.4% |
2,500 | 2 | 2.4% |
4,100 | 2 | 2.4% |
6,000 | 2 | 2.4% |
2,800 | 2 | 2.4% |
2,200 | 2 | 2.4% |
1,120 | 2 | 2.4% |
Other values (63) | 64 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 118 | |
, | 71 | |
9 | 40 | 10.0% |
2 | 31 | 7.7% |
1 | 29 | 7.2% |
5 | 24 | 6.0% |
6 | 22 | 5.5% |
8 | 20 | 5.0% |
4 | 19 | 4.7% |
3 | 17 | 4.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 331 | |
Other Punctuation | 71 | 17.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 118 | |
9 | 40 | 12.1% |
2 | 31 | 9.4% |
1 | 29 | 8.8% |
5 | 24 | 7.3% |
6 | 22 | 6.6% |
8 | 20 | 6.0% |
4 | 19 | 5.7% |
3 | 17 | 5.1% |
7 | 11 | 3.3% |
Other Punctuation
Value | Count | Frequency (%) |
, | 71 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 402 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 118 | |
, | 71 | |
9 | 40 | 10.0% |
2 | 31 | 7.7% |
1 | 29 | 7.2% |
5 | 24 | 6.0% |
6 | 22 | 5.5% |
8 | 20 | 5.0% |
4 | 19 | 4.7% |
3 | 17 | 4.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 402 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 118 | |
, | 71 | |
9 | 40 | 10.0% |
2 | 31 | 7.7% |
1 | 29 | 7.2% |
5 | 24 | 6.0% |
6 | 22 | 5.5% |
8 | 20 | 5.0% |
4 | 19 | 4.7% |
3 | 17 | 4.2% |
포항시평균가
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 83 |
---|---|
Distinct (%) | 98.8% |
Missing | 2 |
Missing (%) | 2.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8478.3214 |
Minimum | 149 |
---|---|
Maximum | 60226 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 906.0 B |
Quantile statistics
Minimum | 149 |
---|---|
5-th percentile | 900.65 |
Q1 | 2902.75 |
median | 5017.5 |
Q3 | 9474 |
95-th percentile | 31246.65 |
Maximum | 60226 |
Range | 60077 |
Interquartile range (IQR) | 6571.25 |
Descriptive statistics
Standard deviation | 11025.663 |
---|---|
Coefficient of variation (CV) | 1.3004536 |
Kurtosis | 11.456407 |
Mean | 8478.3214 |
Median Absolute Deviation (MAD) | 3077.5 |
Skewness | 3.202619 |
Sum | 712179 |
Variance | 1.2156525 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1284 | 2 | 2.3% |
4320 | 1 | 1.2% |
3380 | 1 | 1.2% |
1295 | 1 | 1.2% |
18380 | 1 | 1.2% |
4140 | 1 | 1.2% |
5040 | 1 | 1.2% |
10141 | 1 | 1.2% |
2009 | 1 | 1.2% |
1089 | 1 | 1.2% |
Other values (73) | 73 | |
(Missing) | 2 | 2.3% |
Value | Count | Frequency (%) |
149 | 1 | |
617 | 1 | |
721 | 1 | |
794 | 1 | |
890 | 1 | |
961 | 1 | |
1089 | 1 | |
1284 | 2 | |
1295 | 1 | |
1411 | 1 |
Value | Count | Frequency (%) |
60226 | 1 | |
59386 | 1 | |
45040 | 1 | |
34577 | 1 | |
33060 | 1 | |
20971 | 1 | |
18380 | 1 | |
18000 | 1 | |
17971 | 1 | |
14877 | 1 |
포항시최저가
Text
MISSING
 
Distinct | 79 |
---|---|
Distinct (%) | 94.0% |
Missing | 2 |
Missing (%) | 2.3% |
Memory size | 820.0 B |
Value | Count | Frequency (%) |
500 | 2 | 2.4% |
1980 | 2 | 2.4% |
3980 | 2 | 2.4% |
2990 | 2 | 2.4% |
4950 | 2 | 2.4% |
4320 | 1 | 1.2% |
836 | 1 | 1.2% |
2480 | 1 | 1.2% |
15,000 | 1 | 1.2% |
2630 | 1 | 1.2% |
Other values (69) | 69 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 108 | |
9 | 36 | 10.4% |
5 | 33 | 9.5% |
8 | 32 | 9.2% |
2 | 32 | 9.2% |
1 | 27 | 7.8% |
3 | 21 | 6.1% |
4 | 19 | 5.5% |
6 | 17 | 4.9% |
, | 12 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 334 | |
Other Punctuation | 12 | 3.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 108 | |
9 | 36 | 10.8% |
5 | 33 | 9.9% |
8 | 32 | 9.6% |
2 | 32 | 9.6% |
1 | 27 | 8.1% |
3 | 21 | 6.3% |
4 | 19 | 5.7% |
6 | 17 | 5.1% |
7 | 9 | 2.7% |
Other Punctuation
Value | Count | Frequency (%) |
, | 12 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 346 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 108 | |
9 | 36 | 10.4% |
5 | 33 | 9.5% |
8 | 32 | 9.2% |
2 | 32 | 9.2% |
1 | 27 | 7.8% |
3 | 21 | 6.1% |
4 | 19 | 5.5% |
6 | 17 | 4.9% |
, | 12 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 346 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 108 | |
9 | 36 | 10.4% |
5 | 33 | 9.5% |
8 | 32 | 9.2% |
2 | 32 | 9.2% |
1 | 27 | 7.8% |
3 | 21 | 6.1% |
4 | 19 | 5.5% |
6 | 17 | 4.9% |
, | 12 | 3.5% |
데이터기준일자
Date
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 1.2% |
Missing | 2 |
Missing (%) | 2.3% |
Memory size | 820.0 B |
Minimum | 2023-03-05 00:00:00 |
---|---|
Maximum | 2023-03-05 00:00:00 |
구분 | 품명 | 경상북도평균가 | 경상북도최저가 | 포항시평균가 | 포항시최저가 | |
---|---|---|---|---|---|---|
구분 | 1.000 | 1.000 | 0.000 | 0.841 | 0.107 | 0.983 |
품명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
경상북도평균가 | 0.000 | 1.000 | 1.000 | 0.996 | 0.978 | 0.996 |
경상북도최저가 | 0.841 | 1.000 | 0.996 | 1.000 | 0.965 | 0.947 |
포항시평균가 | 0.107 | 1.000 | 0.978 | 0.965 | 1.000 | 0.991 |
포항시최저가 | 0.983 | 1.000 | 0.996 | 0.947 | 0.991 | 1.000 |
경상북도평균가 | 포항시평균가 | 구분 | |
---|---|---|---|
경상북도평균가 | 1.000 | 0.980 | 0.000 |
포항시평균가 | 0.980 | 1.000 | 0.050 |
구분 | 0.000 | 0.050 | 1.000 |
구분 | 품명 | 경상북도평균가 | 경상북도최저가 | 포항시평균가 | 포항시최저가 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|
0 | 곡류 | 쌀 | 53726 | 47,900 | 59386 | 49800 | 2023-03-05 |
1 | 곡류 | 보리쌀 | 3786 | 1,950 | 3880 | 2990 | 2023-03-05 |
2 | 곡류 | 찹쌀 | 4730 | 2,630 | 4665 | 2625 | 2023-03-05 |
3 | 곡류 | 콩 | 12100 | 9,188 | 12376 | 9960 | 2023-03-05 |
4 | 곡류 | 밀가루 | 5920 | 4,990 | 5630 | 5190 | 2023-03-05 |
5 | 곡류 | 두부 | 3463 | 1,990 | 3783 | 1800 | 2023-03-05 |
6 | 곡류 | 라면 | 801 | 660 | 794 | 740 | 2023-03-05 |
7 | 곡류 | 국수 | 3458 | 2,590 | 3697 | 3550 | 2023-03-05 |
8 | 육류 | 쇠고기 (국산) | 54784 | 34,275 | 60226 | 36500 | 2023-03-05 |
9 | 육류 | 돼지고기 | 12890 | 9,800 | 12936 | 8450 | 2023-03-05 |
구분 | 품명 | 경상북도평균가 | 경상북도최저가 | 포항시평균가 | 포항시최저가 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|
76 | 기타잡비 | 치약 | 3120 | 1,120 | 4070 | 1940 | 2023-03-05 |
77 | 기타잡비 | 로션 | 6773 | 4,100 | 5521 | 2218 | 2023-03-05 |
78 | 기타잡비 | 샴푸 | 7967 | 5,300 | 7079 | 2605 | 2023-03-05 |
79 | 기타잡비 | 손세정제 | 4906 | 2,800 | 4575 | 3250 | 2023-03-05 |
80 | 기타잡비 | 보건용 마스크 | 1031 | 700 | 890 | 890 | 2023-03-05 |
81 | 차와음료 | 커피 | 9502 | 1,490 | 7139 | 520 | 2023-03-05 |
82 | 차와음료 | 콜라 | 2796 | 2,290 | 3113 | 2290 | 2023-03-05 |
83 | 차와음료 | 과일주스 | 2564 | 1,290 | 3570 | 3280 | 2023-03-05 |
84 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
85 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
구분 | 품명 | 경상북도평균가 | 경상북도최저가 | 포항시평균가 | 포항시최저가 | 데이터기준일자 | # duplicates | |
---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2 |