Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 1989 |
Missing cells (%) | 2.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 732.4 KiB |
Average record size in memory | 75.0 B |
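The derived figures in this table follow from the raw counts. A minimal sketch of the arithmetic, assuming the percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes:

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 8
n_observations = 10_000
missing_cells = 1_989
total_size_kib = 732.4

# Missing cells (%) is the share of all cells, not of rows.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size is total memory divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```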
Variable types
Text | 2 |
---|---|
DateTime | 1 |
Categorical | 3 |
Numeric | 2 |
Dataset
Description | This dataset manages and presents information on consumer complaint applicants by gender, age group, region, and other characteristics. |
---|---|
Author | Korea Fair Trade Commission (공정거래위원회) |
URL | https://www.data.go.kr/data/15098318/fileData.do |
성별(GENDER) is highly overall correlated with 성별코드(GENDER_CODE) | High correlation |
성별코드(GENDER_CODE) is highly overall correlated with 성별(GENDER) | High correlation |
연령대코드(AGE_GROUP_CODE) is highly overall correlated with 연령대명(AGE_GROUP_NAME) | High correlation |
연령대명(AGE_GROUP_NAME) is highly overall correlated with 연령대코드(AGE_GROUP_CODE) | High correlation |
연령대코드(AGE_GROUP_CODE) has 1989 (19.9%) missing values | Missing |
사건번호(ACCIDENT_NO) has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 12:57:16.674341 |
---|---|
Analysis finished | 2023-12-12 12:57:18.071012 |
Duration | 1.4 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
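A report of this kind can be regenerated roughly as follows. This is a sketch rather than the exact command used here: the helper name `build_report`, the report title, and the file paths are hypothetical, and the CSV is assumed to be readable with pandas defaults.

```python
def build_report(csv_path: str, html_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the sketch loads even without the third-party
    # packages (pandas, ydata-profiling) installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: default parsing works; pass encoding=... to read_csv
    # if the downloaded file uses a legacy Korean encoding.
    df = pd.read_csv(csv_path)

    report = ProfileReport(df, title="Consumer complaint applicants")
    report.to_file(html_path)
```

Calling `build_report("applicants.csv")` with a local copy of the file would write `report.html` to the working directory.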
사건번호(ACCIDENT_NO)
Text
UNIQUE
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 120000 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 10000 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 2017-0001942 |
---|---|
2nd row | 2016-0944163 |
3rd row | 2017-0213769 |
4th row | 2017-0173807 |
5th row | 2016-0162800 |
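The length and character statistics above (always 12 characters, digits plus one dash) suggest a fixed layout. A small validation sketch, assuming the layout is four digits, a dash, and seven digits; the report does not state what the two parts mean, and the names `CASE_NO` and `split_case_no` are hypothetical:

```python
import re

# Assumed layout: 4 digits, a dash, 7 digits (12 characters in total).
CASE_NO = re.compile(r"(\d{4})-(\d{7})")

# The five sample values listed above.
samples = [
    "2017-0001942",
    "2016-0944163",
    "2017-0213769",
    "2017-0173807",
    "2016-0162800",
]

def split_case_no(value: str) -> tuple[str, str]:
    """Return the (prefix, serial) parts or raise if the layout differs."""
    match = CASE_NO.fullmatch(value)
    if match is None:
        raise ValueError(f"unexpected case number layout: {value!r}")
    return match.group(1), match.group(2)

parts = [split_case_no(s) for s in samples]
print(parts[0])
```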
Value | Count | Frequency (%) |
2017-0001942 | 1 | < 0.1% |
2017-0136026 | 1 | < 0.1% |
2017-0084414 | 1 | < 0.1% |
2017-0077677 | 1 | < 0.1% |
2016-0230959 | 1 | < 0.1% |
2016-0879880 | 1 | < 0.1% |
2017-0121707 | 1 | < 0.1% |
2017-0142526 | 1 | < 0.1% |
2016-0868853 | 1 | < 0.1% |
2016-0536762 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 26430 | 22.0% |
2 | 16015 | 13.3% |
1 | 15937 | 13.3% |
6 | 13864 | 11.6% |
- | 10000 | 8.3% |
7 | 8131 | 6.8% |
3 | 6166 | 5.1% |
4 | 6120 | 5.1% |
5 | 5943 | 5.0% |
8 | 5824 | 4.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 110000 | 91.7% |
Dash Punctuation | 10000 | 8.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 26430 | 24.0% |
2 | 16015 | 14.6% |
1 | 15937 | 14.5% |
6 | 13864 | 12.6% |
7 | 8131 | 7.4% |
3 | 6166 | 5.6% |
4 | 6120 | 5.6% |
5 | 5943 | 5.4% |
8 | 5824 | 5.3% |
9 | 5570 | 5.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 120000 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 26430 | 22.0% |
2 | 16015 | 13.3% |
1 | 15937 | 13.3% |
6 | 13864 | 11.6% |
- | 10000 | 8.3% |
7 | 8131 | 6.8% |
3 | 6166 | 5.1% |
4 | 6120 | 5.1% |
5 | 5943 | 5.0% |
8 | 5824 | 4.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 120000 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 26430 | 22.0% |
2 | 16015 | 13.3% |
1 | 15937 | 13.3% |
6 | 13864 | 11.6% |
- | 10000 | 8.3% |
7 | 8131 | 6.8% |
3 | 6166 | 5.1% |
4 | 6120 | 5.1% |
5 | 5943 | 5.0% |
8 | 5824 | 4.9% |
접수일자(RCPT_YMD)
Date
Distinct | 457 |
---|---|
Distinct (%) | 4.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2016-01-01 00:00:00 |
---|---|
Maximum | 2017-11-23 00:00:00 |
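To put the 457 distinct dates in context, the span between the minimum and maximum can be compared with the number of calendar days it covers. A short sketch using only the figures above:

```python
from datetime import date

# Minimum and maximum receipt dates from the table above.
first = date(2016, 1, 1)
last = date(2017, 11, 23)
distinct_dates = 457

# Number of calendar days in the range, counting both endpoints.
span_days = (last - first).days + 1

# Share of calendar days on which at least one record was received.
coverage = distinct_dates / span_days

print(span_days, f"{coverage:.0%}")
```

Roughly two thirds of the calendar days in the range appear in this 10,000-row sample.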
성별코드(GENDER_CODE)
Categorical
HIGH CORRELATION
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 2.8672 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 1 |
4th row | 2 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
<NA> | 6224 | 62.2% |
2 | 2111 | 21.1% |
1 | 1665 | 16.7% |
Common Values (Plot)
Value | Count | Frequency (%) |
na | 6224 | 62.2% |
2 | 2111 | 21.1% |
1 | 1665 | 16.7% |
성별(GENDER)
Categorical
HIGH CORRELATION
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.2448 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 여성 |
---|---|
2nd row | 여성 |
3rd row | 남성 |
4th row | 여성 |
5th row | 남성 |
Common Values
Value | Count | Frequency (%) |
<NA> | 6224 | 62.2% |
여성 | 2111 | 21.1% |
남성 | 1665 | 16.7% |
Common Values (Plot)
Value | Count | Frequency (%) |
na | 6224 | 62.2% |
여성 | 2111 | 21.1% |
남성 | 1665 | 16.7% |
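Both gender columns report 0 missing values even though `<NA>` is the most common value. The length statistics are consistent with `<NA>` being stored as a literal four-character string: the sketch below reproduces the reported mean lengths from the counts above. The helper `to_missing` is a hypothetical illustration of how such strings could be turned into real missing values before profiling.

```python
# Value counts copied from the two "Common Values" tables above.
gender_code_counts = {"<NA>": 6224, "2": 2111, "1": 1665}
gender_counts = {"<NA>": 6224, "여성": 2111, "남성": 1665}

def mean_length(counts: dict[str, int]) -> float:
    """Mean string length, weighting each value by its count."""
    total = sum(counts.values())
    return sum(len(value) * n for value, n in counts.items()) / total

def to_missing(value: str):
    """Map the literal string '<NA>' to None (hypothetical cleanup step)."""
    return None if value == "<NA>" else value

print(mean_length(gender_code_counts))  # matches "Mean length" for GENDER_CODE
print(mean_length(gender_counts))       # matches "Mean length" for GENDER
```

After such a cleanup, both columns would show 6224 (62.2%) missing values instead of 0.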
연령대코드(AGE_GROUP_CODE)
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 12 |
---|---|
Distinct (%) | 0.1% |
Missing | 1989 |
Missing (%) | 19.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.8288603 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 4 |
median | 5 |
Q3 | 6 |
95-th percentile | 7 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.5353662 |
---|---|
Coefficient of variation (CV) | 0.31795622 |
Kurtosis | 4.0917753 |
Mean | 4.8288603 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.568505 |
Sum | 38684 |
Variance | 2.3573493 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
4 | 2689 | 26.9% |
5 | 2023 | 20.2% |
6 | 1320 | 13.2% |
3 | 1163 | 11.6% |
7 | 372 | 3.7% |
8 | 136 | 1.4% |
9 | 111 | 1.1% |
11 | 90 | 0.9% |
2 | 49 | 0.5% |
12 | 36 | 0.4% |
Other values (2) | 22 | 0.2% |
(Missing) | 1989 | 19.9% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 3 | < 0.1% |
2 | 49 | 0.5% |
3 | 1163 | 11.6% |
4 | 2689 | 26.9% |
5 | 2023 | 20.2% |
6 | 1320 | 13.2% |
7 | 372 | 3.7% |
8 | 136 | 1.4% |
9 | 111 | 1.1% |
10 | 19 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
12 | 36 | 0.4% |
11 | 90 | 0.9% |
10 | 19 | 0.2% |
9 | 111 | 1.1% |
8 | 136 | 1.4% |
7 | 372 | 3.7% |
6 | 1320 | 13.2% |
5 | 2023 | 20.2% |
4 | 2689 | 26.9% |
3 | 1163 | 11.6% |
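The descriptive statistics for this column can be rebuilt from the value counts above. A sketch of that check, assuming the sample variance (divisor n − 1), which is what reproduces the reported figure:

```python
from math import sqrt

# Value -> count for AGE_GROUP_CODE, taken from the tables above
# (the 1989 missing rows are excluded).
freq = {1: 3, 2: 49, 3: 1163, 4: 2689, 5: 2023, 6: 1320,
        7: 372, 8: 136, 9: 111, 10: 19, 11: 90, 12: 36}

n = sum(freq.values())                          # non-missing observations
total = sum(value * count for value, count in freq.items())
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(count * (value - mean) ** 2 for value, count in freq.items()) / (n - 1)
std = sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(n, total, round(mean, 7), round(variance, 7), round(cv, 8))
```

The non-missing count of 8011 plus the 1989 missing rows accounts for all 10000 observations.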
연령대명(AGE_GROUP_NAME)
Categorical
HIGH CORRELATION
Distinct | 13 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 11 |
---|---|
Median length | 8 |
Mean length | 7.2431 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 40 - 49세 |
---|---|
2nd row | 30 - 39세 |
3rd row | 불명 |
4th row | 30 - 39세 |
5th row | 40 - 49세 |
Common Values
Value | Count | Frequency (%) |
30 - 39세 | 2689 | 26.9% |
40 - 49세 | 2023 | 20.2% |
<NA> | 1989 | 19.9% |
50 - 59세 | 1320 | 13.2% |
20 - 29세 | 1163 | 11.6% |
(구)60 - 69세 | 372 | 3.7% |
70 - 79세 | 136 | 1.4% |
불명 | 111 | 1.1% |
60 - 64세 | 90 | 0.9% |
10 - 19세 | 49 | 0.5% |
Other values (3) | 58 | 0.6% |
Common Values (Plot)
Value | Count | Frequency (%) |
- | 7878 | 30.6% |
30 | 2689 | 10.4% |
39세 | 2689 | 10.4% |
40 | 2023 | 7.9% |
49세 | 2023 | 7.9% |
na | 1989 | 7.7% |
50 | 1320 | 5.1% |
59세 | 1320 | 5.1% |
20 | 1163 | 4.5% |
29세 | 1163 | 4.5% |
Other values (13) | 1502 | 5.8% |
지역코드(AREA_CODE)
Real number (ℝ)
Distinct | 243 |
---|---|
Distinct (%) | 2.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 805.7359 |
Minimum | 100 |
---|---|
Maximum | 9907 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 100 |
Q1 | 201 |
median | 800 |
Q3 | 1001 |
95-th percentile | 1506 |
Maximum | 9907 |
Range | 9807 |
Interquartile range (IQR) | 800 |
Descriptive statistics
Standard deviation | 1187.2989 |
---|---|
Coefficient of variation (CV) | 1.4735584 |
Kurtosis | 47.000181 |
Mean | 805.7359 |
Median Absolute Deviation (MAD) | 400 |
Skewness | 6.4824096 |
Sum | 8057359 |
Variance | 1409678.6 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
800 | 872 | 8.7% |
100 | 647 | 6.5% |
1205 | 338 | 3.4% |
500 | 213 | 2.1% |
810 | 211 | 2.1% |
101 | 201 | 2.0% |
400 | 181 | 1.8% |
801 | 179 | 1.8% |
809 | 174 | 1.7% |
808 | 162 | 1.6% |
Other values (233) | 6822 | 68.2% |
Minimum 10 values
Value | Count | Frequency (%) |
100 | 647 | 6.5% |
101 | 201 | 2.0% |
102 | 84 | 0.8% |
103 | 53 | 0.5% |
104 | 89 | 0.9% |
105 | 90 | 0.9% |
106 | 65 | 0.7% |
107 | 74 | 0.7% |
108 | 38 | 0.4% |
109 | 68 | 0.7% |
Maximum 10 values
Value | Count | Frequency (%) |
9907 | 6 | 0.1% |
9906 | 1 | < 0.1% |
9903 | 4 | < 0.1% |
9902 | 1 | < 0.1% |
9901 | 9 | 0.1% |
9900 | 124 | 1.2% |
1700 | 34 | 0.3% |
1604 | 108 | 1.1% |
1603 | 24 | 0.2% |
1600 | 41 | 0.4% |
지역명(AREA_NAME)
Text
Distinct | 222 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
경기도 | 872 | 8.5% |
서울특별시 | 647 | 6.3% |
전주시 | 338 | 3.3% |
서구 | 222 | 2.2% |
광주광역시 | 213 | 2.1% |
수원시 | 211 | 2.1% |
강남구 | 201 | 2.0% |
인천광역시 | 181 | 1.8% |
고양시 | 179 | 1.7% |
성남시 | 174 | 1.7% |
Other values (213) | 7010 | 68.4% |
Most occurring characters
Value | Count | Frequency (%) |
시 | 5246 | 15.7% |
구 | 3377 | 10.1% |
주 | 1261 | 3.8% |
도 | 1219 | 3.7% |
서 | 1191 | 3.6% |
광 | 1152 | 3.4% |
기 | 1012 | 3.0% |
경 | 962 | 2.9% |
남 | 809 | 2.4% |
산 | 771 | 2.3% |
Other values (135) | 16393 | 49.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 33145 | 99.3% |
Space Separator | 248 | 0.7% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 5246 | 15.8% |
구 | 3377 | 10.2% |
주 | 1261 | 3.8% |
도 | 1219 | 3.7% |
서 | 1191 | 3.6% |
광 | 1152 | 3.5% |
기 | 1012 | 3.1% |
경 | 962 | 2.9% |
남 | 809 | 2.4% |
산 | 771 | 2.3% |
Other values (134) | 16145 | 48.7% |
Space Separator
Value | Count | Frequency (%) |
(space) | 248 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 33145 | 99.3% |
Common | 248 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 5246 | 15.8% |
구 | 3377 | 10.2% |
주 | 1261 | 3.8% |
도 | 1219 | 3.7% |
서 | 1191 | 3.6% |
광 | 1152 | 3.5% |
기 | 1012 | 3.1% |
경 | 962 | 2.9% |
남 | 809 | 2.4% |
산 | 771 | 2.3% |
Other values (134) | 16145 | 48.7% |
Common
Value | Count | Frequency (%) |
(space) | 248 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 33145 | 99.3% |
ASCII | 248 | 0.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 5246 | 15.8% |
구 | 3377 | 10.2% |
주 | 1261 | 3.8% |
도 | 1219 | 3.7% |
서 | 1191 | 3.6% |
광 | 1152 | 3.5% |
기 | 1012 | 3.1% |
경 | 962 | 2.9% |
남 | 809 | 2.4% |
산 | 771 | 2.3% |
Other values (134) | 16145 | 48.7% |
ASCII
Value | Count | Frequency (%) |
(space) | 248 | 100.0% |
| | 성별코드(GENDER_CODE) | 성별(GENDER) | 연령대코드(AGE_GROUP_CODE) | 연령대명(AGE_GROUP_NAME) | 지역코드(AREA_CODE) |
---|---|---|---|---|---|
성별코드(GENDER_CODE) | 1.000 | 1.000 | 0.140 | 0.135 | 0.000 |
성별(GENDER) | 1.000 | 1.000 | 0.140 | 0.135 | 0.000 |
연령대코드(AGE_GROUP_CODE) | 0.140 | 0.140 | 1.000 | 1.000 | 0.127 |
연령대명(AGE_GROUP_NAME) | 0.135 | 0.135 | 1.000 | 1.000 | 0.166 |
지역코드(AREA_CODE) | 0.000 | 0.000 | 0.127 | 0.166 | 1.000 |
| | 성별(GENDER) | 성별코드(GENDER_CODE) | 연령대명(AGE_GROUP_NAME) |
---|---|---|---|
성별(GENDER) | 1.000 | 0.999 | 0.105 |
성별코드(GENDER_CODE) | 0.999 | 1.000 | 0.105 |
연령대명(AGE_GROUP_NAME) | 0.105 | 0.105 | 1.000 |
| | 연령대코드(AGE_GROUP_CODE) | 지역코드(AREA_CODE) | 성별코드(GENDER_CODE) | 성별(GENDER) | 연령대명(AGE_GROUP_NAME) |
---|---|---|---|---|---|
연령대코드(AGE_GROUP_CODE) | 1.000 | 0.083 | 0.107 | 0.107 | 1.000 |
지역코드(AREA_CODE) | 0.083 | 1.000 | 0.000 | 0.000 | 0.075 |
성별코드(GENDER_CODE) | 0.107 | 0.000 | 1.000 | 0.999 | 0.105 |
성별(GENDER) | 0.107 | 0.000 | 0.999 | 1.000 | 0.105 |
연령대명(AGE_GROUP_NAME) | 1.000 | 0.075 | 0.105 | 0.105 | 1.000 |
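The matrices above do not name the association measure used. One common choice for pairs of categorical columns is Cramér's V, and the sketch below shows the plain (uncorrected) version as an illustration only; it is an assumption, not a statement of what produced these numbers. The function name `cramers_v` is hypothetical, and the input is the gender code and name pairs from the first ten sample rows of this report.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed counts per (x, y) cell
    rows = Counter(xs)             # marginal counts for xs
    cols = Counter(ys)             # marginal counts for ys

    # Chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for x, row_total in rows.items():
        for y, col_total in cols.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(rows), len(cols)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

codes = ["2", "2", "1", "2", "1", "<NA>", "2", "<NA>", "<NA>", "<NA>"]
names = ["여성", "여성", "남성", "여성", "남성", "<NA>", "여성", "<NA>", "<NA>", "<NA>"]

v_gender = cramers_v(codes, names)
v_independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(round(v_gender, 3), round(v_independent, 3))
```

A code column and its name column that map one to one give a value of 1, which is why each code/name pair triggers a high correlation alert.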
| | 사건번호(ACCIDENT_NO) | 접수일자(RCPT_YMD) | 성별코드(GENDER_CODE) | 성별(GENDER) | 연령대코드(AGE_GROUP_CODE) | 연령대명(AGE_GROUP_NAME) | 지역코드(AREA_CODE) | 지역명(AREA_NAME) |
---|---|---|---|---|---|---|---|---|
69962 | 2017-0001942 | 2017-01-02 | 2 | 여성 | 5 | 40 - 49세 | 1205 | 전주시 |
77015 | 2016-0944163 | 2016-12-26 | 2 | 여성 | 4 | 30 - 39세 | 1604 | 제주시 |
94559 | 2017-0213769 | 2017-03-23 | 1 | 남성 | 9 | 불명 | 101 | 강남구 |
83357 | 2017-0173807 | 2017-03-09 | 2 | 여성 | 4 | 30 - 39세 | 800 | 경기도 |
3761 | 2016-0162800 | 2016-03-09 | 1 | 남성 | 5 | 40 - 49세 | 831 | 화성시 |
3438 | 2016-0259074 | 2016-04-16 | <NA> | <NA> | 3 | 20 - 29세 | 603 | 서구 |
37380 | 2016-0640684 | 2016-09-07 | 2 | 여성 | 4 | 30 - 39세 | 119 | 양천구 |
77324 | 2017-0018755 | 2017-01-09 | <NA> | <NA> | <NA> | <NA> | 821 | 하남시 |
86972 | 2017-0045850 | 2017-01-18 | <NA> | <NA> | 3 | 20 - 29세 | 814 | 오산시 |
46367 | 2016-0760632 | 2016-10-24 | <NA> | <NA> | <NA> | <NA> | 831 | 화성시 |
| | 사건번호(ACCIDENT_NO) | 접수일자(RCPT_YMD) | 성별코드(GENDER_CODE) | 성별(GENDER) | 연령대코드(AGE_GROUP_CODE) | 연령대명(AGE_GROUP_NAME) | 지역코드(AREA_CODE) | 지역명(AREA_NAME) |
---|---|---|---|---|---|---|---|---|
11912 | 2016-0411491 | 2016-06-17 | 2 | 여성 | <NA> | <NA> | 100 | 서울특별시 |
28085 | 2016-0540571 | 2016-08-02 | <NA> | <NA> | <NA> | <NA> | 800 | 경기도 |
24725 | 2016-0522666 | 2016-07-27 | 1 | 남성 | 8 | 70 - 79세 | 1205 | 전주시 |
12828 | 2016-0379454 | 2016-06-03 | <NA> | <NA> | 3 | 20 - 29세 | 1700 | 세종특별자치시 |
13840 | 2016-0349792 | 2016-05-24 | 2 | 여성 | 4 | 30 - 39세 | 1305 | 여수시 |
9109 | 2016-0307309 | 2016-05-08 | <NA> | <NA> | 5 | 40 - 49세 | 407 | 연수구 |
56535 | 2016-0745536 | 2016-10-18 | <NA> | <NA> | 4 | 30 - 39세 | 1115 | 홍성군 |
50309 | 2016-0794230 | 2016-11-04 | 1 | 남성 | 5 | 40 - 49세 | 906 | 춘천시 |
91707 | 2017-0092768 | 2017-02-07 | 2 | 여성 | 12 | 65 - 69세 | 1205 | 전주시 |
55373 | 2016-0656576 | 2016-09-12 | 2 | 여성 | <NA> | <NA> | 505 | 서구 |
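The sample rows make it easy to check whether each code column maps to exactly one name, which is what the high correlation alerts suggest. A minimal sketch over the first ten sample rows; the helper `build_mapping` is hypothetical, and a check on the full file would be needed before dropping either column:

```python
# (GENDER_CODE, GENDER, AGE_GROUP_CODE, AGE_GROUP_NAME) from the first
# ten sample rows shown above.
rows = [
    ("2", "여성", "5", "40 - 49세"),
    ("2", "여성", "4", "30 - 39세"),
    ("1", "남성", "9", "불명"),
    ("2", "여성", "4", "30 - 39세"),
    ("1", "남성", "5", "40 - 49세"),
    ("<NA>", "<NA>", "3", "20 - 29세"),
    ("2", "여성", "4", "30 - 39세"),
    ("<NA>", "<NA>", "<NA>", "<NA>"),
    ("<NA>", "<NA>", "3", "20 - 29세"),
    ("<NA>", "<NA>", "<NA>", "<NA>"),
]

def build_mapping(pairs):
    """Return code -> name, raising if one code maps to two names."""
    mapping = {}
    for code, name in pairs:
        # setdefault stores the first name seen and returns the stored one.
        if mapping.setdefault(code, name) != name:
            raise ValueError(f"code {code!r} maps to both {mapping[code]!r} and {name!r}")
    return mapping

gender_map = build_mapping((r[0], r[1]) for r in rows)
age_map = build_mapping((r[2], r[3]) for r in rows)
print(gender_map)
print(age_map)
```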