Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 576.2 KiB |
Average record size in memory | 59.0 B |
Variable types
Text | 2 |
---|---|
DateTime | 1 |
Numeric | 3 |
Dataset
Description | 공정거래위원회의 소비자 민원에 대한 학습데이터의 데이터로, 접수기관별 사건내역으로 보여지도록 나타내는 데이터 입니다. 이 데이터는 사건제목, 기관코드, 처리결과코드 등을 포함하고 있습니다. |
---|---|
Author | 공정거래위원회 |
URL | https://www.data.go.kr/data/15098333/fileData.do |
사건번호(ACCIDENT_NO) has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 23:25:13.851198 |
---|---|
Analysis finished | 2023-12-12 23:25:16.410310 |
Duration | 2.56 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 120000 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 2020-0060988 |
---|---|
2nd row | 2020-0032642 |
3rd row | 2020-0005438 |
4th row | 2020-0008838 |
5th row | 2020-0003166 |
Value | Count | Frequency (%) |
2020-0060988 | 1 | < 0.1% |
2020-0093995 | 1 | < 0.1% |
2020-0048906 | 1 | < 0.1% |
2020-0059831 | 1 | < 0.1% |
2020-0089053 | 1 | < 0.1% |
2020-0083063 | 1 | < 0.1% |
2020-0043452 | 1 | < 0.1% |
2020-0066773 | 1 | < 0.1% |
2020-0054060 | 1 | < 0.1% |
2020-0065958 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 44728 | |
2 | 24906 | |
- | 10000 | 8.3% |
1 | 6086 | 5.1% |
7 | 4994 | 4.2% |
3 | 4961 | 4.1% |
6 | 4918 | 4.1% |
9 | 4886 | 4.1% |
4 | 4868 | 4.1% |
5 | 4834 | 4.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 110000 | |
Dash Punctuation | 10000 | 8.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 44728 | |
2 | 24906 | |
1 | 6086 | 5.5% |
7 | 4994 | 4.5% |
3 | 4961 | 4.5% |
6 | 4918 | 4.5% |
9 | 4886 | 4.4% |
4 | 4868 | 4.4% |
5 | 4834 | 4.4% |
8 | 4819 | 4.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 120000 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 44728 | |
2 | 24906 | |
- | 10000 | 8.3% |
1 | 6086 | 5.1% |
7 | 4994 | 4.2% |
3 | 4961 | 4.1% |
6 | 4918 | 4.1% |
9 | 4886 | 4.1% |
4 | 4868 | 4.1% |
5 | 4834 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 120000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 44728 | |
2 | 24906 | |
- | 10000 | 8.3% |
1 | 6086 | 5.1% |
7 | 4994 | 4.2% |
3 | 4961 | 4.1% |
6 | 4918 | 4.1% |
9 | 4886 | 4.1% |
4 | 4868 | 4.1% |
5 | 4834 | 4.0% |
접수일자(RCPT_YMD)
Date
Distinct | 56 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2020-01-01 00:00:00 |
---|---|
Maximum | 2020-02-27 00:00:00 |
기관코드(INSTITUTION_CODE)
Real number (ℝ)
Distinct | 149 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 35955.484 |
Minimum | 10000 |
---|---|
Maximum | 41711 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 10000 |
---|---|
5-th percentile | 10000 |
Q1 | 40239 |
median | 40440 |
Q3 | 41004 |
95-th percentile | 41704 |
Maximum | 41711 |
Range | 31711 |
Interquartile range (IQR) | 765 |
Descriptive statistics
Standard deviation | 10636.126 |
---|---|
Coefficient of variation (CV) | 0.29581375 |
Kurtosis | 1.9458606 |
Mean | 35955.484 |
Median Absolute Deviation (MAD) | 339 |
Skewness | -1.94231 |
Sum | 3.5955484 × 108 |
Variance | 1.1312719 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10000 | 1383 | 13.8% |
41101 | 435 | 4.3% |
41004 | 320 | 3.2% |
40314 | 309 | 3.1% |
41105 | 277 | 2.8% |
40305 | 257 | 2.6% |
40315 | 192 | 1.9% |
40514 | 188 | 1.9% |
41104 | 174 | 1.7% |
41704 | 169 | 1.7% |
Other values (139) | 6296 |
Value | Count | Frequency (%) |
10000 | 1383 | |
20100 | 3 | < 0.1% |
30100 | 28 | 0.3% |
30200 | 29 | 0.3% |
30300 | 17 | 0.2% |
30400 | 21 | 0.2% |
30500 | 15 | 0.1% |
30600 | 16 | 0.2% |
30700 | 25 | 0.2% |
30800 | 167 | 1.7% |
Value | Count | Frequency (%) |
41711 | 148 | |
41709 | 120 | |
41707 | 24 | 0.2% |
41706 | 90 | |
41705 | 60 | 0.6% |
41704 | 169 | |
41703 | 125 | |
41702 | 102 | |
41700 | 7 | 0.1% |
41109 | 18 | 0.2% |
Distinct | 9855 |
---|---|
Distinct (%) | 98.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 96 |
---|---|
Median length | 62 |
Mean length | 21.6733 |
Min length | 1 |
Characters and Unicode
Total characters | 216733 |
---|---|
Distinct characters | 1082 |
Distinct categories | 12 ? |
Distinct scripts | 3 ? |
Distinct blocks | 4 ? |
Unique
Unique | 9752 ? |
---|---|
Unique (%) | 97.5% |
Sample
1st row | 우한폐렴 확산으로 인한 나트랑 여행 환불불가의 부당함에 이의제기 |
---|---|
2nd row | 배송 완료된 제품에 대해 가격 오기를 이유로 사업자가 차액 청구함 |
3rd row | 체크페이 앱에서 온누리 모바일상품권 등록할때 불편해요 |
4th row | [포털][카카오] 전자상거래 피해구제신청 - 한국소비자원/[기타 정보] |
5th row | 기계구입후 고장으로인한 교환문의 |
Value | Count | Frequency (%) |
문의 | 2945 | 5.4% |
환불 | 972 | 1.8% |
후 | 719 | 1.3% |
취소 | 689 | 1.3% |
인한 | 627 | 1.1% |
관련 | 581 | 1.1% |
위약금 | 573 | 1.0% |
요청 | 561 | 1.0% |
건 | 547 | 1.0% |
요구 | 397 | 0.7% |
Other values (14931) | 46193 |
Most occurring characters
Value | Count | Frequency (%) |
47978 | 22.1% | |
의 | 5128 | 2.4% |
문 | 4444 | 2.1% |
지 | 3064 | 1.4% |
불 | 3009 | 1.4% |
로 | 2891 | 1.3% |
이 | 2775 | 1.3% |
한 | 2772 | 1.3% |
구 | 2743 | 1.3% |
해 | 2563 | 1.2% |
Other values (1072) | 139366 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 163804 | |
Space Separator | 47978 | 22.1% |
Decimal Number | 1725 | 0.8% |
Uppercase Letter | 972 | 0.4% |
Other Punctuation | 909 | 0.4% |
Close Punctuation | 428 | 0.2% |
Open Punctuation | 385 | 0.2% |
Lowercase Letter | 365 | 0.2% |
Dash Punctuation | 99 | < 0.1% |
Math Symbol | 65 | < 0.1% |
Other values (2) | 3 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 5128 | 3.1% |
문 | 4444 | 2.7% |
지 | 3064 | 1.9% |
불 | 3009 | 1.8% |
로 | 2891 | 1.8% |
이 | 2775 | 1.7% |
한 | 2772 | 1.7% |
구 | 2743 | 1.7% |
해 | 2563 | 1.6% |
환 | 2543 | 1.6% |
Other values (992) | 131872 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 169 | |
S | 157 | |
A | 130 | |
V | 80 | |
G | 67 | 6.9% |
L | 65 | 6.7% |
K | 62 | 6.4% |
C | 47 | 4.8% |
P | 46 | 4.7% |
X | 23 | 2.4% |
Other values (14) | 126 |
Lowercase Letter
Value | Count | Frequency (%) |
s | 89 | |
a | 77 | |
t | 50 | |
v | 33 | 9.0% |
k | 20 | 5.5% |
p | 17 | 4.7% |
l | 12 | 3.3% |
g | 10 | 2.7% |
c | 10 | 2.7% |
e | 8 | 2.2% |
Other values (12) | 39 |
Other Punctuation
Value | Count | Frequency (%) |
. | 631 | |
/ | 196 | 21.6% |
% | 24 | 2.6% |
* | 23 | 2.5% |
' | 12 | 1.3% |
! | 7 | 0.8% |
; | 6 | 0.7% |
: | 5 | 0.6% |
‰ | 2 | 0.2% |
& | 2 | 0.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 498 | |
0 | 283 | |
2 | 254 | |
3 | 238 | |
5 | 159 | 9.2% |
4 | 87 | 5.0% |
9 | 69 | 4.0% |
7 | 48 | 2.8% |
8 | 45 | 2.6% |
6 | 44 | 2.6% |
Math Symbol
Value | Count | Frequency (%) |
> | 19 | |
< | 18 | |
+ | 12 | |
= | 8 | |
~ | 8 |
Close Punctuation
Value | Count | Frequency (%) |
) | 383 | |
] | 45 | 10.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 341 | |
[ | 44 | 11.4% |
Space Separator
Value | Count | Frequency (%) |
47978 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 99 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Control
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 163804 | |
Common | 51592 | 23.8% |
Latin | 1337 | 0.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 5128 | 3.1% |
문 | 4444 | 2.7% |
지 | 3064 | 1.9% |
불 | 3009 | 1.8% |
로 | 2891 | 1.8% |
이 | 2775 | 1.7% |
한 | 2772 | 1.7% |
구 | 2743 | 1.7% |
해 | 2563 | 1.6% |
환 | 2543 | 1.6% |
Other values (992) | 131872 |
Latin
Value | Count | Frequency (%) |
T | 169 | |
S | 157 | |
A | 130 | 9.7% |
s | 89 | 6.7% |
V | 80 | 6.0% |
a | 77 | 5.8% |
G | 67 | 5.0% |
L | 65 | 4.9% |
K | 62 | 4.6% |
t | 50 | 3.7% |
Other values (36) | 391 |
Common
Value | Count | Frequency (%) |
47978 | ||
. | 631 | 1.2% |
1 | 498 | 1.0% |
) | 383 | 0.7% |
( | 341 | 0.7% |
0 | 283 | 0.5% |
2 | 254 | 0.5% |
3 | 238 | 0.5% |
/ | 196 | 0.4% |
5 | 159 | 0.3% |
Other values (24) | 631 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 163794 | |
ASCII | 52927 | 24.4% |
Compat Jamo | 10 | < 0.1% |
Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
47978 | ||
. | 631 | 1.2% |
1 | 498 | 0.9% |
) | 383 | 0.7% |
( | 341 | 0.6% |
0 | 283 | 0.5% |
2 | 254 | 0.5% |
3 | 238 | 0.4% |
/ | 196 | 0.4% |
T | 169 | 0.3% |
Other values (69) | 1956 | 3.7% |
Hangul
Value | Count | Frequency (%) |
의 | 5128 | 3.1% |
문 | 4444 | 2.7% |
지 | 3064 | 1.9% |
불 | 3009 | 1.8% |
로 | 2891 | 1.8% |
이 | 2775 | 1.7% |
한 | 2772 | 1.7% |
구 | 2743 | 1.7% |
해 | 2563 | 1.6% |
환 | 2543 | 1.6% |
Other values (983) | 131862 |
Punctuation
Value | Count | Frequency (%) |
‰ | 2 |
Compat Jamo
Value | Count | Frequency (%) |
ㄱ | 2 | |
ㄹ | 1 | |
ㅁ | 1 | |
ㅔ | 1 | |
ㄴ | 1 | |
ㅀ | 1 | |
ㅎ | 1 | |
ㅏ | 1 | |
ㅂ | 1 |
상담이유코드(DSCSN_REASON_CODE)
Real number (ℝ)
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 609.4348 |
Minimum | 601 |
---|---|
Maximum | 616 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 601 |
---|---|
5-th percentile | 603 |
Q1 | 607 |
median | 609 |
Q3 | 611 |
95-th percentile | 616 |
Maximum | 616 |
Range | 15 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 3.4644695 |
---|---|
Coefficient of variation (CV) | 0.0056847255 |
Kurtosis | -0.13168962 |
Mean | 609.4348 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.21149008 |
Sum | 6094348 |
Variance | 12.002549 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
611 | 2753 | |
608 | 1785 | |
607 | 1632 | |
616 | 1006 | 10.1% |
606 | 961 | 9.6% |
610 | 575 | 5.8% |
615 | 417 | 4.2% |
603 | 276 | 2.8% |
601 | 137 | 1.4% |
602 | 133 | 1.3% |
Other values (6) | 325 | 3.2% |
Value | Count | Frequency (%) |
601 | 137 | 1.4% |
602 | 133 | 1.3% |
603 | 276 | 2.8% |
604 | 49 | 0.5% |
605 | 7 | 0.1% |
606 | 961 | |
607 | 1632 | |
608 | 1785 | |
609 | 117 | 1.2% |
610 | 575 | 5.8% |
Value | Count | Frequency (%) |
616 | 1006 | 10.1% |
615 | 417 | 4.2% |
614 | 19 | 0.2% |
613 | 117 | 1.2% |
612 | 16 | 0.2% |
611 | 2753 | |
610 | 575 | 5.8% |
609 | 117 | 1.2% |
608 | 1785 | |
607 | 1632 |
처리결과코드(PRCS_RESULT_CODE)
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 528.2863 |
Minimum | 401 |
---|---|
Maximum | 612 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 401 |
---|---|
5-th percentile | 501 |
Q1 | 502 |
median | 509 |
Q3 | 527 |
95-th percentile | 610 |
Maximum | 612 |
Range | 211 |
Interquartile range (IQR) | 25 |
Descriptive statistics
Standard deviation | 40.45611 |
---|---|
Coefficient of variation (CV) | 0.076579895 |
Kurtosis | -0.036253472 |
Mean | 528.2863 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 1.3224907 |
Sum | 5282863 |
Variance | 1636.6968 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
509 | 2391 | |
501 | 2298 | |
502 | 1103 | |
527 | 1092 | |
610 | 632 | 6.3% |
603 | 498 | 5.0% |
505 | 270 | 2.7% |
504 | 237 | 2.4% |
605 | 233 | 2.3% |
510 | 192 | 1.9% |
Other values (15) | 1054 |
Value | Count | Frequency (%) |
401 | 3 | < 0.1% |
501 | 2298 | |
502 | 1103 | |
504 | 237 | 2.4% |
505 | 270 | 2.7% |
506 | 17 | 0.2% |
507 | 113 | 1.1% |
509 | 2391 | |
510 | 192 | 1.9% |
511 | 123 | 1.2% |
Value | Count | Frequency (%) |
612 | 9 | 0.1% |
611 | 1 | < 0.1% |
610 | 632 | |
609 | 72 | 0.7% |
608 | 79 | 0.8% |
607 | 111 | 1.1% |
606 | 51 | 0.5% |
605 | 233 | 2.3% |
604 | 184 | 1.8% |
603 | 498 |
접수일자(RCPT_YMD) | 기관코드(INSTITUTION_CODE) | 상담이유코드(DSCSN_REASON_CODE) | 처리결과코드(PRCS_RESULT_CODE) | |
---|---|---|---|---|
접수일자(RCPT_YMD) | 1.000 | 0.667 | 0.159 | 0.095 |
기관코드(INSTITUTION_CODE) | 0.667 | 1.000 | 0.010 | 0.126 |
상담이유코드(DSCSN_REASON_CODE) | 0.159 | 0.010 | 1.000 | 0.316 |
처리결과코드(PRCS_RESULT_CODE) | 0.095 | 0.126 | 0.316 | 1.000 |
기관코드(INSTITUTION_CODE) | 상담이유코드(DSCSN_REASON_CODE) | 처리결과코드(PRCS_RESULT_CODE) | |
---|---|---|---|
기관코드(INSTITUTION_CODE) | 1.000 | -0.012 | 0.024 |
상담이유코드(DSCSN_REASON_CODE) | -0.012 | 1.000 | -0.091 |
처리결과코드(PRCS_RESULT_CODE) | 0.024 | -0.091 | 1.000 |
사건번호(ACCIDENT_NO) | 접수일자(RCPT_YMD) | 기관코드(INSTITUTION_CODE) | 사건제목(ACCIDENT_TITLE) | 상담이유코드(DSCSN_REASON_CODE) | 처리결과코드(PRCS_RESULT_CODE) | |
---|---|---|---|---|---|---|
51890 | 2020-0060988 | 2020-01-31 | 40515 | 우한폐렴 확산으로 인한 나트랑 여행 환불불가의 부당함에 이의제기 | 611 | 527 |
27566 | 2020-0032642 | 2020-01-17 | 30200 | 배송 완료된 제품에 대해 가격 오기를 이유로 사업자가 차액 청구함 | 603 | 502 |
4590 | 2020-0005438 | 2020-01-03 | 10000 | 체크페이 앱에서 온누리 모바일상품권 등록할때 불편해요 | 602 | 502 |
7854 | 2020-0008838 | 2020-01-06 | 10000 | [포털][카카오] 전자상거래 피해구제신청 - 한국소비자원/[기타 정보] | 615 | 509 |
2879 | 2020-0003166 | 2020-01-03 | 41702 | 기계구입후 고장으로인한 교환문의 | 606 | 501 |
35448 | 2020-0041921 | 2020-01-22 | 41703 | 음식점에서 옷이 타버림 배상요구함 | 608 | 502 |
8427 | 2020-0009448 | 2020-01-07 | 40801 | 1년전 보이스피싱 통장정지 개설 문의 | 616 | 505 |
816 | 2020-0000707 | 2020-01-02 | 40438 | 세탁하자로 신체상의 피해발생하여 배상요청하는건 | 608 | 527 |
88336 | 2020-0102189 | 2020-02-18 | 40508 | 코웨이안마의자 동일고장 3회 발생 문의 | 608 | 601 |
56377 | 2020-0065764 | 2020-02-03 | 41101 | 부동산 중개업자님 불공정거래 신고하고자 문의 10 | 607 | 510 |
사건번호(ACCIDENT_NO) | 접수일자(RCPT_YMD) | 기관코드(INSTITUTION_CODE) | 사건제목(ACCIDENT_TITLE) | 상담이유코드(DSCSN_REASON_CODE) | 처리결과코드(PRCS_RESULT_CODE) | |
---|---|---|---|---|---|---|
23299 | 2020-0027520 | 2020-01-15 | 10000 | (중재원각하)무릎인공관절 수술 후 재시술받은데 따른 문의 | 616 | 509 |
35406 | 2020-0042032 | 2020-01-22 | 40227 | 란탈정수기 약정기간 도래후 해지시 발생한 설치비 지불 요청건. | 611 | 509 |
55942 | 2020-0065473 | 2020-02-03 | 40818 | 10개월전에 방판으로 구매한 전집류 반품문의 | 616 | 509 |
52233 | 2020-0060781 | 2020-01-31 | 10000 | 신종코로나바이러스 발생 여행출발전 신의칙상의 주의의무에 관한 손해배상의 건 | 607 | 502 |
87117 | 2020-0100889 | 2020-02-17 | 40628 | 신발세탁 후 하자 건 | 615 | 501 |
59931 | 2020-0058408 | 2020-01-31 | 30800 | 주택수리 | 616 | 507 |
56701 | 2020-0066166 | 2020-02-03 | 10000 | 중고 오토바이 구입후 하루만에 시동이 켜지지 않아 환불하려 합니다. | 608 | 509 |
90000 | 2020-0104949 | 2020-02-19 | 40646 | 업체 사기로 인한 피해구제 신청을 하고자 함 | 607 | 509 |
43958 | 2020-0052860 | 2020-01-29 | 40305 | 전자담배 파손 보험 계약해지 요청건 | 603 | 502 |
23610 | 2020-0028163 | 2020-01-15 | 40807 | 키즈카페에서 어린이 골절 치료비 청구 건 | 616 | 509 |