Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 627 |
Missing cells | 43 |
Missing cells (%) | 1.1% |
Duplicate rows | 3 |
Duplicate rows (%) | 0.5% |
Total size in memory | 30.7 KiB |
Average record size in memory | 50.2 B |
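The percentages in this table follow from the counts. A minimal sketch, assuming the missing-cell percentage is taken over all cells (observations × variables) and the duplicate percentage over observations; the variable names are illustrative only:

```python
# Counts copied from the "Dataset statistics" table above.
n_variables = 6
n_observations = 627
missing_cells = 43
duplicate_rows = 3

# Assumption: missing cells are measured against every cell in the table.
total_cells = n_variables * n_observations
missing_pct = round(100 * missing_cells / total_cells, 1)

# Assumption: duplicate rows are measured against the number of observations.
duplicate_pct = round(100 * duplicate_rows / n_observations, 1)

print(missing_pct, duplicate_pct)
```

Both values agree with the reported 1.1% and 0.5%.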
Variable types
Numeric | 1 |
---|---|
Text | 2 |
Categorical | 3 |
Dataset
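Description | Information history (일렬번호 serial number, 상가이름 shop name, 구분 category, 상세구분 detailed category, 전화번호 phone number, 등록일자 registration date) for the shops in the underground shopping arcade in front of Daejeon Station (Jungang-ro Jiha 200, Dong-gu), operated by the Daejeon Metropolitan City Facilities Management Corporation |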
---|---|
Author | Daejeon Metropolitan City Facilities Management Corporation (대전광역시시설관리공단) |
URL | https://www.data.go.kr/data/15123937/fileData.do |
Dataset has 3 (0.5%) duplicate rows | Duplicates |
일렬번호 is highly overall correlated with 상세구분 | High correlation |
구분 is highly overall correlated with 상세구분 | High correlation |
상세구분 is highly overall correlated with 일렬번호 and 1 other fields | High correlation |
구분 is highly imbalanced (52.7%) | Imbalance |
전화번호 has 43 (6.9%) missing values | Missing |
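The 52.7% imbalance reported for 구분 is consistent with an entropy-based score, 1 − H / log k, where H is the entropy of the class frequencies and k the number of classes. That definition is an assumption here, since the report does not state its formula. A minimal sketch using the 구분 counts from its Common Values table (the function name is hypothetical):

```python
import math

# Counts for 구분 copied from its "Common Values" table; <NA> is listed
# there as its own value, giving 6 distinct classes.
counts = [427, 164, 15, 11, 8, 2]

def imbalance_score(class_counts):
    """Return 1 - H / log(k): 0 for a uniform column, near 1 when one class dominates."""
    total = sum(class_counts)
    k = len(class_counts)
    if k < 2:
        return 0.0  # assumed convention for a single-class column
    entropy = -sum((c / total) * math.log(c / total) for c in class_counts if c > 0)
    return 1 - entropy / math.log(k)

score = imbalance_score(counts)
print(f"{score:.3f}")
```

Under this assumption the score comes out close to 0.527, in line with the alert.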
Reproduction
Analysis started | 2023-12-11 23:18:02.023551 |
---|---|
Analysis finished | 2023-12-11 23:18:02.475046 |
Duration | 0.45 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
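A report of this kind can be regenerated roughly as follows. This is a sketch only: the function name, file paths, title, and encoding are assumptions, and the exact configuration used for this run lives in config.json, which is not reproduced here.

```python
def build_report(csv_path, output_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so the function can be defined without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source is a plain CSV file. Korean public datasets are
    # often encoded as cp949, so the encoding argument may need changing.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Build the profile and write it out as a standalone HTML file.
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(output_path)
    return report
```

Calling `build_report("shops.csv", encoding="cp949")` with a hypothetical local copy of the dataset would write `report.html` next to the script.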
일렬번호
Real number (ℝ)
HIGH CORRELATION
Distinct | 216 |
---|---|
Distinct (%) | 34.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2814.3238 |
Minimum | 2706 |
Maximum | 2923 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.6 KiB |
Quantile statistics
Minimum | 2706 |
---|---|
5-th percentile | 2716 |
Q1 | 2756 |
median | 2815 |
Q3 | 2870 |
95-th percentile | 2913.7 |
Maximum | 2923 |
Range | 217 |
Interquartile range (IQR) | 114 |
Descriptive statistics
Standard deviation | 64.285645 |
---|---|
Coefficient of variation (CV) | 0.022842306 |
Kurtosis | -1.2500318 |
Mean | 2814.3238 |
Median Absolute Deviation (MAD) | 57 |
Skewness | -0.0012274107 |
Sum | 1764581 |
Variance | 4132.6442 |
Monotonicity | Increasing |
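Several of the figures above are derived from one another. The following sketch recomputes them from the reported values as a consistency check; it uses only the standard definitions (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, range = max − min, IQR = Q3 − Q1).

```python
import math

# Values copied from the tables above for 일렬번호.
n = 627
total = 1764581
std = 64.285645
minimum, maximum = 2706, 2923
q1, q3 = 2756, 2870

mean = total / n                 # sum divided by the number of observations
variance = std ** 2              # variance is the squared standard deviation
cv = std / mean                  # coefficient of variation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range

print(round(mean, 4), round(variance, 4), round(cv, 6), value_range, iqr)
```

The results agree with the reported mean, variance, CV, range, and IQR up to rounding of the inputs.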
Value | Count | Frequency (%) |
2741 | 7 | 1.1% |
2742 | 6 | 1.0% |
2706 | 5 | 0.8% |
2855 | 5 | 0.8% |
2919 | 5 | 0.8% |
2876 | 5 | 0.8% |
2904 | 4 | 0.6% |
2805 | 4 | 0.6% |
2782 | 4 | 0.6% |
2857 | 4 | 0.6% |
Other values (206) | 578 | 92.2% |
Value | Count | Frequency (%) |
2706 | 5 | 0.8% |
2707 | 3 | 0.5% |
2708 | 3 | 0.5% |
2709 | 3 | 0.5% |
2710 | 3 | 0.5% |
2711 | 3 | 0.5% |
2712 | 3 | 0.5% |
2713 | 2 | 0.3% |
2714 | 3 | 0.5% |
2715 | 2 | 0.3% |
Value | Count | Frequency (%) |
2923 | 3 | 0.5% |
2922 | 3 | 0.5% |
2921 | 3 | 0.5% |
2920 | 2 | 0.3% |
2919 | 5 | 0.8% |
2918 | 3 | 0.5% |
2917 | 4 | 0.6% |
2916 | 4 | 0.6% |
2915 | 2 | 0.3% |
2914 | 3 | 0.5% |
상가이름
Text
Distinct | 149 |
---|---|
Distinct (%) | 23.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.0 KiB |
Value | Count | Frequency (%) |
큰별통신 | 17 | 2.7% |
천보당안경콘택트 | 11 | 1.7% |
연예인 | 11 | 1.7% |
여성크로커 | 11 | 1.7% |
올포유 | 10 | 1.6% |
청담동 | 10 | 1.6% |
밤블비 | 10 | 1.6% |
자방모드 | 10 | 1.6% |
현아통신 | 9 | 1.4% |
흙비 | 9 | 1.4% |
Other values (142) | 531 | 84.7% |
Most occurring characters
Value | Count | Frequency (%) |
통 | 66 | 2.6% |
신 | 66 | 2.6% |
보 | 58 | 2.3% |
자 | 46 | 1.8% |
지 | 43 | 1.7% |
모 | 41 | 1.6% |
리 | 37 | 1.5% |
우 | 34 | 1.4% |
드 | 34 | 1.4% |
아 | 33 | 1.3% |
Other values (222) | 2052 | 81.8% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2437 | 97.1% |
Uppercase Letter | 18 | 0.7% |
Space Separator | 12 | 0.5% |
Close Punctuation | 12 | 0.5% |
Open Punctuation | 12 | 0.5% |
Decimal Number | 12 | 0.5% |
Other Punctuation | 3 | 0.1% |
Other Symbol | 2 | 0.1% |
Math Symbol | 1 | < 0.1% |
Letter Number | 1 | < 0.1% |
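The category names in this table match the Unicode general categories. A minimal sketch of how such counts can be obtained with Python's standard `unicodedata` module; the mapping table and function name are illustrative, and this is not claimed to be the profiler's own implementation:

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general-category codes seen in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Zs": "Space Separator",
    "Pe": "Close Punctuation", "Ps": "Open Punctuation", "Nd": "Decimal Number",
    "Po": "Other Punctuation", "So": "Other Symbol", "Sm": "Math Symbol",
    "Nl": "Letter Number",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Two shop names taken from the frequency table above.
sample = ["큰별통신", "천보당안경콘택트"]
print(category_counts(sample))
```

Hangul syllables fall under "Other Letter", which is why that category dominates the 상가이름 column.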
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
통 | 66 | 2.7% |
신 | 66 | 2.7% |
보 | 58 | 2.4% |
자 | 46 | 1.9% |
지 | 43 | 1.8% |
모 | 41 | 1.7% |
리 | 37 | 1.5% |
우 | 34 | 1.4% |
드 | 34 | 1.4% |
아 | 33 | 1.4% |
Other values (208) | 1979 | 81.2% |
Uppercase Letter
Value | Count | Frequency (%) |
N | 4 | 22.2% |
E | 4 | 22.2% |
W | 4 | 22.2% |
G | 3 | 16.7% |
M | 3 | 16.7% |
Decimal Number
Value | Count | Frequency (%) |
0 | 6 | 50.0% |
2 | 6 | 50.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 12 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 12 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 12 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 3 | 100.0% |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 2 | 100.0% |
Math Symbol
Value | Count | Frequency (%) |
~ | 1 | 100.0% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 2439 | 97.2% |
Common | 52 | 2.1% |
Latin | 19 | 0.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
통 | 66 | 2.7% |
신 | 66 | 2.7% |
보 | 58 | 2.4% |
자 | 46 | 1.9% |
지 | 43 | 1.8% |
모 | 41 | 1.7% |
리 | 37 | 1.5% |
우 | 34 | 1.4% |
드 | 34 | 1.4% |
아 | 33 | 1.4% |
Other values (209) | 1981 | 81.2% |
Common
Value | Count | Frequency (%) |
(space) | 12 | 23.1% |
) | 12 | 23.1% |
( | 12 | 23.1% |
0 | 6 | 11.5% |
2 | 6 | 11.5% |
. | 3 | 5.8% |
~ | 1 | 1.9% |
Latin
Value | Count | Frequency (%) |
N | 4 | 21.1% |
E | 4 | 21.1% |
W | 4 | 21.1% |
G | 3 | 15.8% |
M | 3 | 15.8% |
Ⅱ | 1 | 5.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 2437 | 97.1% |
ASCII | 70 | 2.8% |
None | 2 | 0.1% |
Number Forms | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
통 | 66 | 2.7% |
신 | 66 | 2.7% |
보 | 58 | 2.4% |
자 | 46 | 1.9% |
지 | 43 | 1.8% |
모 | 41 | 1.7% |
리 | 37 | 1.5% |
우 | 34 | 1.4% |
드 | 34 | 1.4% |
아 | 33 | 1.4% |
Other values (208) | 1979 | 81.2% |
ASCII
Value | Count | Frequency (%) |
(space) | 12 | 17.1% |
) | 12 | 17.1% |
( | 12 | 17.1% |
0 | 6 | 8.6% |
2 | 6 | 8.6% |
N | 4 | 5.7% |
E | 4 | 5.7% |
W | 4 | 5.7% |
G | 3 | 4.3% |
. | 3 | 4.3% |
Other values (2) | 4 | 5.7% |
None
Value | Count | Frequency (%) |
㈜ | 2 | 100.0% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 | 100.0% |
구분
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 6 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.0 KiB |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.784689 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 3 |
---|---|
2nd row | 3 |
3rd row | 3 |
4th row | <NA> |
5th row | 3 |
Common Values
Value | Count | Frequency (%) |
3 | 427 | 68.1% |
<NA> | 164 | 26.2% |
2 | 15 | 2.4% |
4 | 11 | 1.8% |
5 | 8 | 1.3% |
1 | 2 | 0.3% |
상세구분
Categorical
HIGH CORRELATION
Distinct | 42 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.0 KiB |
Length
Max length | 7 |
---|---|
Median length | 2 |
Mean length | 2.7400319 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 20 |
---|---|
2nd row | 20 |
3rd row | 20 |
4th row | <NA> |
5th row | 20 |
Common Values
Value | Count | Frequency (%) |
20 | 205 | 32.7% |
<NA> | 184 | 29.3% |
35 | 33 | 5.3% |
5 | 22 | 3.5% |
29 | 18 | 2.9% |
2,20 | 14 | 2.2% |
3 | 14 | 2.2% |
2,14 | 11 | 1.8% |
22 | 8 | 1.3% |
10 | 8 | 1.3% |
Other values (32) | 110 | 17.5% |
Common Values (Plot)
Value | Count | Frequency (%) |
20 | 205 | 32.7% |
<NA> | 184 | 29.3% |
35 | 33 | 5.3% |
5 | 22 | 3.5% |
29 | 18 | 2.9% |
2,20 | 14 | 2.2% |
3 | 14 | 2.2% |
2,14 | 11 | 1.8% |
2,12,33 | 8 | 1.3% |
22 | 8 | 1.3% |
Other values (32) | 110 | 17.5% |
전화번호
Text
MISSING
Distinct | 102 |
---|---|
Distinct (%) | 17.5% |
Missing | 43 |
Missing (%) | 6.9% |
Memory size | 5.0 KiB |
Value | Count | Frequency (%) |
2522149 | 22 | 3.8% |
2233777 | 17 | 2.9% |
2531904 | 16 | 2.7% |
2554952 | 13 | 2.2% |
2579255 | 12 | 2.1% |
2579064 | 11 | 1.9% |
2565454 | 11 | 1.9% |
2269867 | 10 | 1.7% |
2533670 | 10 | 1.7% |
2228525 | 10 | 1.7% |
Other values (92) | 452 | 77.4% |
Most occurring characters
Value | Count | Frequency (%) |
2 | 1143 | 27.7% |
5 | 657 | 15.9% |
7 | 352 | 8.5% |
4 | 343 | 8.3% |
3 | 339 | 8.2% |
6 | 335 | 8.1% |
9 | 259 | 6.3% |
0 | 246 | 6.0% |
1 | 236 | 5.7% |
8 | 208 | 5.0% |
Other values (7) | 14 | 0.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 4118 | 99.7% |
Other Letter | 10 | 0.2% |
Open Punctuation | 2 | < 0.1% |
Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 1143 | 27.8% |
5 | 657 | 16.0% |
7 | 352 | 8.5% |
4 | 343 | 8.3% |
3 | 339 | 8.2% |
6 | 335 | 8.1% |
9 | 259 | 6.3% |
0 | 246 | 6.0% |
1 | 236 | 5.7% |
8 | 208 | 5.1% |
Other Letter
Value | Count | Frequency (%) |
알 | 2 | 20.0% |
려 | 2 | 20.0% |
줘 | 2 | 20.0% |
도 | 2 | 20.0% |
됨 | 2 | 20.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4122 | 99.8% |
Hangul | 10 | 0.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 1143 | 27.7% |
5 | 657 | 15.9% |
7 | 352 | 8.5% |
4 | 343 | 8.3% |
3 | 339 | 8.2% |
6 | 335 | 8.1% |
9 | 259 | 6.3% |
0 | 246 | 6.0% |
1 | 236 | 5.7% |
8 | 208 | 5.0% |
Other values (2) | 4 | 0.1% |
Hangul
Value | Count | Frequency (%) |
알 | 2 | 20.0% |
려 | 2 | 20.0% |
줘 | 2 | 20.0% |
도 | 2 | 20.0% |
됨 | 2 | 20.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4122 | 99.8% |
Hangul | 10 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 1143 | 27.7% |
5 | 657 | 15.9% |
7 | 352 | 8.5% |
4 | 343 | 8.3% |
3 | 339 | 8.2% |
6 | 335 | 8.1% |
9 | 259 | 6.3% |
0 | 246 | 6.0% |
1 | 236 | 5.7% |
8 | 208 | 5.0% |
Other values (2) | 4 | 0.1% |
Hangul
Value | Count | Frequency (%) |
알 | 2 | 20.0% |
려 | 2 | 20.0% |
줘 | 2 | 20.0% |
도 | 2 | 20.0% |
됨 | 2 | 20.0% |
등록일자
Categorical
Distinct | 13 |
---|---|
Distinct (%) | 2.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.0 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 2 |
---|---|
Unique (%) | 0.3% |
Sample
1st row | 2021-10-06 |
---|---|
2nd row | 2021-10-06 |
3rd row | 2021-10-06 |
4th row | 2019-08-30 |
5th row | 2019-09-05 |
Common Values
Value | Count | Frequency (%) |
2021-10-06 | 228 | 36.4% |
2019-09-05 | 216 | 34.4% |
2019-08-30 | 159 | 25.4% |
2022-08-02 | 6 | 1.0% |
2020-02-24 | 4 | 0.6% |
2022-01-03 | 2 | 0.3% |
2022-01-20 | 2 | 0.3% |
2023-09-15 | 2 | 0.3% |
2023-05-22 | 2 | 0.3% |
2022-11-01 | 2 | 0.3% |
Other values (3) | 4 | 0.6% |
 | 일렬번호 | 구분 | 상세구분 | 등록일자 |
---|---|---|---|---|
일렬번호 | 1.000 | 0.428 | 0.867 | 0.237 |
구분 | 0.428 | 1.000 | 0.815 | 0.193 |
상세구분 | 0.867 | 0.815 | 1.000 | 0.492 |
등록일자 | 0.237 | 0.193 | 0.492 | 1.000 |
 | 상세구분 | 구분 | 등록일자 |
---|---|---|---|
상세구분 | 1.000 | 0.518 | 0.184 |
구분 | 0.518 | 1.000 | 0.106 |
등록일자 | 0.184 | 0.106 | 1.000 |
 | 일렬번호 | 구분 | 상세구분 | 등록일자 |
---|---|---|---|---|
일렬번호 | 1.000 | 0.191 | 0.501 | 0.099 |
구분 | 0.191 | 1.000 | 0.518 | 0.106 |
상세구분 | 0.501 | 0.518 | 1.000 | 0.184 |
등록일자 | 0.099 | 0.106 | 0.184 | 1.000 |
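The report does not label which coefficient each matrix above uses. One common association measure for pairs of categorical columns is Cramér's V, so the sketch below shows the plain, uncorrected form of it. Treating it as relevant to these matrices is an assumption. The inputs are the 구분 and 상세구분 values of the first ten sample rows shown later in this report, so the result is not expected to reproduce any coefficient in the matrices.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V for two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    # Chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for x, r in row_totals.items():
        for y, c in col_totals.items():
            expected = r * c / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# 구분 and 상세구분 from the first ten sample rows of this report.
gubun = ["3", "3", "3", "<NA>", "3", "3", "<NA>", "3", "3", "<NA>"]
detail = ["20", "20", "20", "<NA>", "20", "20", "<NA>", "20", "20", "<NA>"]
print(cramers_v(gubun, detail))
```

In these ten rows each 구분 value determines the 상세구분 value, so the statistic is at its maximum of 1. Over all 627 rows the association is weaker.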
First rows
 | 일렬번호 | 상가이름 | 구분 | 상세구분 | 전화번호 | 등록일자 |
---|---|---|---|---|---|---|
0 | 2706 | 몽실 | 3 | 20 | 2210893 | 2021-10-06 |
1 | 2706 | 몽실 | 3 | 20 | 2210893 | 2021-10-06 |
2 | 2706 | 몽실 | 3 | 20 | 2210893 | 2021-10-06 |
3 | 2706 | 몽실 | <NA> | <NA> | 2210893 | 2019-08-30 |
4 | 2706 | 몽실 | 3 | 20 | 2210893 | 2019-09-05 |
5 | 2707 | 몽실 | 3 | 20 | 2210893 | 2021-10-06 |
6 | 2707 | 몽실 | <NA> | <NA> | 2210893 | 2019-08-30 |
7 | 2707 | 몽실 | 3 | 20 | 2210893 | 2019-09-05 |
8 | 2708 | 흙비 | 3 | 20 | 2225636 | 2021-10-06 |
9 | 2708 | 흙비 | <NA> | <NA> | 2225636 | 2019-08-30 |
Last rows
 | 일렬번호 | 상가이름 | 구분 | 상세구분 | 전화번호 | 등록일자 |
---|---|---|---|---|---|---|
617 | 2920 | 미성건강카페 | 5 | 35 | 2249115 | 2019-09-05 |
618 | 2921 | 종료 | <NA> | <NA> | <NA> | 2021-10-06 |
619 | 2921 | 현금출금기 | <NA> | <NA> | 221256057 | 2019-08-30 |
620 | 2921 | 현금출금기 | 4 | 35 | 221256057 | 2019-09-05 |
621 | 2922 | 명품가발 | 3 | 35 | 2266667 | 2021-10-06 |
622 | 2922 | 명품가발 | <NA> | <NA> | 2266667 | 2019-08-30 |
623 | 2922 | 명품가발 | 3 | 35 | 2266667 | 2019-09-05 |
624 | 2923 | 공예협동조합 | 4 | 35 | 8637686 | 2021-10-06 |
625 | 2923 | 공예협동조합 | <NA> | <NA> | 8637686 | 2019-08-30 |
626 | 2923 | 공예협동조합 | 4 | 35 | 8637686 | 2019-09-05 |
Most frequently occurring
 | 일렬번호 | 상가이름 | 구분 | 상세구분 | 전화번호 | 등록일자 | # duplicates |
---|---|---|---|---|---|---|---|
0 | 2706 | 몽실 | 3 | 20 | 2210893 | 2021-10-06 | 3 |
1 | 2753 | 밤블비 | 3 | 20 | 2533670 | 2021-10-06 | 2 |
2 | 2855 | 단골언니 | 3 | 20 | 2542008 | 2021-10-06 | 2 |
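Duplicate groups like those above can be found by counting identical rows. A minimal sketch using only the first ten sample rows from this report, so it recovers only the first duplicate group; the variable names are illustrative:

```python
from collections import Counter

# First ten sample rows of this report, one tuple per row.
rows = [
    (2706, "몽실", "3", "20", "2210893", "2021-10-06"),
    (2706, "몽실", "3", "20", "2210893", "2021-10-06"),
    (2706, "몽실", "3", "20", "2210893", "2021-10-06"),
    (2706, "몽실", "<NA>", "<NA>", "2210893", "2019-08-30"),
    (2706, "몽실", "3", "20", "2210893", "2019-09-05"),
    (2707, "몽실", "3", "20", "2210893", "2021-10-06"),
    (2707, "몽실", "<NA>", "<NA>", "2210893", "2019-08-30"),
    (2707, "몽실", "3", "20", "2210893", "2019-09-05"),
    (2708, "흙비", "3", "20", "2225636", "2021-10-06"),
    (2708, "흙비", "<NA>", "<NA>", "2225636", "2019-08-30"),
]

# Keep only rows that occur more than once, with their occurrence counts.
duplicates = {row: n for row, n in Counter(rows).items() if n > 1}

for row, n in duplicates.items():
    print(n, row)
```

The single group found here is the 2706 / 몽실 row registered on 2021-10-06, which appears three times, as in the first line of the table above.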