Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 535 |
Missing cells | 314 |
Missing cells (%) | 9.8% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 27.3 KiB |
Average record size in memory | 52.2 B |
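The missing-cell percentage follows from the counts in this table, since the number of cells is variables times observations. A minimal Python sketch using only those figures (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 6
n_observations = 535
missing_cells = 314

# Total cells is rows x columns; the missing share is reported as a percentage.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

The 314 missing cells match the two coordinate columns, which each report 157 missing values.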
Variable types
Text | 1 |
---|---|
Categorical | 1 |
Numeric | 4 |
Dataset
Description | Toll office location information (영업소 위치정보) |
---|---|
Author | 충청남도 |
URL | https://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2644 |
Alerts
Y좌표값 is highly overall correlated with 노선명 | High correlation |
노선코드 is highly overall correlated with 노선명 | High correlation |
영업소코드 is highly overall correlated with 노선명 | High correlation |
노선명 is highly overall correlated with Y좌표값 and 2 other fields | High correlation |
X좌표값 has 157 (29.3%) missing values | Missing |
Y좌표값 has 157 (29.3%) missing values | Missing |
영업소명 has unique values | Unique |
영업소코드 has unique values | Unique |
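The Missing and Unique alerts can be re-derived from the per-column counts reported further down. The sketch below is an illustration only: the two rules are assumptions chosen to match the alerts listed here, not the profiling tool's actual logic, and `simple_alerts` is a hypothetical helper.

```python
# Per-column summaries copied from the report (535 observations).
N_ROWS = 535
columns = {
    "영업소명": {"distinct": 535, "missing": 0},
    "노선명": {"distinct": 46, "missing": 0},
    "X좌표값": {"distinct": 378, "missing": 157},
    "Y좌표값": {"distinct": 378, "missing": 157},
    "노선코드": {"distinct": 46, "missing": 0},
    "영업소코드": {"distinct": 535, "missing": 0},
}


def simple_alerts(name, summary, n_rows=N_ROWS):
    """Return illustrative alert strings (assumed rules, not the tool's own)."""
    alerts = []
    if summary["missing"] > 0:
        pct = 100 * summary["missing"] / n_rows
        alerts.append(f"{name} has {summary['missing']} ({pct:.1f}%) missing values")
    # Assumed rule: every row holds a different, non-missing value.
    if summary["missing"] == 0 and summary["distinct"] == n_rows:
        alerts.append(f"{name} has unique values")
    return alerts


all_alerts = [a for name, s in columns.items() for a in simple_alerts(name, s)]
for alert in all_alerts:
    print(alert)
```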
Reproduction
Analysis started | 2024-03-13 11:52:09.766600 |
---|---|
Analysis finished | 2024-03-13 11:52:12.376243 |
Duration | 2.61 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
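For reference, the reported duration is simply the difference between the two timestamps, and a report of this kind can be regenerated with `ProfileReport`. The sketch below makes several assumptions: `build_report`, `toll_offices.csv`, and `report.html` are hypothetical names, and the source CSV is assumed to have been downloaded separately from the URL above.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2024-03-13 11:52:09.766600")
finished = datetime.fromisoformat("2024-03-13 11:52:12.376243")
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")


def build_report(csv_path="toll_offices.csv", out_path="report.html"):
    """Profile a CSV with ydata-profiling (third-party package).

    csv_path and out_path are hypothetical names, not taken from the report.
    """
    # Imported lazily so the duration check above runs without these packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="영업소 위치정보")
    report.to_file(out_path)
    return report
```

Pinning the package to the version shown in the table (4.5.1) is the safest way to get comparable output, since report layout and defaults can change between releases.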
영업소명
Text
UNIQUE
 
Distinct | 535 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.3 KiB |
Value | Count | Frequency (%) |
판교 | 1 | 0.2% |
신림 | 1 | 0.2% |
삼척 | 1 | 0.2% |
속초 | 1 | 0.2% |
북양양 | 1 | 0.2% |
하조대 | 1 | 0.2% |
남강릉 | 1 | 0.2% |
동해 | 1 | 0.2% |
망상 | 1 | 0.2% |
옥계 | 1 | 0.2% |
Other values (525) | 525 | 98.1% |
Most occurring characters
Value | Count | Frequency (%) |
서 | 60 | 4.1% |
동 | 59 | 4.0% |
산 | 59 | 4.0% |
남 | 54 | 3.7% |
주 | 51 | 3.5% |
천 | 49 | 3.3% |
양 | 46 | 3.1% |
북 | 37 | 2.5% |
안 | 28 | 1.9% |
성 | 28 | 1.9% |
Other values (213) | 1000 | 68.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1413 | 96.1% |
Uppercase Letter | 45 | 3.1% |
Close Punctuation | 6 | 0.4% |
Open Punctuation | 6 | 0.4% |
Decimal Number | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 60 | 4.2% |
동 | 59 | 4.2% |
산 | 59 | 4.2% |
남 | 54 | 3.8% |
주 | 51 | 3.6% |
천 | 49 | 3.5% |
양 | 46 | 3.3% |
북 | 37 | 2.6% |
안 | 28 | 2.0% |
성 | 28 | 2.0% |
Other values (204) | 942 | 66.7% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 21 | 46.7% |
J | 19 | 42.2% |
T | 2 | 4.4% |
K | 1 | 2.2% |
E | 1 | 2.2% |
I | 1 | 2.2% |
Close Punctuation
Value | Count | Frequency (%) |
) | 6 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 6 | 100.0% |
Decimal Number
Value | Count | Frequency (%) |
2 | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1413 | 96.1% |
Latin | 45 | 3.1% |
Common | 13 | 0.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 60 | 4.2% |
동 | 59 | 4.2% |
산 | 59 | 4.2% |
남 | 54 | 3.8% |
주 | 51 | 3.6% |
천 | 49 | 3.5% |
양 | 46 | 3.3% |
북 | 37 | 2.6% |
안 | 28 | 2.0% |
성 | 28 | 2.0% |
Other values (204) | 942 | 66.7% |
Latin
Value | Count | Frequency (%) |
C | 21 | 46.7% |
J | 19 | 42.2% |
T | 2 | 4.4% |
K | 1 | 2.2% |
E | 1 | 2.2% |
I | 1 | 2.2% |
Common
Value | Count | Frequency (%) |
) | 6 | 46.2% |
( | 6 | 46.2% |
2 | 1 | 7.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1413 | 96.1% |
ASCII | 58 | 3.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
서 | 60 | 4.2% |
동 | 59 | 4.2% |
산 | 59 | 4.2% |
남 | 54 | 3.8% |
주 | 51 | 3.6% |
천 | 49 | 3.5% |
양 | 46 | 3.3% |
북 | 37 | 2.6% |
안 | 28 | 2.0% |
성 | 28 | 2.0% |
Other values (204) | 942 | 66.7% |
ASCII
Value | Count | Frequency (%) |
C | 21 | 36.2% |
J | 19 | 32.8% |
) | 6 | 10.3% |
( | 6 | 10.3% |
T | 2 | 3.4% |
K | 1 | 1.7% |
E | 1 | 1.7% |
2 | 1 | 1.7% |
I | 1 | 1.7% |
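The category names used in this section (Other Letter, Uppercase Letter, Open and Close Punctuation, Decimal Number) are Unicode general categories. As a rough illustration of how such counts can be produced, the sketch below uses Python's standard `unicodedata` module. `category_counts` is a hypothetical helper, and the third sample string is made up to mix Hangul, Latin capitals, brackets, and a digit.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this section.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Nd": "Decimal Number",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# "판교" and "북양양" appear in the report; "가나(JC)2" is a made-up name.
sample = ["판교", "북양양", "가나(JC)2"]
counts = category_counts(sample)
print(counts)
```

Hangul syllables fall under Other Letter because the script has no upper or lower case, which is why that category dominates the tables above.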
노선명
Categorical
HIGH CORRELATION
 
Distinct | 46 |
---|---|
Distinct (%) | 8.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.3 KiB |
Value | Count |
---|---|
경부선 | 44 |
남해선A | 34 |
호남선A | 31 |
중부선-대전통영선A | 31 |
서해안선 | 29 |
Other values (41) | 366 |
Length
Max length | 10 |
---|---|
Median length | 8 |
Mean length | 5.1196262 |
Min length | 3 |
Unique
Unique | 3 |
---|---|
Unique (%) | 0.6% |
Sample
1st row | 경부선 |
---|---|
2nd row | 경부선 |
3rd row | 경부선 |
4th row | 경부선 |
5th row | 경부선 |
Common Values
Value | Count | Frequency (%) |
경부선 | 44 | 8.2% |
남해선A | 34 | 6.4% |
호남선A | 31 | 5.8% |
중부선-대전통영선A | 31 | 5.8% |
서해안선 | 29 | 5.4% |
수도권제2순환선 | 27 | 5.0% |
영동선 | 25 | 4.7% |
중부내륙선 | 25 | 4.7% |
당진상주선 | 23 | 4.3% |
중앙선 | 22 | 4.1% |
Other values (36) | 244 | 45.6% |
Words
Value | Count | Frequency (%) |
경부선 | 44 | 8.2% |
남해선a | 34 | 6.4% |
호남선a | 31 | 5.8% |
중부선-대전통영선a | 31 | 5.8% |
서해안선 | 29 | 5.4% |
수도권제2순환선 | 27 | 5.0% |
영동선 | 25 | 4.7% |
중부내륙선 | 25 | 4.7% |
당진상주선 | 23 | 4.3% |
중앙선 | 22 | 4.1% |
Other values (36) | 244 | 45.6% |
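Each Frequency (%) figure for 노선명 is the count divided by the 535 observations, and the "Other values" row is whatever the ten listed routes do not cover. A short sketch using only the counts from the Common Values table:

```python
# Counts copied from the "Common Values" table for 노선명.
N_ROWS = 535
top_counts = {
    "경부선": 44, "남해선A": 34, "호남선A": 31, "중부선-대전통영선A": 31,
    "서해안선": 29, "수도권제2순환선": 27, "영동선": 25, "중부내륙선": 25,
    "당진상주선": 23, "중앙선": 22,
}

# Frequency is each count as a share of all observations, in percent.
frequency_pct = {name: round(100 * c / N_ROWS, 1) for name, c in top_counts.items()}

# Rows not covered by the top ten land in the "Other values" row.
other_count = N_ROWS - sum(top_counts.values())
other_pct = round(100 * other_count / N_ROWS, 1)

for name, pct in frequency_pct.items():
    print(f"{name}: {pct}%")
print(f"Other values: {other_count} ({other_pct}%)")
```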
X좌표값
Real number (ℝ)
MISSING
 
Distinct | 378 |
---|---|
Distinct (%) | 100.0% |
Missing | 157 |
Missing (%) | 29.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127.77785 |
Minimum | 126.43506 |
---|---|
Maximum | 129.43521 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.8 KiB |
Quantile statistics
Minimum | 126.43506 |
---|---|
5-th percentile | 126.68688 |
Q1 | 127.10253 |
median | 127.63796 |
Q3 | 128.47895 |
95-th percentile | 129.13705 |
Maximum | 129.43521 |
Range | 3.000148 |
Interquartile range (IQR) | 1.37642 |
Descriptive statistics
Standard deviation | 0.79847217 |
---|---|
Coefficient of variation (CV) | 0.0062489093 |
Kurtosis | -1.1130404 |
Mean | 127.77785 |
Median Absolute Deviation (MAD) | 0.659708 |
Skewness | 0.27127908 |
Sum | 48300.026 |
Variance | 0.6375578 |
Monotonicity | Not monotonic |
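Several of the X좌표값 statistics are derived from others in this section, which gives a quick consistency check. The sketch below recomputes range, IQR, coefficient of variation, and variance from the reported figures. The minimum and maximum are taken at full precision from the extreme-value listings for this variable, and the variable names are illustrative.

```python
# Values copied from the X좌표값 section of the report.
minimum, maximum = 126.435065, 129.435213  # full-precision extremes
q1, q3 = 127.10253, 128.47895
mean = 127.77785
std = 0.79847217

value_range = maximum - minimum  # reported as 3.000148
iqr = q3 - q1                    # reported as 1.37642
cv = std / mean                  # reported as 0.0062489093
variance = std ** 2              # reported as 0.6375578

print(f"range={value_range:.6f} iqr={iqr:.5f} cv={cv:.7f} variance={variance:.7f}")
```

Small differences in the last digits are expected because the inputs are already rounded in the report.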
Common values
Value | Count | Frequency (%) |
126.780851 | 1 | 0.2% |
128.577591 | 1 | 0.2% |
128.563472 | 1 | 0.2% |
128.535586 | 1 | 0.2% |
128.514744 | 1 | 0.2% |
128.539343 | 1 | 0.2% |
127.855581 | 1 | 0.2% |
127.775947 | 1 | 0.2% |
126.84252 | 1 | 0.2% |
126.792398 | 1 | 0.2% |
Other values (368) | 368 | 68.8% |
(Missing) | 157 | 29.3% |
Minimum 10 values
Value | Count | Frequency (%) |
126.435065 | 1 | 0.2% |
126.480819 | 1 | 0.2% |
126.485422 | 1 | 0.2% |
126.48607 | 1 | 0.2% |
126.497717 | 1 | 0.2% |
126.542929 | 1 | 0.2% |
126.554466 | 1 | 0.2% |
126.55674 | 1 | 0.2% |
126.562018 | 1 | 0.2% |
126.56535 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
129.435213 | 1 | 0.2% |
129.394802 | 1 | 0.2% |
129.364993 | 1 | 0.2% |
129.314538 | 1 | 0.2% |
129.298314 | 1 | 0.2% |
129.288158 | 1 | 0.2% |
129.275381 | 1 | 0.2% |
129.262036 | 1 | 0.2% |
129.249202 | 1 | 0.2% |
129.245438 | 1 | 0.2% |
Y좌표값
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 378 |
---|---|
Distinct (%) | 100.0% |
Missing | 157 |
Missing (%) | 29.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36.254568 |
Minimum | 34.693356 |
---|---|
Maximum | 38.202889 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.8 KiB |
Quantile statistics
Minimum | 34.693356 |
---|---|
5-th percentile | 34.996765 |
Q1 | 35.419931 |
median | 36.135414 |
Q3 | 37.102793 |
95-th percentile | 37.670387 |
Maximum | 38.202889 |
Range | 3.509533 |
Interquartile range (IQR) | 1.6828622 |
Descriptive statistics
Standard deviation | 0.90531614 |
---|---|
Coefficient of variation (CV) | 0.024971092 |
Kurtosis | -1.2179285 |
Mean | 36.254568 |
Median Absolute Deviation (MAD) | 0.8423645 |
Skewness | 0.19284873 |
Sum | 13704.227 |
Variance | 0.81959732 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
37.35841 | 1 | 0.2% |
36.288086 | 1 | 0.2% |
36.200288 | 1 | 0.2% |
36.091865 | 1 | 0.2% |
36.04843 | 1 | 0.2% |
35.934437 | 1 | 0.2% |
37.67533 | 1 | 0.2% |
37.839835 | 1 | 0.2% |
37.349762 | 1 | 0.2% |
37.344963 | 1 | 0.2% |
Other values (368) | 368 | 68.8% |
(Missing) | 157 | 29.3% |
Minimum 10 values
Value | Count | Frequency (%) |
34.693356 | 1 | 0.2% |
34.702834 | 1 | 0.2% |
34.737619 | 1 | 0.2% |
34.812134 | 1 | 0.2% |
34.825889 | 1 | 0.2% |
34.837264 | 1 | 0.2% |
34.858761 | 1 | 0.2% |
34.863289 | 1 | 0.2% |
34.869414 | 1 | 0.2% |
34.883591 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
38.202889 | 1 | 0.2% |
38.155223 | 1 | 0.2% |
38.071824 | 1 | 0.2% |
38.029957 | 1 | 0.2% |
37.990592 | 1 | 0.2% |
37.921166 | 1 | 0.2% |
37.917929 | 1 | 0.2% |
37.839835 | 1 | 0.2% |
37.835913 | 1 | 0.2% |
37.785347 | 1 | 0.2% |
노선코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 46 |
---|---|
Distinct (%) | 8.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 129.18131 |
Minimum | 1 |
---|---|
Maximum | 700 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.8 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 17 |
median | 45 |
Q3 | 110 |
95-th percentile | 627 |
Maximum | 700 |
Range | 699 |
Interquartile range (IQR) | 93 |
Descriptive statistics
Standard deviation | 192.32185 |
---|---|
Coefficient of variation (CV) | 1.4887746 |
Kurtosis | 2.1784105 |
Mean | 129.18131 |
Median Absolute Deviation (MAD) | 30 |
Skewness | 1.8590017 |
Sum | 69112 |
Variance | 36987.696 |
Monotonicity | Increasing |
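The Monotonicity row describes whether values never decrease (or never increase) in file order. The sketch below shows one way to classify a sequence. The labels mirror those in the report, but the rule itself is an assumption, `monotonicity` is a hypothetical helper, and the inputs are short excerpts from the sample rows rather than the full columns.

```python
def monotonicity(values):
    """Classify a sequence by comparing each neighbouring pair."""
    pairs = list(zip(values, values[1:]))
    if all(a <= b for a, b in pairs):
        return "Increasing"
    if all(a >= b for a, b in pairs):
        return "Decreasing"
    return "Not monotonic"


# Excerpts in file order: sample rows 0-2 followed by rows 525-527.
route_codes = [1, 1, 1, 700, 700, 700]      # 노선코드
office_codes = [65, 69, 101, 57, 58, 772]   # 영업소코드

print(monotonicity(route_codes))
print(monotonicity(office_codes))
```

Repeated values are allowed under this non-strict rule, which is consistent with 노선코드 being reported as Increasing even though it has only 46 distinct values.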
Common values
Value | Count | Frequency (%) |
1 | 44 | 8.2% |
10 | 34 | 6.4% |
25 | 31 | 5.8% |
35 | 31 | 5.8% |
15 | 29 | 5.4% |
400 | 27 | 5.0% |
50 | 25 | 4.7% |
45 | 25 | 4.7% |
30 | 23 | 4.3% |
55 | 22 | 4.1% |
Other values (36) | 244 | 45.6% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 44 | 8.2% |
10 | 34 | 6.4% |
12 | 12 | 2.2% |
14 | 3 | 0.6% |
15 | 29 | 5.4% |
16 | 1 | 0.2% |
17 | 15 | 2.8% |
20 | 8 | 1.5% |
25 | 31 | 5.8% |
29 | 12 | 2.2% |
Maximum 10 values
Value | Count | Frequency (%) |
700 | 10 | 1.9% |
688 | 11 | 2.1% |
627 | 9 | 1.7% |
600 | 14 | 2.6% |
551 | 2 | 0.4% |
500 | 3 | 0.6% |
451 | 5 | 0.9% |
400 | 27 | 5.0% |
301 | 12 | 2.2% |
300 | 2 | 0.4% |
영업소코드
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 535 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 448.43925 |
Minimum | 4 |
---|---|
Maximum | 987 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.8 KiB |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 63.7 |
Q1 | 193.5 |
median | 527 |
Q3 | 678.5 |
95-th percentile | 832.3 |
Maximum | 987 |
Range | 983 |
Interquartile range (IQR) | 485 |
Descriptive statistics
Standard deviation | 266.62386 |
---|---|
Coefficient of variation (CV) | 0.5945596 |
Kurtosis | -1.4183981 |
Mean | 448.43925 |
Median Absolute Deviation (MAD) | 251 |
Skewness | -0.0052675723 |
Sum | 239915 |
Variance | 71088.284 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
65 | 1 | 0.2% |
765 | 1 | 0.2% |
703 | 1 | 0.2% |
702 | 1 | 0.2% |
701 | 1 | 0.2% |
585 | 1 | 0.2% |
584 | 1 | 0.2% |
583 | 1 | 0.2% |
582 | 1 | 0.2% |
581 | 1 | 0.2% |
Other values (525) | 525 | 98.1% |
Minimum 10 values
Value | Count | Frequency (%) |
4 | 1 | 0.2% |
11 | 1 | 0.2% |
12 | 1 | 0.2% |
13 | 1 | 0.2% |
25 | 1 | 0.2% |
26 | 1 | 0.2% |
29 | 1 | 0.2% |
31 | 1 | 0.2% |
33 | 1 | 0.2% |
34 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
987 | 1 | 0.2% |
986 | 1 | 0.2% |
985 | 1 | 0.2% |
984 | 1 | 0.2% |
983 | 1 | 0.2% |
982 | 1 | 0.2% |
981 | 1 | 0.2% |
879 | 1 | 0.2% |
876 | 1 | 0.2% |
870 | 1 | 0.2% |
Variable | 노선명 | X좌표값 | Y좌표값 | 노선코드 | 영업소코드 |
---|---|---|---|---|---|
노선명 | 1.000 | 0.814 | 0.865 | 1.000 | 0.898 |
X좌표값 | 0.814 | 1.000 | 0.585 | 0.459 | 0.440 |
Y좌표값 | 0.865 | 0.585 | 1.000 | 0.535 | 0.506 |
노선코드 | 1.000 | 0.459 | 0.535 | 1.000 | 0.624 |
영업소코드 | 0.898 | 0.440 | 0.506 | 0.624 | 1.000 |
Variable | X좌표값 | Y좌표값 | 노선코드 | 영업소코드 | 노선명 |
---|---|---|---|---|---|
X좌표값 | 1.000 | -0.089 | 0.145 | 0.071 | 0.428 |
Y좌표값 | -0.089 | 1.000 | 0.225 | -0.069 | 0.503 |
노선코드 | 0.145 | 0.225 | 1.000 | 0.259 | 0.965 |
영업소코드 | 0.071 | -0.069 | 0.259 | 1.000 | 0.573 |
노선명 | 0.428 | 0.503 | 0.965 | 0.573 | 1.000 |
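The 1.000 between 노선명 and 노선코드 in the first matrix is what one would expect if each route name maps to exactly one route code. The sketch below illustrates this with a plain, uncorrected Cramér's V on toy data. It is an assumption that the report measures association with a Cramér's V variant (the tool may apply a bias correction or discretize numeric columns first). `cramers_v` is a hypothetical helper and the `route_b` and `route_c` labels are invented.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V for two equal-length label sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    chi2 = 0.0
    for x, x_count in x_totals.items():
        for y, y_count in y_totals.items():
            expected = x_count * y_count / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Toy data: each name maps to exactly one code, as 경부선 -> 1 does in the
# sample rows. The other pairings are illustrative, not read from the dataset.
names = ["경부선", "경부선", "route_b", "route_b", "route_c", "route_c"]
codes = [1, 1, 10, 10, 25, 25]
v_one_to_one = cramers_v(names, codes)

# Contrast: two labels that are spread evenly over each other.
v_independent = cramers_v(["a", "a", "b", "b"], [1, 2, 1, 2])

print(round(v_one_to_one, 3), round(v_independent, 3))
```

Because the two columns carry the same information, one of them can usually be dropped or treated as a lookup key in downstream analysis.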
First rows
Index | 영업소명 | 노선명 | X좌표값 | Y좌표값 | 노선코드 | 영업소코드 |
---|---|---|---|---|---|---|
0 | 판교 | 경부선 | 127.1032 | 37.396399 | 1 | 65 |
1 | 대왕판교 | 경부선 | 127.093456 | 37.408887 | 1 | 69 |
2 | 서울 | 경부선 | 127.102077 | 37.365046 | 1 | 101 |
3 | 수원신갈 | 경부선 | 127.102395 | 37.266835 | 1 | 103 |
4 | 기흥 | 경부선 | 127.102448 | 37.222267 | 1 | 105 |
5 | 오산 | 경부선 | 127.08013 | 37.143445 | 1 | 106 |
6 | 안성 | 경부선 | 127.15041 | 36.995586 | 1 | 107 |
7 | 천안 | 경부선 | 127.166314 | 36.826143 | 1 | 108 |
8 | 목천 | 경부선 | 127.230608 | 36.768049 | 1 | 110 |
9 | 청주 | 경부선 | 127.37979 | 36.625153 | 1 | 111 |
Last rows
Index | 영업소명 | 노선명 | X좌표값 | Y좌표값 | 노선코드 | 영업소코드 |
---|---|---|---|---|---|---|
525 | 다사 | 대구외곽순환선 | 128.469342 | 35.849407 | 700 | 57 |
526 | 지천 | 대구외곽순환선 | 128.534811 | 35.937639 | 700 | 58 |
527 | 북달성 | 대구외곽순환선 | 128.458653 | 35.871124 | 700 | 772 |
528 | 북다사 | 대구외곽순환선 | 128.468224 | 35.885329 | 700 | 773 |
529 | 남칠곡 | 대구외곽순환선 | 128.515656 | 35.916919 | 700 | 774 |
530 | 동명동호 | 대구외곽순환선 | 128.5517 | 35.970377 | 700 | 775 |
531 | 연경 | 대구외곽순환선 | 128.611328 | 35.936574 | 700 | 776 |
532 | 파군재 | 대구외곽순환선 | 128.637672 | 35.934839 | 700 | 777 |
533 | 둔산 | 대구외곽순환선 | 128.689438 | 35.900013 | 700 | 778 |
534 | 율암 | 대구외곽순환선 | 128.701789 | 35.888182 | 700 | 779 |
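The coordinate ranges (X about 126.4 to 129.4, Y about 34.7 to 38.2) are consistent with longitude and latitude in degrees over South Korea. Under that assumption, which the report itself does not state, distances between offices can be estimated with the haversine formula. `haversine_km` is a hypothetical helper and the Earth radius is the usual mean value.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lon1, lat1, lon2, lat2):
    """Great-circle distance in kilometres between two lon/lat points in degrees."""
    lon1, lat1, lon2, lat2 = map(radians, (lon1, lat1, lon2, lat2))
    a = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(a))


# Sample rows 0 and 1: 판교 and 대왕판교, as (X좌표값, Y좌표값).
pangyo = (127.1032, 37.396399)
daewang_pangyo = (127.093456, 37.408887)

distance_km = haversine_km(*pangyo, *daewang_pangyo)
print(f"{distance_km:.2f} km")
```

Rows with missing coordinates (157 of 535) would need to be filtered out or geocoded before any distance calculation.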