Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 23 |
Missing cells | 23 |
Missing cells (%) | 11.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.9 KiB |
Average record size in memory | 84.7 B |
Variable types
Numeric | 4 |
---|---|
Text | 2 |
Categorical | 3 |
Dataset
Description | 화성시 버스정보안내단말기 설치현황에 대한 데이터로 연번. 읍면동, 버스정류장수, 쉘터형, 독립형 버스정류장 수에 대한 데이터를 포함하고 있습니다. |
---|---|
Author | 경기도 화성시 |
URL | https://www.data.go.kr/data/15041970/fileData.do |
버스정류장수 is highly overall correlated with 쉘터형 | High correlation |
쉘터형 is highly overall correlated with 버스정류장수 and 4 other fields | High correlation |
독립형 is highly overall correlated with 쉘터형 and 1 other fields | High correlation |
전자종이형 is highly overall correlated with 쉘터형 | High correlation |
교통약자 is highly overall correlated with 쉘터형 and 1 other fields | High correlation |
확인필요 is highly overall correlated with 쉘터형 and 2 other fields | High correlation |
전자종이형 is highly imbalanced (57.4%) | Imbalance |
버스정류장수 has 2 (8.7%) missing values | Missing |
비고 has 21 (91.3%) missing values | Missing |
연번 has unique values | Unique |
읍면동 has unique values | Unique |
쉘터형 has 1 (4.3%) zeros | Zeros |
독립형 has 1 (4.3%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-04 08:01:26.757592 |
---|---|
Analysis finished | 2024-05-04 08:01:33.823427 |
Duration | 7.07 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
UNIQUE
 
Distinct | 23 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12 |
Minimum | 1 |
---|---|
Maximum | 23 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.1 |
Q1 | 6.5 |
median | 12 |
Q3 | 17.5 |
95-th percentile | 21.9 |
Maximum | 23 |
Range | 22 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 6.78233 |
---|---|
Coefficient of variation (CV) | 0.56519417 |
Kurtosis | -1.2 |
Mean | 12 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 0 |
Sum | 276 |
Variance | 46 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 4.3% |
2 | 1 | 4.3% |
23 | 1 | 4.3% |
22 | 1 | 4.3% |
21 | 1 | 4.3% |
20 | 1 | 4.3% |
19 | 1 | 4.3% |
18 | 1 | 4.3% |
17 | 1 | 4.3% |
16 | 1 | 4.3% |
Other values (13) | 13 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
23 | 1 | |
22 | 1 | |
21 | 1 | |
20 | 1 | |
19 | 1 | |
18 | 1 | |
17 | 1 | |
16 | 1 | |
15 | 1 | |
14 | 1 |
읍면동
Text
UNIQUE
 
Distinct | 23 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 316.0 B |
Value | Count | Frequency (%) |
정남면 | 1 | 4.3% |
마도면 | 1 | 4.3% |
수원 | 1 | 4.3% |
양감면 | 1 | 4.3% |
장안면 | 1 | 4.3% |
팔탄면 | 1 | 4.3% |
우정읍 | 1 | 4.3% |
향남읍 | 1 | 4.3% |
새솔동 | 1 | 4.3% |
서신면 | 1 | 4.3% |
Other values (13) | 13 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 10 | 12.8% |
면 | 9 | 11.5% |
읍 | 4 | 5.1% |
산 | 3 | 3.8% |
남 | 3 | 3.8% |
탄 | 3 | 3.8% |
, | 3 | 3.8% |
정 | 2 | 2.6% |
양 | 2 | 2.6% |
봉 | 2 | 2.6% |
Other values (33) | 37 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 67 | |
Decimal Number | 7 | 9.0% |
Other Punctuation | 3 | 3.8% |
Math Symbol | 1 | 1.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 10 | 14.9% |
면 | 9 | 13.4% |
읍 | 4 | 6.0% |
산 | 3 | 4.5% |
남 | 3 | 4.5% |
탄 | 3 | 4.5% |
정 | 2 | 3.0% |
양 | 2 | 3.0% |
봉 | 2 | 3.0% |
송 | 2 | 3.0% |
Other values (26) | 27 |
Decimal Number
Value | Count | Frequency (%) |
2 | 2 | |
1 | 2 | |
8 | 1 | |
4 | 1 | |
3 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 3 |
Math Symbol
Value | Count | Frequency (%) |
~ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 67 | |
Common | 11 | 14.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 10 | 14.9% |
면 | 9 | 13.4% |
읍 | 4 | 6.0% |
산 | 3 | 4.5% |
남 | 3 | 4.5% |
탄 | 3 | 4.5% |
정 | 2 | 3.0% |
양 | 2 | 3.0% |
봉 | 2 | 3.0% |
송 | 2 | 3.0% |
Other values (26) | 27 |
Common
Value | Count | Frequency (%) |
, | 3 | |
2 | 2 | |
1 | 2 | |
8 | 1 | 9.1% |
~ | 1 | 9.1% |
4 | 1 | 9.1% |
3 | 1 | 9.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 67 | |
ASCII | 11 | 14.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 10 | 14.9% |
면 | 9 | 13.4% |
읍 | 4 | 6.0% |
산 | 3 | 4.5% |
남 | 3 | 4.5% |
탄 | 3 | 4.5% |
정 | 2 | 3.0% |
양 | 2 | 3.0% |
봉 | 2 | 3.0% |
송 | 2 | 3.0% |
Other values (26) | 27 |
ASCII
Value | Count | Frequency (%) |
, | 3 | |
2 | 2 | |
1 | 2 | |
8 | 1 | 9.1% |
~ | 1 | 9.1% |
4 | 1 | 9.1% |
3 | 1 | 9.1% |
버스정류장수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 21 |
---|---|
Distinct (%) | 100.0% |
Missing | 2 |
Missing (%) | 8.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 144.38095 |
Minimum | 27 |
---|---|
Maximum | 333 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 27 |
---|---|
5-th percentile | 29 |
Q1 | 80 |
median | 125 |
Q3 | 179 |
95-th percentile | 297 |
Maximum | 333 |
Range | 306 |
Interquartile range (IQR) | 99 |
Descriptive statistics
Standard deviation | 88.179633 |
---|---|
Coefficient of variation (CV) | 0.61074284 |
Kurtosis | -0.39060572 |
Mean | 144.38095 |
Median Absolute Deviation (MAD) | 53 |
Skewness | 0.63269435 |
Sum | 3032 |
Variance | 7775.6476 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
67 | 1 | 4.3% |
118 | 1 | 4.3% |
166 | 1 | 4.3% |
176 | 1 | 4.3% |
197 | 1 | 4.3% |
297 | 1 | 4.3% |
27 | 1 | 4.3% |
179 | 1 | 4.3% |
142 | 1 | 4.3% |
116 | 1 | 4.3% |
Other values (11) | 11 | |
(Missing) | 2 | 8.7% |
Value | Count | Frequency (%) |
27 | 1 | |
29 | 1 | |
40 | 1 | |
67 | 1 | |
73 | 1 | |
80 | 1 | |
81 | 1 | |
85 | 1 | |
116 | 1 | |
118 | 1 |
Value | Count | Frequency (%) |
333 | 1 | |
297 | 1 | |
263 | 1 | |
260 | 1 | |
197 | 1 | |
179 | 1 | |
178 | 1 | |
176 | 1 | |
166 | 1 | |
142 | 1 |
쉘터형
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 22 |
---|---|
Distinct (%) | 95.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 67.478261 |
Minimum | 0 |
---|---|
Maximum | 250 |
Zeros | 1 |
Zeros (%) | 4.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1.8 |
Q1 | 36.5 |
median | 50 |
Q3 | 82 |
95-th percentile | 156.5 |
Maximum | 250 |
Range | 250 |
Interquartile range (IQR) | 45.5 |
Descriptive statistics
Standard deviation | 58.492322 |
---|---|
Coefficient of variation (CV) | 0.8668321 |
Kurtosis | 3.1819864 |
Mean | 67.478261 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 1.6412432 |
Sum | 1552 |
Variance | 3421.3518 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50 | 2 | 8.7% |
80 | 1 | 4.3% |
45 | 1 | 4.3% |
1 | 1 | 4.3% |
0 | 1 | 4.3% |
48 | 1 | 4.3% |
59 | 1 | 4.3% |
61 | 1 | 4.3% |
84 | 1 | 4.3% |
143 | 1 | 4.3% |
Other values (12) | 12 |
Value | Count | Frequency (%) |
0 | 1 | |
1 | 1 | |
9 | 1 | |
22 | 1 | |
25 | 1 | |
35 | 1 | |
38 | 1 | |
39 | 1 | |
45 | 1 | |
48 | 1 |
Value | Count | Frequency (%) |
250 | 1 | |
158 | 1 | |
143 | 1 | |
139 | 1 | |
107 | 1 | |
84 | 1 | |
80 | 1 | |
61 | 1 | |
59 | 1 | |
56 | 1 |
독립형
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 17 |
---|---|
Distinct (%) | 73.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.130435 |
Minimum | 0 |
---|---|
Maximum | 24 |
Zeros | 1 |
Zeros (%) | 4.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 339.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 10 |
Q3 | 15 |
95-th percentile | 22.8 |
Maximum | 24 |
Range | 24 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 7.6056459 |
---|---|
Coefficient of variation (CV) | 0.75077191 |
Kurtosis | -1.0538642 |
Mean | 10.130435 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.34523572 |
Sum | 233 |
Variance | 57.84585 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3 | |
6 | 2 | 8.7% |
3 | 2 | 8.7% |
15 | 2 | 8.7% |
13 | 2 | 8.7% |
0 | 1 | 4.3% |
11 | 1 | 4.3% |
7 | 1 | 4.3% |
21 | 1 | 4.3% |
2 | 1 | 4.3% |
Other values (7) | 7 |
Value | Count | Frequency (%) |
0 | 1 | 4.3% |
1 | 3 | |
2 | 1 | 4.3% |
3 | 2 | |
6 | 2 | |
7 | 1 | 4.3% |
8 | 1 | 4.3% |
10 | 1 | 4.3% |
11 | 1 | 4.3% |
12 | 1 | 4.3% |
Value | Count | Frequency (%) |
24 | 1 | |
23 | 1 | |
21 | 1 | |
20 | 1 | |
18 | 1 | |
15 | 2 | |
13 | 2 | |
12 | 1 | |
11 | 1 | |
10 | 1 |
전자종이형
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 8.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 316.0 B |
0 | |
---|---|
1 | 2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 21 | |
1 | 2 | 8.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 21 | |
1 | 2 | 8.7% |
교통약자
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 21.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 316.0 B |
0 | |
---|---|
4 | |
2 | 1 |
11 | 1 |
8 | 1 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.0434783 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 13.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 2 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 17 | |
4 | 3 | 13.0% |
2 | 1 | 4.3% |
11 | 1 | 4.3% |
8 | 1 | 4.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 17 | |
4 | 3 | 13.0% |
2 | 1 | 4.3% |
11 | 1 | 4.3% |
8 | 1 | 4.3% |
확인필요
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 17.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 316.0 B |
0 | |
---|---|
1 | |
2 | |
4 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 4.3% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 2 |
4th row | 1 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 15 | |
1 | 5 | 21.7% |
2 | 2 | 8.7% |
4 | 1 | 4.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 15 | |
1 | 5 | 21.7% |
2 | 2 | 8.7% |
4 | 1 | 4.3% |
비고
Text
MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 100.0% |
Missing | 21 |
Missing (%) | 91.3% |
Memory size | 316.0 B |
Value | Count | Frequency (%) |
국립축산과학원 | 1 | |
서동탄역 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
국 | 1 | |
립 | 1 | |
축 | 1 | |
산 | 1 | |
과 | 1 | |
학 | 1 | |
원 | 1 | |
서 | 1 | |
동 | 1 | |
탄 | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 11 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
국 | 1 | |
립 | 1 | |
축 | 1 | |
산 | 1 | |
과 | 1 | |
학 | 1 | |
원 | 1 | |
서 | 1 | |
동 | 1 | |
탄 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 11 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
국 | 1 | |
립 | 1 | |
축 | 1 | |
산 | 1 | |
과 | 1 | |
학 | 1 | |
원 | 1 | |
서 | 1 | |
동 | 1 | |
탄 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 11 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
국 | 1 | |
립 | 1 | |
축 | 1 | |
산 | 1 | |
과 | 1 | |
학 | 1 | |
원 | 1 | |
서 | 1 | |
동 | 1 | |
탄 | 1 |
연번 | 읍면동 | 버스정류장수 | 쉘터형 | 독립형 | 전자종이형 | 교통약자 | 확인필요 | 비고 | |
---|---|---|---|---|---|---|---|---|---|
연번 | 1.000 | 1.000 | 0.675 | 0.740 | 0.721 | 0.000 | 0.518 | 0.000 | NaN |
읍면동 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
버스정류장수 | 0.675 | 1.000 | 1.000 | 0.840 | 0.192 | 0.328 | 0.764 | 0.375 | NaN |
쉘터형 | 0.740 | 1.000 | 0.840 | 1.000 | 0.757 | 0.935 | 0.805 | 0.922 | NaN |
독립형 | 0.721 | 1.000 | 0.192 | 0.757 | 1.000 | 0.000 | 0.710 | 0.791 | NaN |
전자종이형 | 0.000 | 1.000 | 0.328 | 0.935 | 0.000 | 1.000 | 0.000 | 0.462 | NaN |
교통약자 | 0.518 | 1.000 | 0.764 | 0.805 | 0.710 | 0.000 | 1.000 | 0.599 | NaN |
확인필요 | 0.000 | 1.000 | 0.375 | 0.922 | 0.791 | 0.462 | 0.599 | 1.000 | NaN |
비고 | NaN | 0.000 | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 |
교통약자 | 전자종이형 | 확인필요 | |
---|---|---|---|
교통약자 | 1.000 | 0.000 | 0.506 |
전자종이형 | 0.000 | 1.000 | 0.287 |
확인필요 | 0.506 | 0.287 | 1.000 |
연번 | 버스정류장수 | 쉘터형 | 독립형 | 전자종이형 | 교통약자 | 확인필요 | |
---|---|---|---|---|---|---|---|
연번 | 1.000 | 0.347 | -0.190 | -0.356 | 0.000 | 0.036 | 0.000 |
버스정류장수 | 0.347 | 1.000 | 0.859 | 0.493 | 0.223 | 0.483 | 0.150 |
쉘터형 | -0.190 | 0.859 | 1.000 | 0.727 | 0.654 | 0.594 | 0.557 |
독립형 | -0.356 | 0.493 | 0.727 | 1.000 | 0.000 | 0.368 | 0.510 |
전자종이형 | 0.000 | 0.223 | 0.654 | 0.000 | 1.000 | 0.000 | 0.287 |
교통약자 | 0.036 | 0.483 | 0.594 | 0.368 | 0.000 | 1.000 | 0.506 |
확인필요 | 0.000 | 0.150 | 0.557 | 0.510 | 0.287 | 0.506 | 1.000 |
연번 | 읍면동 | 버스정류장수 | 쉘터형 | 독립형 | 전자종이형 | 교통약자 | 확인필요 | 비고 | |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 정남면 | 125 | 80 | 6 | 1 | 0 | 0 | <NA> |
1 | 2 | 진안동 | 67 | 50 | 8 | 0 | 0 | 0 | <NA> |
2 | 3 | 병점1,2동 | 73 | 50 | 15 | 0 | 2 | 2 | <NA> |
3 | 4 | 반월동 | 40 | 22 | 13 | 0 | 0 | 1 | <NA> |
4 | 5 | 기배동 | 29 | 9 | 1 | 0 | 0 | 0 | <NA> |
5 | 6 | 화산동 | 81 | 56 | 20 | 0 | 0 | 0 | <NA> |
6 | 7 | 동탄1,2,3동 | 178 | 139 | 13 | 0 | 11 | 0 | <NA> |
7 | 8 | 동탄4~8동 | 333 | 250 | 24 | 0 | 8 | 1 | <NA> |
8 | 9 | 봉담읍 | 260 | 158 | 23 | 1 | 4 | 2 | <NA> |
9 | 10 | 남양읍 | 263 | 107 | 18 | 0 | 4 | 4 | <NA> |
연번 | 읍면동 | 버스정류장수 | 쉘터형 | 독립형 | 전자종이형 | 교통약자 | 확인필요 | 비고 | |
---|---|---|---|---|---|---|---|---|---|
13 | 14 | 송산면 | 142 | 45 | 1 | 0 | 0 | 0 | <NA> |
14 | 15 | 서신면 | 179 | 53 | 3 | 0 | 0 | 0 | <NA> |
15 | 16 | 새솔동 | 27 | 35 | 2 | 0 | 0 | 0 | <NA> |
16 | 17 | 향남읍 | 297 | 143 | 21 | 0 | 4 | 1 | <NA> |
17 | 18 | 우정읍 | 197 | 84 | 7 | 0 | 0 | 0 | <NA> |
18 | 19 | 팔탄면 | 176 | 61 | 15 | 0 | 0 | 1 | <NA> |
19 | 20 | 장안면 | 166 | 59 | 11 | 0 | 0 | 0 | <NA> |
20 | 21 | 양감면 | 118 | 48 | 3 | 0 | 0 | 1 | <NA> |
21 | 22 | 수원 | <NA> | 0 | 1 | 0 | 0 | 0 | 국립축산과학원 |
22 | 23 | 오산 | <NA> | 1 | 0 | 0 | 0 | 0 | 서동탄역 |