Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 216 |
Missing cells | 21 |
Missing cells (%) | 1.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 12.8 KiB |
Average record size in memory | 60.6 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 2 |
Text | 1 |
Dataset
Description | 전라북도 정읍시 도로기반시설물 정보( 관리번호, 행정읍면동, 도엽번호, 도로구간번호, 설치일자, 정류장명)등 자료제공 합니다. |
---|---|
Author | 전라북도 정읍시 |
URL | https://www.data.go.kr/data/15085009/fileData.do |
데이터기준일자 has constant value "" | Constant |
관리번호 is highly overall correlated with 행정읍면동 and 2 other fields | High correlation |
행정읍면동 is highly overall correlated with 관리번호 and 1 other fields | High correlation |
도엽번호 is highly overall correlated with 관리번호 and 2 other fields | High correlation |
설치일자 is highly overall correlated with 관리번호 and 1 other fields | High correlation |
설치일자 is highly imbalanced (61.2%) | Imbalance |
정류장명 has 21 (9.7%) missing values | Missing |
관리번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 11:55:27.407734 |
---|---|
Analysis finished | 2023-12-12 11:55:30.562286 |
Duration | 3.15 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
관리번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 216 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 133741.59 |
Minimum | 1 |
---|---|
Maximum | 989301 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 11.75 |
Q1 | 55.75 |
median | 115.5 |
Q3 | 171.25 |
95-th percentile | 900001.25 |
Maximum | 989301 |
Range | 989300 |
Interquartile range (IQR) | 115.5 |
Descriptive statistics
Standard deviation | 284618.54 |
---|---|
Coefficient of variation (CV) | 2.1281229 |
Kurtosis | 2.1797465 |
Mean | 133741.59 |
Median Absolute Deviation (MAD) | 58 |
Skewness | 1.9308629 |
Sum | 28888184 |
Variance | 8.1007715 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
159 | 1 | 0.5% |
102 | 1 | 0.5% |
129 | 1 | 0.5% |
9 | 1 | 0.5% |
127 | 1 | 0.5% |
122 | 1 | 0.5% |
8 | 1 | 0.5% |
143 | 1 | 0.5% |
39 | 1 | 0.5% |
141 | 1 | 0.5% |
Other values (206) | 206 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
989301 | 1 | |
984301 | 1 | |
936001 | 1 | |
926001 | 1 | |
900008 | 1 | |
900007 | 1 | |
900006 | 1 | |
900005 | 1 | |
900004 | 1 | |
900003 | 1 |
행정읍면동
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 14 |
---|---|
Distinct (%) | 6.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.5180457 × 109 |
Minimum | 4.518025 × 109 |
---|---|
Maximum | 4.5180595 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.0 KiB |
Quantile statistics
Minimum | 4.518025 × 109 |
---|---|
5-th percentile | 4.518025 × 109 |
Q1 | 4.518031 × 109 |
median | 4.5180535 × 109 |
Q3 | 4.518056 × 109 |
95-th percentile | 4.5180595 × 109 |
Maximum | 4.5180595 × 109 |
Range | 34500 |
Interquartile range (IQR) | 25000 |
Descriptive statistics
Standard deviation | 13214.091 |
---|---|
Coefficient of variation (CV) | 2.9247361 × 10-6 |
Kurtosis | -1.3010968 |
Mean | 4.5180457 × 109 |
Median Absolute Deviation (MAD) | 4500 |
Skewness | -0.65576747 |
Sum | 9.7589787 × 1011 |
Variance | 1.7461221 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4518025000 | 47 | |
4518053500 | 36 | |
4518059500 | 19 | |
4518058000 | 19 | |
4518051000 | 17 | 7.9% |
4518057000 | 14 | 6.5% |
4518056000 | 14 | 6.5% |
4518032000 | 10 | 4.6% |
4518042000 | 10 | 4.6% |
4518054500 | 9 | 4.2% |
Other values (4) | 21 |
Value | Count | Frequency (%) |
4518025000 | 47 | |
4518031000 | 8 | 3.7% |
4518032000 | 10 | 4.6% |
4518039000 | 4 | 1.9% |
4518040000 | 1 | 0.5% |
4518042000 | 10 | 4.6% |
4518051000 | 17 | 7.9% |
4518052000 | 8 | 3.7% |
4518053500 | 36 | |
4518054500 | 9 | 4.2% |
Value | Count | Frequency (%) |
4518059500 | 19 | |
4518058000 | 19 | |
4518057000 | 14 | 6.5% |
4518056000 | 14 | 6.5% |
4518054500 | 9 | 4.2% |
4518053500 | 36 | |
4518052000 | 8 | 3.7% |
4518051000 | 17 | |
4518042000 | 10 | 4.6% |
4518040000 | 1 | 0.5% |
도엽번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 148 |
---|---|
Distinct (%) | 68.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.5608176 × 109 |
Minimum | 3.5608025 × 109 |
---|---|
Maximum | 3.5612021 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.0 KiB |
Quantile statistics
Minimum | 3.5608025 × 109 |
---|---|
5-th percentile | 3.5608028 × 109 |
Q1 | 3.5608139 × 109 |
median | 3.5608179 × 109 |
Q3 | 3.5608188 × 109 |
95-th percentile | 3.5608233 × 109 |
Maximum | 3.5612021 × 109 |
Range | 399609 |
Interquartile range (IQR) | 4951.25 |
Descriptive statistics
Standard deviation | 26969.176 |
---|---|
Coefficient of variation (CV) | 7.5738718 × 10-6 |
Kurtosis | 194.499 |
Mean | 3.5608176 × 109 |
Median Absolute Deviation (MAD) | 1907.5 |
Skewness | 13.579214 |
Sum | 7.6913661 × 1011 |
Variance | 7.2733647 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3560802684 | 5 | 2.3% |
3560818312 | 5 | 2.3% |
3560818633 | 4 | 1.9% |
3560814103 | 3 | 1.4% |
3560823501 | 3 | 1.4% |
3560808283 | 3 | 1.4% |
3560823391 | 3 | 1.4% |
3560818822 | 3 | 1.4% |
3560818834 | 2 | 0.9% |
3560818843 | 2 | 0.9% |
Other values (138) | 183 |
Value | Count | Frequency (%) |
3560802504 | 2 | 0.9% |
3560802582 | 2 | 0.9% |
3560802683 | 1 | 0.5% |
3560802684 | 5 | |
3560802693 | 1 | 0.5% |
3560802801 | 1 | 0.5% |
3560803433 | 1 | 0.5% |
3560803621 | 1 | 0.5% |
3560803713 | 1 | 0.5% |
3560803812 | 1 | 0.5% |
Value | Count | Frequency (%) |
3561202113 | 1 | 0.5% |
3560823604 | 2 | |
3560823501 | 3 | |
3560823391 | 3 | |
3560823312 | 2 | |
3560823284 | 1 | 0.5% |
3560823174 | 1 | 0.5% |
3560823172 | 1 | 0.5% |
3560823074 | 2 | |
3560823062 | 1 | 0.5% |
도로구간번호
Real number (ℝ)
Distinct | 176 |
---|---|
Distinct (%) | 81.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 367872.22 |
Minimum | 15 |
---|---|
Maximum | 989306 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.0 KiB |
Quantile statistics
Minimum | 15 |
---|---|
5-th percentile | 426.5 |
Q1 | 3184.75 |
median | 310421.5 |
Q3 | 709789.5 |
95-th percentile | 900071 |
Maximum | 989306 |
Range | 989291 |
Interquartile range (IQR) | 706604.75 |
Descriptive statistics
Standard deviation | 320228.41 |
---|---|
Coefficient of variation (CV) | 0.87048814 |
Kurtosis | -1.2539801 |
Mean | 367872.22 |
Median Absolute Deviation (MAD) | 308398.5 |
Skewness | 0.37662795 |
Sum | 79460400 |
Variance | 1.0254623 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
328003 | 6 | 2.8% |
310728 | 3 | 1.4% |
307001 | 3 | 1.4% |
310706 | 3 | 1.4% |
360002 | 2 | 0.9% |
710021 | 2 | 0.9% |
720187 | 2 | 0.9% |
434 | 2 | 0.9% |
850 | 2 | 0.9% |
404 | 2 | 0.9% |
Other values (166) | 189 |
Value | Count | Frequency (%) |
15 | 2 | |
83 | 1 | |
96 | 1 | |
193 | 1 | |
235 | 1 | |
237 | 1 | |
289 | 1 | |
327 | 1 | |
404 | 2 | |
434 | 2 |
Value | Count | Frequency (%) |
989306 | 1 | |
984302 | 1 | |
936001 | 1 | |
926007 | 1 | |
901023 | 1 | |
901020 | 1 | |
900342 | 2 | |
900341 | 1 | |
900310 | 1 | |
900071 | 2 |
설치일자
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 14 |
---|---|
Distinct (%) | 6.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.8 KiB |
1900-01-01 | |
---|---|
2012-01-01 | |
2013-01-01 | 11 |
1998-05-01 | 5 |
2010-01-01 | 5 |
Other values (9) | 13 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 7 ? |
---|---|
Unique (%) | 3.2% |
Sample
1st row | 1900-01-01 |
---|---|
2nd row | 1900-01-01 |
3rd row | 1900-01-01 |
4th row | 1900-01-01 |
5th row | 1900-01-01 |
Common Values
Value | Count | Frequency (%) |
1900-01-01 | 165 | |
2012-01-01 | 17 | 7.9% |
2013-01-01 | 11 | 5.1% |
1998-05-01 | 5 | 2.3% |
2010-01-01 | 5 | 2.3% |
2009-01-01 | 4 | 1.9% |
2003-08-01 | 2 | 0.9% |
2020-10-01 | 1 | 0.5% |
2003-06-01 | 1 | 0.5% |
2015-08-31 | 1 | 0.5% |
Other values (4) | 4 | 1.9% |
Length
Value | Count | Frequency (%) |
1900-01-01 | 165 | |
2012-01-01 | 17 | 7.9% |
2013-01-01 | 11 | 5.1% |
1998-05-01 | 5 | 2.3% |
2010-01-01 | 5 | 2.3% |
2009-01-01 | 4 | 1.9% |
2003-08-01 | 2 | 0.9% |
2020-10-01 | 1 | 0.5% |
2003-06-01 | 1 | 0.5% |
2015-08-31 | 1 | 0.5% |
Other values (4) | 4 | 1.9% |
정류장명
Text
MISSING
 
Distinct | 157 |
---|---|
Distinct (%) | 80.5% |
Missing | 21 |
Missing (%) | 9.7% |
Memory size | 1.8 KiB |
Value | Count | Frequency (%) |
수성주공아파트 | 4 | 2.1% |
두지 | 4 | 2.1% |
부전마을 | 3 | 1.5% |
대림apt | 2 | 1.0% |
효축마을 | 2 | 1.0% |
차단 | 2 | 1.0% |
신월 | 2 | 1.0% |
시기주공아파트 | 2 | 1.0% |
정읍여고 | 2 | 1.0% |
엄동 | 2 | 1.0% |
Other values (147) | 170 |
Most occurring characters
Value | Count | Frequency (%) |
정 | 25 | 3.2% |
신 | 22 | 2.8% |
동 | 22 | 2.8% |
장 | 19 | 2.4% |
교 | 19 | 2.4% |
아 | 18 | 2.3% |
파 | 17 | 2.2% |
원 | 17 | 2.2% |
마 | 16 | 2.0% |
시 | 16 | 2.0% |
Other values (189) | 594 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 756 | |
Uppercase Letter | 20 | 2.5% |
Decimal Number | 6 | 0.8% |
Other Punctuation | 2 | 0.3% |
Other Symbol | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
정 | 25 | 3.3% |
신 | 22 | 2.9% |
동 | 22 | 2.9% |
장 | 19 | 2.5% |
교 | 19 | 2.5% |
아 | 18 | 2.4% |
파 | 17 | 2.2% |
원 | 17 | 2.2% |
마 | 16 | 2.1% |
시 | 16 | 2.1% |
Other values (178) | 565 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 6 | |
A | 6 | |
P | 6 | |
C | 1 | 5.0% |
I | 1 | 5.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 3 | |
2 | 2 | |
3 | 1 | 16.7% |
Other Punctuation
Value | Count | Frequency (%) |
, | 1 | |
. | 1 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 757 | |
Latin | 20 | 2.5% |
Common | 8 | 1.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
정 | 25 | 3.3% |
신 | 22 | 2.9% |
동 | 22 | 2.9% |
장 | 19 | 2.5% |
교 | 19 | 2.5% |
아 | 18 | 2.4% |
파 | 17 | 2.2% |
원 | 17 | 2.2% |
마 | 16 | 2.1% |
시 | 16 | 2.1% |
Other values (179) | 566 |
Latin
Value | Count | Frequency (%) |
T | 6 | |
A | 6 | |
P | 6 | |
C | 1 | 5.0% |
I | 1 | 5.0% |
Common
Value | Count | Frequency (%) |
1 | 3 | |
2 | 2 | |
, | 1 | 12.5% |
. | 1 | 12.5% |
3 | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 756 | |
ASCII | 28 | 3.6% |
None | 1 | 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
정 | 25 | 3.3% |
신 | 22 | 2.9% |
동 | 22 | 2.9% |
장 | 19 | 2.5% |
교 | 19 | 2.5% |
아 | 18 | 2.4% |
파 | 17 | 2.2% |
원 | 17 | 2.2% |
마 | 16 | 2.1% |
시 | 16 | 2.1% |
Other values (178) | 565 |
ASCII
Value | Count | Frequency (%) |
T | 6 | |
A | 6 | |
P | 6 | |
1 | 3 | |
2 | 2 | 7.1% |
, | 1 | 3.6% |
. | 1 | 3.6% |
C | 1 | 3.6% |
I | 1 | 3.6% |
3 | 1 | 3.6% |
None
Value | Count | Frequency (%) |
㈜ | 1 |
데이터기준일자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.8 KiB |
2022-09-23 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022-09-23 |
---|---|
2nd row | 2022-09-23 |
3rd row | 2022-09-23 |
4th row | 2022-09-23 |
5th row | 2022-09-23 |
Common Values
Value | Count | Frequency (%) |
2022-09-23 | 216 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2022-09-23 | 216 |
관리번호 | 행정읍면동 | 도엽번호 | 도로구간번호 | 설치일자 | |
---|---|---|---|---|---|
관리번호 | 1.000 | 0.605 | 0.214 | 0.779 | 0.948 |
행정읍면동 | 0.605 | 1.000 | 0.360 | 0.870 | 0.447 |
도엽번호 | 0.214 | 0.360 | 1.000 | 0.079 | 1.000 |
도로구간번호 | 0.779 | 0.870 | 0.079 | 1.000 | 0.555 |
설치일자 | 0.948 | 0.447 | 1.000 | 0.555 | 1.000 |
관리번호 | 행정읍면동 | 도엽번호 | 도로구간번호 | 설치일자 | |
---|---|---|---|---|---|
관리번호 | 1.000 | 0.507 | 0.523 | 0.260 | 0.661 |
행정읍면동 | 0.507 | 1.000 | 0.581 | -0.418 | 0.247 |
도엽번호 | 0.523 | 0.581 | 1.000 | 0.003 | 0.972 |
도로구간번호 | 0.260 | -0.418 | 0.003 | 1.000 | 0.258 |
설치일자 | 0.661 | 0.247 | 0.972 | 0.258 | 1.000 |
관리번호 | 행정읍면동 | 도엽번호 | 도로구간번호 | 설치일자 | 정류장명 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|
0 | 159 | 4518032000 | 3560821871 | 800193 | 1900-01-01 | 차단 | 2022-09-23 |
1 | 157 | 4518032000 | 3560821871 | 800191 | 1900-01-01 | 차단 | 2022-09-23 |
2 | 158 | 4518032000 | 3560821883 | 800199 | 1900-01-01 | 엄동 | 2022-09-23 |
3 | 156 | 4518032000 | 3560821883 | 800164 | 1900-01-01 | 엄동 | 2022-09-23 |
4 | 153 | 4518032000 | 3560821993 | 800141 | 1900-01-01 | 옹암 | 2022-09-23 |
5 | 160 | 4518032000 | 3560821993 | 800141 | 1900-01-01 | 옹암 | 2022-09-23 |
6 | 154 | 4518032000 | 3560821894 | 800132 | 1900-01-01 | 천원 | 2022-09-23 |
7 | 155 | 4518032000 | 3560821803 | 800122 | 1900-01-01 | 원천 | 2022-09-23 |
8 | 161 | 4518032000 | 3560821803 | 800122 | 1900-01-01 | 원천 | 2022-09-23 |
9 | 989301 | 4518032000 | 3561202113 | 989306 | 2020-10-01 | <NA> | 2022-09-23 |
관리번호 | 행정읍면동 | 도엽번호 | 도로구간번호 | 설치일자 | 정류장명 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|
206 | 163 | 4518042000 | 3560815883 | 900030 | 1900-01-01 | 원촌정류장 | 2022-09-23 |
207 | 168 | 4518042000 | 3560815883 | 900030 | 1900-01-01 | 원촌승강장 | 2022-09-23 |
208 | 167 | 4518042000 | 3560815981 | 900033 | 1900-01-01 | 원촌승강장 | 2022-09-23 |
209 | 169 | 4518042000 | 3560815884 | 900071 | 1900-01-01 | 칠보초교 | 2022-09-23 |
210 | 170 | 4518042000 | 3560815884 | 900071 | 1900-01-01 | 칠보초교 | 2022-09-23 |
211 | 162 | 4518042000 | 3560815984 | 900042 | 1900-01-01 | <NA> | 2022-09-23 |
212 | 166 | 4518042000 | 3560820091 | 900042 | 1900-01-01 | 송산.남전 | 2022-09-23 |
213 | 171 | 4518042000 | 3560815991 | 900068 | 1900-01-01 | <NA> | 2022-09-23 |
214 | 165 | 4518042000 | 3560815994 | 900053 | 1900-01-01 | 시기정류장 | 2022-09-23 |
215 | 164 | 4518042000 | 3560815994 | 900050 | 1900-01-01 | 시기정류장 | 2022-09-23 |