Overview

Dataset statistics

Number of variables32
Number of observations10000
Missing cells56517
Missing cells (%)17.7%
Duplicate rows33
Duplicate rows (%)0.3%
Total size in memory2.7 MiB
Average record size in memory282.0 B

Variable types

Numeric11
Text4
Boolean4
Categorical10
Unsupported3

Dataset

Description경기도_BMS 정류소 이력정보
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=NP8AI9EKCQE2ECMG7FSG33258933&infSeq=1

Alerts

환승정류장유무 has constant value ""Constant
GIS유무 has constant value ""Constant
DRT유무 has constant value ""Constant
Dataset has 33 (0.3%) duplicate rowsDuplicates
중앙차로여부 is highly imbalanced (87.3%)Imbalance
정류장영문명 is highly imbalanced (99.3%)Imbalance
처리진행코드 is highly imbalanced (98.1%)Imbalance
사용구분 is highly imbalanced (85.6%)Imbalance
정류장유형명 is highly imbalanced (98.0%)Imbalance
환승역타입명 is highly imbalanced (63.7%)Imbalance
처리진행코드명 is highly imbalanced (98.1%)Imbalance
지역X좌표 has 2170 (21.7%) missing valuesMissing
지역Y좌표 has 2170 (21.7%) missing valuesMissing
링크아이디 has 353 (3.5%) missing valuesMissing
환승정류장유무 has 9309 (93.1%) missing valuesMissing
ARS아이디 has 1133 (11.3%) missing valuesMissing
기관코드 has 1514 (15.1%) missing valuesMissing
GIS유무 has 1512 (15.1%) missing valuesMissing
비고 has 6804 (68.0%) missing valuesMissing
행정동코드 has 1552 (15.5%) missing valuesMissing
정류장중국어명 has 10000 (100.0%) missing valuesMissing
정류장일본어명 has 10000 (100.0%) missing valuesMissing
정류장베트남명 has 10000 (100.0%) missing valuesMissing
정류장중국어명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
정류장일본어명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
정류장베트남명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
지역X좌표 has 5310 (53.1%) zerosZeros
지역Y좌표 has 5310 (53.1%) zerosZeros
링크아이디 has 1166 (11.7%) zerosZeros
정류장유형 has 9953 (99.5%) zerosZeros

Reproduction

Analysis started2023-12-10 21:11:06.567975
Analysis finished2023-12-10 21:11:07.745444
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

이력아이디
Real number (ℝ)

Distinct3198
Distinct (%)32.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.0000226 × 109
Minimum1 × 109
Maximum1.0003635 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:07.839345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1 × 109
5-th percentile1 × 109
Q11 × 109
median1 × 109
Q31.0000456 × 109
95-th percentile1.0000735 × 109
Maximum1.0003635 × 109
Range363465
Interquartile range (IQR)45572.5

Descriptive statistics

Standard deviation33133.548
Coefficient of variation (CV)3.3132799 × 10-5
Kurtosis28.104936
Mean1.0000226 × 109
Median Absolute Deviation (MAD)0
Skewness3.4090321
Sum1.0000226 × 1013
Variance1.097832 × 109
MonotonicityNot monotonic
2023-12-11T06:11:07.973089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1000000000 5132
51.3%
1000008720 9
 
0.1%
1000004570 7
 
0.1%
1000001410 6
 
0.1%
1000004540 6
 
0.1%
1000007240 6
 
0.1%
1000036780 6
 
0.1%
1000061510 6
 
0.1%
1000012840 6
 
0.1%
1000002970 6
 
0.1%
Other values (3188) 4810
48.1%
ValueCountFrequency (%)
1000000000 5132
51.3%
1000000841 1
 
< 0.1%
1000000940 2
 
< 0.1%
1000000950 6
 
0.1%
1000000960 4
 
< 0.1%
1000001150 2
 
< 0.1%
1000001340 1
 
< 0.1%
1000001350 2
 
< 0.1%
1000001360 2
 
< 0.1%
1000001390 1
 
< 0.1%
ValueCountFrequency (%)
1000363465 1
< 0.1%
1000363372 1
< 0.1%
1000363367 1
< 0.1%
1000363346 1
< 0.1%
1000363316 1
< 0.1%
1000363304 1
< 0.1%
1000363299 1
< 0.1%
1000363275 1
< 0.1%
1000363269 1
< 0.1%
1000363101 1
< 0.1%

정류장아이디
Real number (ℝ)

Distinct8458
Distinct (%)84.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0672259 × 108
Minimum1 × 108
Maximum5.0000024 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:08.304348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1 × 108
5-th percentile1.1200035 × 108
Q12.0600021 × 108
median2.1700067 × 108
Q32.2800106 × 108
95-th percentile2.390006 × 108
Maximum5.0000024 × 108
Range4.0000023 × 108
Interquartile range (IQR)22000856

Descriptive statistics

Standard deviation41558058
Coefficient of variation (CV)0.20103298
Kurtosis9.6674749
Mean2.0672259 × 108
Median Absolute Deviation (MAD)11000440
Skewness0.018533973
Sum2.0672259 × 1012
Variance1.7270722 × 1015
MonotonicityNot monotonic
2023-12-11T06:11:08.442466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202000027 5
 
0.1%
228001802 5
 
0.1%
214000597 5
 
0.1%
206000016 5
 
0.1%
203000162 5
 
0.1%
238000214 5
 
0.1%
233001216 5
 
0.1%
214000136 5
 
0.1%
221000063 5
 
0.1%
201000178 5
 
0.1%
Other values (8448) 9950
99.5%
ValueCountFrequency (%)
100000005 1
< 0.1%
100000018 1
< 0.1%
100000026 1
< 0.1%
100000028 1
< 0.1%
100000030 1
< 0.1%
100000035 1
< 0.1%
100000039 1
< 0.1%
100000043 1
< 0.1%
100000055 1
< 0.1%
100000057 1
< 0.1%
ValueCountFrequency (%)
500000239 1
< 0.1%
500000237 1
< 0.1%
500000230 1
< 0.1%
500000229 1
< 0.1%
500000228 1
< 0.1%
500000226 1
< 0.1%
500000213 1
< 0.1%
500000212 1
< 0.1%
500000199 1
< 0.1%
500000196 1
< 0.1%
Distinct7542
Distinct (%)75.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T06:11:08.701190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length26
Mean length6.5241
Min length1

Characters and Unicode

Total characters65241
Distinct characters721
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5816 ?
Unique (%)58.2%

Sample

1st row화정7교
2nd row안터입구
3rd row이마트
4th row새마을연수원
5th row나자로마을.신안아파트
ValueCountFrequency (%)
부대앞 20
 
0.2%
현대아파트 17
 
0.2%
이마트 12
 
0.1%
주공2단지 11
 
0.1%
명칭누락 11
 
0.1%
새마을금고 8
 
0.1%
대림아파트 7
 
0.1%
입구 7
 
0.1%
대우아파트 7
 
0.1%
우성아파트 7
 
0.1%
Other values (7549) 9953
98.9%
2023-12-11T06:11:09.112835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2037
 
3.1%
. 1991
 
3.1%
1525
 
2.3%
1505
 
2.3%
1489
 
2.3%
1381
 
2.1%
1362
 
2.1%
1233
 
1.9%
1101
 
1.7%
1086
 
1.7%
Other values (711) 50531
77.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59945
91.9%
Other Punctuation 2090
 
3.2%
Decimal Number 1982
 
3.0%
Uppercase Letter 586
 
0.9%
Close Punctuation 251
 
0.4%
Open Punctuation 248
 
0.4%
Lowercase Letter 66
 
0.1%
Space Separator 60
 
0.1%
Dash Punctuation 12
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2037
 
3.4%
1525
 
2.5%
1505
 
2.5%
1489
 
2.5%
1381
 
2.3%
1362
 
2.3%
1233
 
2.1%
1101
 
1.8%
1086
 
1.8%
1003
 
1.7%
Other values (659) 46223
77.1%
Uppercase Letter
ValueCountFrequency (%)
A 221
37.7%
K 64
 
10.9%
G 58
 
9.9%
S 54
 
9.2%
L 41
 
7.0%
C 36
 
6.1%
T 33
 
5.6%
B 17
 
2.9%
I 12
 
2.0%
P 12
 
2.0%
Other values (10) 38
 
6.5%
Lowercase Letter
ValueCountFrequency (%)
c 13
19.7%
e 12
18.2%
g 8
12.1%
i 8
12.1%
t 6
9.1%
k 5
 
7.6%
s 4
 
6.1%
b 2
 
3.0%
l 2
 
3.0%
y 2
 
3.0%
Other values (3) 4
 
6.1%
Decimal Number
ValueCountFrequency (%)
1 561
28.3%
2 531
26.8%
3 297
15.0%
4 149
 
7.5%
5 147
 
7.4%
6 78
 
3.9%
7 57
 
2.9%
8 56
 
2.8%
9 53
 
2.7%
0 53
 
2.7%
Other Punctuation
ValueCountFrequency (%)
. 1991
95.3%
, 86
 
4.1%
· 12
 
0.6%
? 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 251
100.0%
Open Punctuation
ValueCountFrequency (%)
( 248
100.0%
Space Separator
ValueCountFrequency (%)
60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59939
91.9%
Common 4644
 
7.1%
Latin 652
 
1.0%
Han 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2037
 
3.4%
1525
 
2.5%
1505
 
2.5%
1489
 
2.5%
1381
 
2.3%
1362
 
2.3%
1233
 
2.1%
1101
 
1.8%
1086
 
1.8%
1003
 
1.7%
Other values (653) 46217
77.1%
Latin
ValueCountFrequency (%)
A 221
33.9%
K 64
 
9.8%
G 58
 
8.9%
S 54
 
8.3%
L 41
 
6.3%
C 36
 
5.5%
T 33
 
5.1%
B 17
 
2.6%
c 13
 
2.0%
I 12
 
1.8%
Other values (23) 103
15.8%
Common
ValueCountFrequency (%)
. 1991
42.9%
1 561
 
12.1%
2 531
 
11.4%
3 297
 
6.4%
) 251
 
5.4%
( 248
 
5.3%
4 149
 
3.2%
5 147
 
3.2%
, 86
 
1.9%
6 78
 
1.7%
Other values (9) 305
 
6.6%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59939
91.9%
ASCII 5283
 
8.1%
None 12
 
< 0.1%
CJK 5
 
< 0.1%
Box Drawing 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2037
 
3.4%
1525
 
2.5%
1505
 
2.5%
1489
 
2.5%
1381
 
2.3%
1362
 
2.3%
1233
 
2.1%
1101
 
1.8%
1086
 
1.8%
1003
 
1.7%
Other values (653) 46217
77.1%
ASCII
ValueCountFrequency (%)
. 1991
37.7%
1 561
 
10.6%
2 531
 
10.1%
3 297
 
5.6%
) 251
 
4.8%
( 248
 
4.7%
A 221
 
4.2%
4 149
 
2.8%
5 147
 
2.8%
, 86
 
1.6%
Other values (40) 801
15.2%
None
ValueCountFrequency (%)
· 12
100.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

Y좌표
Real number (ℝ)

Distinct8803
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean202980.62
Minimum144710.5
Maximum268433
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:09.257396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum144710.5
5-th percentile174960
Q1187056.25
median203562
Q3211769.25
95-th percentile245570.61
Maximum268433
Range123722.5
Interquartile range (IQR)24713

Descriptive statistics

Standard deviation20438.126
Coefficient of variation (CV)0.10069004
Kurtosis0.57619598
Mean202980.62
Median Absolute Deviation (MAD)12341.198
Skewness0.70388152
Sum2.0298062 × 109
Variance4.17717 × 108
MonotonicityNot monotonic
2023-12-11T06:11:09.412760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
206008.0 7
 
0.1%
202879.0 5
 
0.1%
192715.0 5
 
0.1%
207305.0 5
 
0.1%
205695.0 5
 
0.1%
206773.0 5
 
0.1%
212288.0 5
 
0.1%
205985.0 5
 
0.1%
200609.0 4
 
< 0.1%
186857.0 4
 
< 0.1%
Other values (8793) 9950
99.5%
ValueCountFrequency (%)
144710.49917 1
< 0.1%
146353.48366 1
< 0.1%
146402.64 1
< 0.1%
148165.6457 1
< 0.1%
148674.83759 1
< 0.1%
148815.0 1
< 0.1%
148968.6807 1
< 0.1%
149060.02313 1
< 0.1%
151440.8543 1
< 0.1%
151703.7255 1
< 0.1%
ValueCountFrequency (%)
268433.0 1
< 0.1%
268390.6104 1
< 0.1%
268368.9889 1
< 0.1%
268250.0 1
< 0.1%
268102.8746 1
< 0.1%
268058.8304 1
< 0.1%
267970.106 1
< 0.1%
267970.0 1
< 0.1%
267966.0 1
< 0.1%
267937.5 1
< 0.1%

X좌표
Real number (ℝ)

Distinct8855
Distinct (%)88.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean439899.01
Minimum379053.93
Maximum528731
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:09.564124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum379053.93
5-th percentile391394.93
Q1421630.75
median440929.84
Q3458672.67
95-th percentile484468.49
Maximum528731
Range149677.07
Interquartile range (IQR)37041.923

Descriptive statistics

Standard deviation26416.139
Coefficient of variation (CV)0.060050463
Kurtosis-0.11268096
Mean439899.01
Median Absolute Deviation (MAD)18732.844
Skewness0.11494901
Sum4.3989901 × 109
Variance6.9781241 × 108
MonotonicityNot monotonic
2023-12-11T06:11:09.719143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
432161.0 5
 
0.1%
508881.0 5
 
0.1%
417680.0 5
 
0.1%
464336.0 5
 
0.1%
423526.0 5
 
0.1%
448966.0 5
 
0.1%
455389.0 5
 
0.1%
384889.0 5
 
0.1%
388198.0 5
 
0.1%
444572.0 5
 
0.1%
Other values (8845) 9950
99.5%
ValueCountFrequency (%)
379053.93354 1
< 0.1%
379092.0 1
< 0.1%
379271.6902 1
< 0.1%
379679.07505 1
< 0.1%
379692.0 1
< 0.1%
379819.0 1
< 0.1%
379830.0 1
< 0.1%
379867.0 1
< 0.1%
380402.0 1
< 0.1%
380632.0 1
< 0.1%
ValueCountFrequency (%)
528731.0 1
< 0.1%
526299.0 1
< 0.1%
526181.0 1
< 0.1%
525687.0 1
< 0.1%
524391.0 1
< 0.1%
524390.0 1
< 0.1%
524352.0 1
< 0.1%
523346.0 1
< 0.1%
523339.0 1
< 0.1%
523153.0 1
< 0.1%

지역X좌표
Real number (ℝ)

MISSING  ZEROS 

Distinct2446
Distinct (%)31.2%
Missing2170
Missing (%)21.7%
Infinite0
Infinite (%)0.0%
Mean4076.909
Minimum0
Maximum12746.336
Zeros5310
Zeros (%)53.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:09.872557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q312650.644
95-th percentile12719.448
Maximum12746.336
Range12746.336
Interquartile range (IQR)12650.644

Descriptive statistics

Standard deviation5925.2431
Coefficient of variation (CV)1.4533665
Kurtosis-1.4144963
Mean4076.909
Median Absolute Deviation (MAD)0
Skewness0.76538862
Sum31922198
Variance35108506
MonotonicityNot monotonic
2023-12-11T06:11:10.012422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 5310
53.1%
12703.263 3
 
< 0.1%
12704.536 2
 
< 0.1%
12705.793 2
 
< 0.1%
12658.769 2
 
< 0.1%
12702.844 2
 
< 0.1%
12648.268 2
 
< 0.1%
12657.054 2
 
< 0.1%
12648.774 2
 
< 0.1%
12708.386 2
 
< 0.1%
Other values (2436) 2501
25.0%
(Missing) 2170
21.7%
ValueCountFrequency (%)
0.0 5310
53.1%
127.04088 1
 
< 0.1%
127.04098 1
 
< 0.1%
127.10357 1
 
< 0.1%
127.10395 1
 
< 0.1%
12634.161 1
 
< 0.1%
12634.29 1
 
< 0.1%
12634.618 1
 
< 0.1%
12634.662 1
 
< 0.1%
12634.888 1
 
< 0.1%
ValueCountFrequency (%)
12746.336 1
< 0.1%
12746.191 1
< 0.1%
12746.189 1
< 0.1%
12746.107 1
< 0.1%
12745.964 1
< 0.1%
12745.894 1
< 0.1%
12745.759 1
< 0.1%
12745.758 1
< 0.1%
12745.334 1
< 0.1%
12744.809 1
< 0.1%

지역Y좌표
Real number (ℝ)

MISSING  ZEROS 

Distinct2465
Distinct (%)31.5%
Missing2170
Missing (%)21.7%
Infinite0
Infinite (%)0.0%
Mean1196.9462
Minimum0
Maximum3815.686
Zeros5310
Zeros (%)53.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:10.166255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33714.4223
95-th percentile3741.905
Maximum3815.686
Range3815.686
Interquartile range (IQR)3714.4223

Descriptive statistics

Standard deviation1739.648
Coefficient of variation (CV)1.4534053
Kurtosis-1.4139822
Mean1196.9462
Median Absolute Deviation (MAD)0
Skewness0.76556212
Sum9372089.1
Variance3026375.1
MonotonicityNot monotonic
2023-12-11T06:11:10.382943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 5310
53.1%
3718.816 3
 
< 0.1%
3715.267 3
 
< 0.1%
3718.817 2
 
< 0.1%
3722.862 2
 
< 0.1%
3726.031 2
 
< 0.1%
3739.271 2
 
< 0.1%
3720.275 2
 
< 0.1%
3742.285 2
 
< 0.1%
3719.87 2
 
< 0.1%
Other values (2455) 2500
25.0%
(Missing) 2170
21.7%
ValueCountFrequency (%)
0.0 5310
53.1%
37.766 1
 
< 0.1%
37.76607 1
 
< 0.1%
37.82218 1
 
< 0.1%
37.82278 1
 
< 0.1%
3654.785 1
 
< 0.1%
3654.91 1
 
< 0.1%
3655.132 1
 
< 0.1%
3655.222 1
 
< 0.1%
3655.507 1
 
< 0.1%
ValueCountFrequency (%)
3815.686 1
< 0.1%
3814.374 1
< 0.1%
3814.306 1
< 0.1%
3813.343 2
< 0.1%
3813.316 1
< 0.1%
3812.782 1
< 0.1%
3812.665 1
< 0.1%
3812.464 1
< 0.1%
3812.047 1
< 0.1%
3811.408 1
< 0.1%

링크아이디
Real number (ℝ)

MISSING  ZEROS 

Distinct6223
Distinct (%)64.5%
Missing353
Missing (%)3.5%
Infinite0
Infinite (%)0.0%
Mean1.9285044 × 109
Minimum0
Maximum2.8800001 × 109
Zeros1166
Zeros (%)11.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:10.515291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12.0601422 × 109
median2.1800464 × 109
Q32.2801875 × 109
95-th percentile2.3900506 × 109
Maximum2.8800001 × 109
Range2.8800001 × 109
Interquartile range (IQR)2.200453 × 108

Descriptive statistics

Standard deviation7.3581879 × 108
Coefficient of variation (CV)0.38154893
Kurtosis2.77027
Mean1.9285044 × 109
Median Absolute Deviation (MAD)1.099848 × 108
Skewness-2.1183698
Sum1.8604282 × 1013
Variance5.4142929 × 1017
MonotonicityNot monotonic
2023-12-11T06:11:10.663248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1166
 
11.7%
2060121900 9
 
0.1%
2390020200 8
 
0.1%
2180021600 8
 
0.1%
2030056200 7
 
0.1%
2210043900 7
 
0.1%
2390003000 7
 
0.1%
2390003300 7
 
0.1%
2280219800 6
 
0.1%
2050008500 6
 
0.1%
Other values (6213) 8416
84.2%
(Missing) 353
 
3.5%
ValueCountFrequency (%)
0 1166
11.7%
1000001700 1
 
< 0.1%
1000007000 1
 
< 0.1%
1010009800 1
 
< 0.1%
1020009800 1
 
< 0.1%
1020009900 2
 
< 0.1%
1020029900 1
 
< 0.1%
1030016300 1
 
< 0.1%
1040005700 1
 
< 0.1%
1040006100 1
 
< 0.1%
ValueCountFrequency (%)
2880000100 3
< 0.1%
2850004000 1
 
< 0.1%
2850002500 1
 
< 0.1%
2850002400 1
 
< 0.1%
2850002100 1
 
< 0.1%
2850002000 1
 
< 0.1%
2850001000 1
 
< 0.1%
2850000200 1
 
< 0.1%
2850000100 1
 
< 0.1%
2810001000 3
< 0.1%

정류장유형
Real number (ℝ)

ZEROS 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.962
Minimum0
Maximum1100
Zeros9953
Zeros (%)99.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:10.808899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum1100
Range1100
Interquartile range (IQR)0

Descriptive statistics

Standard deviation53.474597
Coefficient of variation (CV)18.053544
Kurtosis363.46627
Mean2.962
Median Absolute Deviation (MAD)0
Skewness19.044051
Sum29620
Variance2859.5325
MonotonicityNot monotonic
2023-12-11T06:11:10.918540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 9953
99.5%
1000 19
 
0.2%
100 19
 
0.2%
1100 7
 
0.1%
1010 1
 
< 0.1%
10 1
 
< 0.1%
ValueCountFrequency (%)
0 9953
99.5%
10 1
 
< 0.1%
100 19
 
0.2%
1000 19
 
0.2%
1010 1
 
< 0.1%
1100 7
 
0.1%
ValueCountFrequency (%)
1100 7
 
0.1%
1010 1
 
< 0.1%
1000 19
 
0.2%
100 19
 
0.2%
10 1
 
< 0.1%
0 9953
99.5%

환승정류장유무
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)0.1%
Missing9309
Missing (%)93.1%
Memory size97.7 KiB
False
 
691
(Missing)
9309 
ValueCountFrequency (%)
False 691
 
6.9%
(Missing) 9309
93.1%
2023-12-11T06:11:11.010579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

중앙차로여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9826 
True
 
174
ValueCountFrequency (%)
False 9826
98.3%
True 174
 
1.7%
2023-12-11T06:11:11.087534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

정류장영문명
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9994 
 
6

Length

Max length4
Median length4
Mean length3.9982
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9994
99.9%
6
 
0.1%

Length

2023-12-11T06:11:11.191728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:11.285128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9994
100.0%

ARS아이디
Text

MISSING 

Distinct7359
Distinct (%)83.0%
Missing1133
Missing (%)11.3%
Memory size156.2 KiB
2023-12-11T06:11:11.639112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.0009022
Min length5

Characters and Unicode

Total characters44343
Distinct characters19
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6122 ?
Unique (%)69.0%

Sample

1st row18648
2nd row47712
3rd row07140
4th row07458
5th row27139
ValueCountFrequency (%)
22094 5
 
0.1%
03093 5
 
0.1%
43279 5
 
0.1%
18496 5
 
0.1%
04235 5
 
0.1%
36841 5
 
0.1%
15138 5
 
0.1%
07014 5
 
0.1%
02009 5
 
0.1%
15678 5
 
0.1%
Other values (7349) 8817
99.4%
2023-12-11T06:11:12.174770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 6089
13.7%
0 5850
13.2%
2 5730
12.9%
4 5267
11.9%
3 4513
10.2%
5 3547
8.0%
7 3438
7.8%
9 3421
7.7%
8 3382
7.6%
6 3093
7.0%
Other values (9) 13
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 44330
> 99.9%
Other Letter 6
 
< 0.1%
Uppercase Letter 5
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 6089
13.7%
0 5850
13.2%
2 5730
12.9%
4 5267
11.9%
3 4513
10.2%
5 3547
8.0%
7 3438
7.8%
9 3421
7.7%
8 3382
7.6%
6 3093
7.0%
Uppercase Letter
ValueCountFrequency (%)
V 1
20.0%
A 1
20.0%
L 1
20.0%
U 1
20.0%
E 1
20.0%
Other Letter
ValueCountFrequency (%)
3
50.0%
3
50.0%
Other Punctuation
ValueCountFrequency (%)
# 1
50.0%
! 1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 44332
> 99.9%
Hangul 6
 
< 0.1%
Latin 5
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 6089
13.7%
0 5850
13.2%
2 5730
12.9%
4 5267
11.9%
3 4513
10.2%
5 3547
8.0%
7 3438
7.8%
9 3421
7.7%
8 3382
7.6%
6 3093
7.0%
Other values (2) 2
 
< 0.1%
Latin
ValueCountFrequency (%)
V 1
20.0%
A 1
20.0%
L 1
20.0%
U 1
20.0%
E 1
20.0%
Hangul
ValueCountFrequency (%)
3
50.0%
3
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 44337
> 99.9%
Hangul 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 6089
13.7%
0 5850
13.2%
2 5730
12.9%
4 5267
11.9%
3 4513
10.2%
5 3547
8.0%
7 3438
7.8%
9 3421
7.7%
8 3382
7.6%
6 3093
7.0%
Other values (7) 7
 
< 0.1%
Hangul
ValueCountFrequency (%)
3
50.0%
3
50.0%

기관코드
Real number (ℝ)

MISSING 

Distinct88
Distinct (%)1.0%
Missing1514
Missing (%)15.1%
Infinite0
Infinite (%)0.0%
Mean4.089943 × 109
Minimum1 × 109
Maximum5 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:12.322975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1 × 109
5-th percentile4.1113 × 109
Q14.121 × 109
median4.131 × 109
Q34.148 × 109
95-th percentile4.182 × 109
Maximum5 × 109
Range4 × 109
Interquartile range (IQR)27000000

Descriptive statistics

Standard deviation3.8057039 × 108
Coefficient of variation (CV)0.093050292
Kurtosis53.362483
Mean4.089943 × 109
Median Absolute Deviation (MAD)15100000
Skewness-7.2582725
Sum3.4707256 × 1013
Variance1.4483382 × 1017
MonotonicityNot monotonic
2023-12-11T06:11:12.467023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4136000000 643
 
6.4%
4122000000 604
 
6.0%
4159000000 520
 
5.2%
4183000000 376
 
3.8%
4127300000 350
 
3.5%
4113500000 337
 
3.4%
4182000000 330
 
3.3%
4128100000 313
 
3.1%
4146100000 294
 
2.9%
4115000000 274
 
2.7%
Other values (78) 4445
44.5%
(Missing) 1514
 
15.1%
ValueCountFrequency (%)
1000000000 15
0.1%
1100000000 7
0.1%
1114000000 2
 
< 0.1%
1117000000 2
 
< 0.1%
1123000000 5
 
0.1%
1126000000 5
 
0.1%
1132000000 1
 
< 0.1%
1135000000 2
 
< 0.1%
1138000000 5
 
0.1%
1141000000 2
 
< 0.1%
ValueCountFrequency (%)
5000000000 27
 
0.3%
4183000000 376
3.8%
4182025000 2
 
< 0.1%
4182000000 330
3.3%
4180000000 201
2.0%
4167034023 2
 
< 0.1%
4167000000 227
2.3%
4165000000 15
 
0.1%
4163010200 2
 
< 0.1%
4163000000 196
2.0%

GIS유무
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)< 0.1%
Missing1512
Missing (%)15.1%
Memory size97.7 KiB
True
8488 
(Missing)
1512 
ValueCountFrequency (%)
True 8488
84.9%
(Missing) 1512
 
15.1%
2023-12-11T06:11:12.600770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

비고
Text

MISSING 

Distinct2283
Distinct (%)71.4%
Missing6804
Missing (%)68.0%
Memory size156.2 KiB
2023-12-11T06:11:12.763659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length7.0860451
Min length1

Characters and Unicode

Total characters22647
Distinct characters533
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2047 ?
Unique (%)64.0%

Sample

1st row이마트
2nd row나자로마을.신안A
3rd row잉크테크
4th row우성고.개나리A
5th row20160628:8로수정
ValueCountFrequency (%)
20160628:8로수정 503
 
15.7%
삭제예정8이하 131
 
4.1%
20160628:원복 22
 
0.7%
명칭누락 6
 
0.2%
이마트 5
 
0.2%
구종점 4
 
0.1%
안말 4
 
0.1%
부대앞 4
 
0.1%
시외버스터미널 4
 
0.1%
마을회관 4
 
0.1%
Other values (2272) 2510
78.5%
2023-12-11T06:11:13.127234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 1191
 
5.3%
8 1171
 
5.2%
6 1070
 
4.7%
0 1065
 
4.7%
774
 
3.4%
683
 
3.0%
1 665
 
2.9%
625
 
2.8%
. 538
 
2.4%
: 526
 
2.3%
Other values (523) 14339
63.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15700
69.3%
Decimal Number 5324
 
23.5%
Other Punctuation 1064
 
4.7%
Uppercase Letter 434
 
1.9%
Close Punctuation 56
 
0.2%
Open Punctuation 55
 
0.2%
Lowercase Letter 9
 
< 0.1%
Dash Punctuation 3
 
< 0.1%
Math Symbol 1
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
774
 
4.9%
683
 
4.4%
625
 
4.0%
525
 
3.3%
363
 
2.3%
326
 
2.1%
271
 
1.7%
271
 
1.7%
268
 
1.7%
256
 
1.6%
Other values (486) 11338
72.2%
Uppercase Letter
ValueCountFrequency (%)
A 354
81.6%
G 20
 
4.6%
L 13
 
3.0%
S 10
 
2.3%
C 9
 
2.1%
T 6
 
1.4%
K 6
 
1.4%
P 5
 
1.2%
B 3
 
0.7%
V 2
 
0.5%
Other values (6) 6
 
1.4%
Decimal Number
ValueCountFrequency (%)
2 1191
22.4%
8 1171
22.0%
6 1070
20.1%
0 1065
20.0%
1 665
12.5%
3 69
 
1.3%
5 39
 
0.7%
4 36
 
0.7%
7 10
 
0.2%
9 8
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
c 4
44.4%
i 2
22.2%
s 2
22.2%
b 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
. 538
50.6%
: 526
49.4%
Close Punctuation
ValueCountFrequency (%)
) 56
100.0%
Open Punctuation
ValueCountFrequency (%)
( 55
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Math Symbol
ValueCountFrequency (%)
> 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15700
69.3%
Common 6504
28.7%
Latin 443
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
774
 
4.9%
683
 
4.4%
625
 
4.0%
525
 
3.3%
363
 
2.3%
326
 
2.1%
271
 
1.7%
271
 
1.7%
268
 
1.7%
256
 
1.6%
Other values (486) 11338
72.2%
Latin
ValueCountFrequency (%)
A 354
79.9%
G 20
 
4.5%
L 13
 
2.9%
S 10
 
2.3%
C 9
 
2.0%
T 6
 
1.4%
K 6
 
1.4%
P 5
 
1.1%
c 4
 
0.9%
B 3
 
0.7%
Other values (10) 13
 
2.9%
Common
ValueCountFrequency (%)
2 1191
18.3%
8 1171
18.0%
6 1070
16.5%
0 1065
16.4%
1 665
10.2%
. 538
8.3%
: 526
8.1%
3 69
 
1.1%
) 56
 
0.9%
( 55
 
0.8%
Other values (7) 98
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15700
69.3%
ASCII 6947
30.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 1191
17.1%
8 1171
16.9%
6 1070
15.4%
0 1065
15.3%
1 665
9.6%
. 538
7.7%
: 526
7.6%
A 354
 
5.1%
3 69
 
1.0%
) 56
 
0.8%
Other values (27) 242
 
3.5%
Hangul
ValueCountFrequency (%)
774
 
4.9%
683
 
4.4%
625
 
4.0%
525
 
3.3%
363
 
2.3%
326
 
2.1%
271
 
1.7%
271
 
1.7%
268
 
1.7%
256
 
1.6%
Other values (486) 11338
72.2%

업무코드
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
90
5018 
0
3767 
80
1074 
100
 
141

Length

Max length3
Median length2
Mean length1.6374
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row90
2nd row90
3rd row0
4th row90
5th row0

Common Values

ValueCountFrequency (%)
90 5018
50.2%
0 3767
37.7%
80 1074
 
10.7%
100 141
 
1.4%

Length

2023-12-11T06:11:13.310852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:13.446627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
90 5018
50.2%
0 3767
37.7%
80 1074
 
10.7%
100 141
 
1.4%
Distinct96
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T06:11:13.675236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length8.739
Min length5

Characters and Unicode

Total characters87390
Distinct characters48
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)0.2%

Sample

1st rowiansan77
2nd rowprkseo03
3rd rowBASIC_INFO
4th rowb1268019
5th rowBASIC_INFO
ValueCountFrequency (%)
basic_info 2826
28.3%
auto_update 1325
13.2%
00000000 1117
 
11.2%
ckdejr12 1074
 
10.7%
smchoi 996
 
10.0%
kyung486 577
 
5.8%
prkseo03 400
 
4.0%
iansan77 232
 
2.3%
ad205712 110
 
1.1%
dasom296 95
 
0.9%
Other values (86) 1248
12.5%
2023-12-11T06:11:14.322695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10222
 
11.7%
I 5652
 
6.5%
A 5586
 
6.4%
O 4151
 
4.7%
_ 4151
 
4.7%
F 2826
 
3.2%
B 2826
 
3.2%
N 2826
 
3.2%
S 2826
 
3.2%
C 2826
 
3.2%
Other values (38) 43498
49.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 38904
44.5%
Lowercase Letter 24579
28.1%
Decimal Number 19756
22.6%
Connector Punctuation 4151
 
4.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
k 2342
 
9.5%
c 2180
 
8.9%
s 1998
 
8.1%
e 1786
 
7.3%
o 1777
 
7.2%
r 1681
 
6.8%
n 1522
 
6.2%
j 1437
 
5.8%
i 1413
 
5.7%
d 1231
 
5.0%
Other values (14) 7212
29.3%
Uppercase Letter
ValueCountFrequency (%)
I 5652
14.5%
A 5586
14.4%
O 4151
10.7%
F 2826
7.3%
B 2826
7.3%
N 2826
7.3%
S 2826
7.3%
C 2826
7.3%
U 2650
6.8%
T 2650
6.8%
Other values (3) 4085
10.5%
Decimal Number
ValueCountFrequency (%)
0 10222
51.7%
2 2166
 
11.0%
1 2117
 
10.7%
7 1202
 
6.1%
4 873
 
4.4%
6 823
 
4.2%
8 820
 
4.2%
3 577
 
2.9%
9 549
 
2.8%
5 407
 
2.1%
Connector Punctuation
ValueCountFrequency (%)
_ 4151
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 63483
72.6%
Common 23907
 
27.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
I 5652
 
8.9%
A 5586
 
8.8%
O 4151
 
6.5%
F 2826
 
4.5%
B 2826
 
4.5%
N 2826
 
4.5%
S 2826
 
4.5%
C 2826
 
4.5%
U 2650
 
4.2%
T 2650
 
4.2%
Other values (27) 28664
45.2%
Common
ValueCountFrequency (%)
0 10222
42.8%
_ 4151
17.4%
2 2166
 
9.1%
1 2117
 
8.9%
7 1202
 
5.0%
4 873
 
3.7%
6 823
 
3.4%
8 820
 
3.4%
3 577
 
2.4%
9 549
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 87390
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10222
 
11.7%
I 5652
 
6.5%
A 5586
 
6.4%
O 4151
 
4.7%
_ 4151
 
4.7%
F 2826
 
3.2%
B 2826
 
3.2%
N 2826
 
3.2%
S 2826
 
3.2%
C 2826
 
3.2%
Other values (38) 43498
49.8%

등록일자
Real number (ℝ)

Distinct1537
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0104881 × 1013
Minimum2.0070902 × 1013
Maximum2.0170323 × 1013
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:14.502250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0070902 × 1013
5-th percentile2.0070902 × 1013
Q12.0070902 × 1013
median2.0100317 × 1013
Q32.0131022 × 1013
95-th percentile2.0160614 × 1013
Maximum2.0170323 × 1013
Range9.942113 × 1010
Interquartile range (IQR)6.0120088 × 1010

Descriptive statistics

Standard deviation3.1220749 × 1010
Coefficient of variation (CV)0.001552894
Kurtosis-1.0623374
Mean2.0104881 × 1013
Median Absolute Deviation (MAD)2.9415 × 1010
Skewness0.51122572
Sum2.0104881 × 1017
Variance9.7473515 × 1020
MonotonicityNot monotonic
2023-12-11T06:11:14.667800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20070902000000 2480
24.8%
20160614155346 1312
 
13.1%
20100624212907 313
 
3.1%
20130516000000 100
 
1.0%
20121203000000 83
 
0.8%
20131203000000 59
 
0.6%
20131129000000 57
 
0.6%
20081029000000 49
 
0.5%
20080711000000 46
 
0.5%
20131209000000 46
 
0.5%
Other values (1527) 5455
54.5%
ValueCountFrequency (%)
20070902000000 2480
24.8%
20070902161722 33
 
0.3%
20070906000000 6
 
0.1%
20070916000000 5
 
0.1%
20070918000000 1
 
< 0.1%
20070920000000 3
 
< 0.1%
20071002000000 1
 
< 0.1%
20071008000000 1
 
< 0.1%
20071009000000 10
 
0.1%
20071011000000 1
 
< 0.1%
ValueCountFrequency (%)
20170323130458 1
< 0.1%
20170321134123 1
< 0.1%
20170321133919 1
< 0.1%
20170321113538 1
< 0.1%
20170320111637 1
< 0.1%
20170317090054 1
< 0.1%
20170316174635 1
< 0.1%
20170316143414 1
< 0.1%
20170316142813 1
< 0.1%
20170315111919 1
< 0.1%

처리진행코드
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
31
9982 
1
 
18

Length

Max length2
Median length2
Mean length1.9982
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row31
2nd row31
3rd row31
4th row31
5th row31

Common Values

ValueCountFrequency (%)
31 9982
99.8%
1 18
 
0.2%

Length

2023-12-11T06:11:14.815387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:14.915679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
31 9982
99.8%
1 18
 
0.2%

사용구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
8
9578 
<NA>
 
318
9
 
100
0
 
4

Length

Max length4
Median length1
Mean length1.0954
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row8
2nd row8
3rd row8
4th row8
5th row8

Common Values

ValueCountFrequency (%)
8 9578
95.8%
<NA> 318
 
3.2%
9 100
 
1.0%
0 4
 
< 0.1%

Length

2023-12-11T06:11:15.026069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:15.144172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
8 9578
95.8%
na 318
 
3.2%
9 100
 
1.0%
0 4
 
< 0.1%

표지판유형
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4353 
0
2246 
2
1886 
<NA>
1515 

Length

Max length4
Median length1
Mean length1.4545
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 4353
43.5%
0 2246
22.5%
2 1886
18.9%
<NA> 1515
 
15.2%

Length

2023-12-11T06:11:15.255764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:15.360251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4353
43.5%
0 2246
22.5%
2 1886
18.9%
na 1515
 
15.2%

행정동코드
Real number (ℝ)

MISSING 

Distinct626
Distinct (%)7.4%
Missing1552
Missing (%)15.5%
Infinite0
Infinite (%)0.0%
Mean4.0953627 × 109
Minimum0
Maximum4.183041 × 109
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:11:15.492739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4.1115129 × 109
Q14.1210104 × 109
median4.1310102 × 109
Q34.1465107 × 109
95-th percentile4.182035 × 109
Maximum4.183041 × 109
Range4.183041 × 109
Interquartile range (IQR)25500300

Descriptive statistics

Standard deviation3.4590945 × 108
Coefficient of variation (CV)0.084463691
Kurtosis66.546326
Mean4.0953627 × 109
Median Absolute Deviation (MAD)14000850
Skewness-8.1812686
Sum3.4597624 × 1013
Variance1.1965335 × 1017
MonotonicityNot monotonic
2023-12-11T06:11:15.628143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4136025600 114
 
1.1%
4139013200 103
 
1.0%
4136025300 97
 
1.0%
4127110300 77
 
0.8%
4122025000 74
 
0.7%
4122025300 72
 
0.7%
4182035000 69
 
0.7%
4182025000 68
 
0.7%
4122025600 67
 
0.7%
4182031000 65
 
0.7%
Other values (616) 7642
76.4%
(Missing) 1552
 
15.5%
ValueCountFrequency (%)
0 1
 
< 0.1%
1114054000 1
 
< 0.1%
1114062000 1
 
< 0.1%
1117068500 2
< 0.1%
1123053600 1
 
< 0.1%
1123054500 1
 
< 0.1%
1123070500 1
 
< 0.1%
1123072000 1
 
< 0.1%
1126058000 1
 
< 0.1%
1126065500 4
< 0.1%
ValueCountFrequency (%)
4183041000 28
0.3%
4183040000 58
0.6%
4183039500 33
0.3%
4183038000 35
0.4%
4183037000 22
 
0.2%
4183036000 26
0.3%
4183035000 34
0.3%
4183034000 17
 
0.2%
4183033000 34
0.3%
4183032000 26
0.3%

정류장중국어명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

정류장일본어명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

정류장베트남명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

DRT유무
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
10000 
ValueCountFrequency (%)
False 10000
100.0%
2023-12-11T06:11:15.721412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

정류장유형명
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미지정
9953 
시내
 
19
마을
 
19
시내,마을
 
7
시내,시외
 
1

Length

Max length5
Median length3
Mean length2.9977
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row미지정
2nd row미지정
3rd row미지정
4th row미지정
5th row미지정

Common Values

ValueCountFrequency (%)
미지정 9953
99.5%
시내 19
 
0.2%
마을 19
 
0.2%
시내,마을 7
 
0.1%
시내,시외 1
 
< 0.1%
시외 1
 
< 0.1%

Length

2023-12-11T06:11:15.828887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:15.949172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미지정 9953
99.5%
시내 19
 
0.2%
마을 19
 
0.2%
시내,마을 7
 
0.1%
시내,시외 1
 
< 0.1%
시외 1
 
< 0.1%

환승역타입명
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9309 
일반
 
691

Length

Max length4
Median length4
Mean length3.8618
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9309
93.1%
일반 691
 
6.9%

Length

2023-12-11T06:11:16.069377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:16.166813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9309
93.1%
일반 691
 
6.9%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
표지판 있음
4353 
표지판 없음
2246 
쉘터
1886 
<NA>
1515 

Length

Max length6
Median length6
Mean length4.9426
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row표지판 있음
2nd row표지판 있음
3rd row표지판 있음
4th row쉘터
5th row표지판 있음

Common Values

ValueCountFrequency (%)
표지판 있음 4353
43.5%
표지판 없음 2246
22.5%
쉘터 1886
18.9%
<NA> 1515
 
15.2%

Length

2023-12-11T06:11:16.291744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:16.428804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
표지판 6599
39.8%
있음 4353
26.2%
없음 2246
 
13.5%
쉘터 1886
 
11.4%
na 1515
 
9.1%

처리진행코드명
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
승인
9982 
작성중
 
18

Length

Max length3
Median length2
Mean length2.0018
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row승인
2nd row승인
3rd row승인
4th row승인
5th row승인

Common Values

ValueCountFrequency (%)
승인 9982
99.8%
작성중 18
 
0.2%

Length

2023-12-11T06:11:16.548921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:16.632545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
승인 9982
99.8%
작성중 18
 
0.2%

업무코드명
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6233 
기초정보입력
3767 

Length

Max length6
Median length4
Mean length4.7534
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row기초정보입력
4th row<NA>
5th row기초정보입력

Common Values

ValueCountFrequency (%)
<NA> 6233
62.3%
기초정보입력 3767
37.7%

Length

2023-12-11T06:11:16.735694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:11:16.836243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 6233
62.3%
기초정보입력 3767
37.7%

Sample

이력아이디정류장아이디정류장명Y좌표X좌표지역X좌표지역Y좌표링크아이디정류장유형환승정류장유무중앙차로여부정류장영문명ARS아이디기관코드GIS유무비고업무코드등록아이디등록일자처리진행코드사용구분표지판유형행정동코드정류장중국어명정류장일본어명정류장베트남명DRT유무정류장유형명환승역타입명표지판유형명처리진행코드명업무코드명
315741000042590217000691화정7교183885.28021427248.343340.00.021700948000<NA>N<NA>186484127300000Y<NA>90iansan772011051900000031814127310200<NA><NA><NA>N미지정<NA>표지판 있음승인<NA>
118431000002960228001742안터입구218411.0412837.00.00.022800670000<NA>N<NA>477124146100000Y<NA>90prkseo032008052800000031814146110400<NA><NA><NA>N미지정<NA>표지판 있음승인<NA>
162421000000000206000039이마트210614.0428565.012707.2373721.54120600627000<NA>N<NA>071404113500000Y이마트0BASIC_INFO2007090200000031814113510300<NA><NA><NA>N미지정<NA>표지판 있음승인기초정보입력
113211000020600206000491새마을연수원211209.5431707.50.00.020600704000<NA>N<NA>074584113500000Y<NA>90b12680192009120200000031824113510600<NA><NA><NA>N미지정<NA>쉘터승인<NA>
5441000000000226000044나자로마을.신안아파트196932.0428418.012657.9633721.47222600166000<NA>N<NA>271394143000000Y나자로마을.신안A0BASIC_INFO2007090200000031814143010500<NA><NA><NA>N미지정<NA>표지판 있음승인기초정보입력
270921000000000217000037잉크테크180452.0424703.012646.8173719.44921700345000<NA>N<NA>180854127300000Y잉크테크0BASIC_INFO2007090200000031814127310300<NA><NA><NA>N미지정<NA>표지판 있음승인기초정보입력
178881000016160215000057보산역앞204985.0490284.00.00.021500076000<NA>N<NA>160964125000000Y<NA>90kiseho072009081200000031824125010600<NA><NA><NA>N미지정<NA>쉘터승인<NA>
284691000041980214001516삼계1리192252.53896393278.258160.00.021401111000<NA>N<NA>485424122000000Y<NA>80smchoi2011042100000031804122035000<NA><NA><NA>N미지정<NA>표지판 없음승인<NA>
445351000000000104000059구의2동주민센터입구207915.7449911.5<NA><NA>00<NA>N<NA>74077<NA><NA><NA>0AUTO_UPDATE20160614155346318<NA><NA><NA><NA><NA>N미지정<NA><NA>승인기초정보입력
11971000000000226000032우성고.개나리A198258.0427547.012658.8713720.99722600191000<NA>N<NA>271064143000000Y우성고.개나리A0BASIC_INFO2007090200000031814143010500<NA><NA><NA>N미지정<NA>표지판 있음승인기초정보입력
이력아이디정류장아이디정류장명Y좌표X좌표지역X좌표지역Y좌표링크아이디정류장유형환승정류장유무중앙차로여부정류장영문명ARS아이디기관코드GIS유무비고업무코드등록아이디등록일자처리진행코드사용구분표지판유형행정동코드정류장중국어명정류장일본어명정류장베트남명DRT유무정류장유형명환승역타입명표지판유형명처리진행코드명업무코드명
427221000050990219000446백석역.고양종합터미널181304.0460160.00.00.021900392000<NA>N<NA>203404128500000Y<NA>90AD2057122012070300000031804128510600<NA><NA><NA>N미지정<NA>표지판 없음승인<NA>
101671000037420233000708동부출장소.병점초등학교202979.0411904.00.00.023301131000<NA>N<NA>363424159000000Y<NA>90atfc55742010091200000031814159011600<NA><NA><NA>N미지정<NA>표지판 있음승인<NA>
268211000070370219000007대우.삼성오피스텔179168.3965462155.47630.00.021900568000<NA>N<NA>201554128500000Y<NA>90ckdejr122013120300000031814128510400<NA><NA><NA>N미지정<NA>표지판 있음승인<NA>
538101000078688202000254현대모닝아파트198581.8476420500.3789<NA><NA>20200175000NN<NA>031574111000000Y<NA>90000000002014020310574531824111513800<NA><NA><NA>N미지정일반쉘터승인<NA>
161841000031980202000026KBS수원센터202463.0418478.00.00.020200096000<NA>N<NA>030784111500000Y<NA>90ojjang892010060100000031814111514100<NA><NA><NA>N미지정<NA>표지판 있음승인<NA>
35411000067600240001142창대1리.꽃동산244250.98635442034.393970.00.024000932000<NA>N<NA><NA>4183000000Y<NA>80000000002013102900000031824183025000<NA><NA><NA>N미지정<NA>쉘터승인<NA>
128221000000000205000143장애인복지센터213845.0437972.012709.4323726.62820500117000<NA>N<NA>060424113300000Y장애인복지센터0BASIC_INFO2007090200000031814113313200<NA><NA><NA>N미지정<NA>표지판 있음승인기초정보입력
70751000000000201000027입북동196475.0421065.012657.6623717.49320100431000<NA>N<NA>020474111300000Y입북동0BASIC_INFO2007090200000031804111314000<NA><NA><NA>N미지정<NA>표지판 없음승인기초정보입력
506741000000000109000255온누리교회203568.4463086.1<NA><NA><NA>0<NA>N<NA><NA><NA><NA><NA>0BASIC_INFO2010062421290731<NA><NA><NA><NA><NA><NA>N미지정<NA><NA>승인기초정보입력
507561000031640229001381파주시장애인종합복지관188939.76537486857.642350.00.022901766000<NA>N<NA>309544148000000Y<NA>90smchoi2010052700000031804148025600<NA><NA><NA>N미지정<NA>표지판 없음승인<NA>

Duplicate rows

Most frequently occurring

이력아이디정류장아이디정류장명Y좌표X좌표지역X좌표지역Y좌표링크아이디정류장유형환승정류장유무중앙차로여부정류장영문명ARS아이디기관코드GIS유무비고업무코드등록아이디등록일자처리진행코드사용구분표지판유형행정동코드DRT유무정류장유형명환승역타입명표지판유형명처리진행코드명업무코드명# duplicates
101000000000214000131안중터미널193817.0387462.00.00.021401788000<NA>N<NA>151684122000000Y<NA>90kyung4862009030500000031824122025300N미지정<NA>쉘터승인<NA>3
01000000000106000005중랑역.동부시장206725.9454942.3<NA><NA>00<NA>Y<NA>76004<NA><NA><NA>90AUTO_UPDATE20160614155346318<NA><NA>N미지정<NA><NA>승인<NA>2
11000000000106000047금란교회앞209085.4455625.9<NA><NA>00<NA>N<NA>76072<NA><NA><NA>90AUTO_UPDATE20160614155346318<NA><NA>N미지정<NA><NA>승인<NA>2
21000000000112000006모래내시장.가좌역192397.3452259.9<NA><NA>00<NA>Y<NA>81011<NA><NA><NA>90AUTO_UPDATE20160614155346318<NA><NA>N미지정<NA><NA>승인<NA>2
31000000000115000271송정역183241.9451358.9<NA><NA>00<NA>N<NA>84025<NA><NA><NA>90AUTO_UPDATE20130206084000318<NA><NA>N미지정<NA><NA>승인<NA>2
41000000000116000010개봉역.한마을아파트187803.0444127.7<NA><NA>00<NA>Y<NA>85027<NA><NA><NA>90AUTO_UPDATE20160614155346318<NA><NA>N미지정<NA><NA>승인<NA>2
51000000000124000014상일초등학교앞215213.0449638.6<NA><NA>00<NA>N<NA>93072<NA><NA><NA>90AUTO_UPDATE20110321084000318<NA><NA>N미지정<NA><NA>승인<NA>2
61000000000124000022둔촌2동주민센터.서울양병원212552.1448253.5<NA><NA>00<NA>N<NA>93035<NA><NA><NA>90AUTO_UPDATE20160614155346318<NA><NA>N미지정<NA><NA>승인<NA>2
71000000000166000378KT부평지사174345.07843444992.53826<NA><NA>00<NA>N<NA>97017<NA><NA><NA>90AUTO_UPDATE20160614155346318<NA><NA>N미지정<NA><NA>승인<NA>2
81000000000167000358계양역176703.34928452376.42055<NA><NA>00<NA>N<NA>98126<NA><NA><NA>90AUTO_UPDATE20160614155346318<NA><NA>N미지정<NA><NA>승인<NA>2