Overview

Dataset statistics

Number of variables6
Number of observations3410
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory166.6 KiB
Average record size in memory50.0 B

Variable types

Numeric2
Categorical2
Text2

Dataset

Description인천 버스정보안내기(버스도착시간 정보안내장치) 설치 현황에 대한 데이터로서 설치지역, 설치년도, 버스정류소별 설치된 안내기 유형에 대한 정보 등을 제공합니다.
URLhttps://www.data.go.kr/data/15104176/fileData.do

Alerts

순서(NO) is highly overall correlated with 설치지역High correlation
정류소(ID) is highly overall correlated with 설치지역High correlation
설치지역 is highly overall correlated with 순서(NO) and 1 other fieldsHigh correlation
순서(NO) has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:11:56.693835
Analysis finished2023-12-12 17:11:58.046140
Duration1.35 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순서(NO)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct3410
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1710.7924
Minimum1
Maximum3454
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.1 KiB
2023-12-13T02:11:58.153166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile171.45
Q1854.25
median1711.5
Q32563.75
95-th percentile3254.55
Maximum3454
Range3453
Interquartile range (IQR)1709.5

Descriptive statistics

Standard deviation988.50101
Coefficient of variation (CV)0.57780302
Kurtosis-1.1957014
Mean1710.7924
Median Absolute Deviation (MAD)855
Skewness0.0026530969
Sum5833802
Variance977134.24
MonotonicityNot monotonic
2023-12-13T02:11:58.358793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
140 1
 
< 0.1%
2314 1
 
< 0.1%
2316 1
 
< 0.1%
2317 1
 
< 0.1%
2318 1
 
< 0.1%
2319 1
 
< 0.1%
2320 1
 
< 0.1%
2321 1
 
< 0.1%
2322 1
 
< 0.1%
2323 1
 
< 0.1%
Other values (3400) 3400
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
3454 1
< 0.1%
3453 1
< 0.1%
3452 1
< 0.1%
3451 1
< 0.1%
3450 1
< 0.1%
3449 1
< 0.1%
3448 1
< 0.1%
3447 1
< 0.1%
3446 1
< 0.1%
3445 1
< 0.1%

설치지역
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size26.8 KiB
서구
721 
부평구
483 
연수구
467 
남동구
458 
미추홀구
367 
Other values (5)
914 

Length

Max length7
Median length3
Mean length2.7938416
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row계양구
2nd row계양구
3rd row계양구
4th row계양구
5th row계양구

Common Values

ValueCountFrequency (%)
서구 721
21.1%
부평구 483
14.2%
연수구 467
13.7%
남동구 458
13.4%
미추홀구 367
10.8%
계양구 328
9.6%
중구 302
8.9%
강화군 152
 
4.5%
동구 115
 
3.4%
옹진(영흥군) 17
 
0.5%

Length

2023-12-13T02:11:58.572924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:11:58.702995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서구 721
21.1%
부평구 483
14.2%
연수구 467
13.7%
남동구 458
13.4%
미추홀구 367
10.8%
계양구 328
9.6%
중구 302
8.9%
강화군 152
 
4.5%
동구 115
 
3.4%
옹진(영흥군 17
 
0.5%

설치년도
Categorical

Distinct15
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size26.8 KiB
2021년
630 
2019년
560 
2022년
440 
2020년
412 
2018년
336 
Other values (10)
1032 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2008년
2nd row2008년
3rd row2008년
4th row2008년
5th row2008년

Common Values

ValueCountFrequency (%)
2021년 630
18.5%
2019년 560
16.4%
2022년 440
12.9%
2020년 412
12.1%
2018년 336
9.9%
2017년 272
8.0%
2008년 241
 
7.1%
2014년 147
 
4.3%
2016년 99
 
2.9%
2011년 74
 
2.2%
Other values (5) 199
 
5.8%

Length

2023-12-13T02:11:58.868691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2021년 630
18.5%
2019년 560
16.4%
2022년 440
12.9%
2020년 412
12.1%
2018년 336
9.9%
2017년 272
8.0%
2008년 241
 
7.1%
2014년 147
 
4.3%
2016년 99
 
2.9%
2011년 74
 
2.2%
Other values (5) 199
 
5.8%

정류소(ID)
Real number (ℝ)

HIGH CORRELATION 

Distinct3385
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42580.794
Minimum32062
Maximum92032
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.1 KiB
2023-12-13T02:11:59.020338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum32062
5-th percentile35413.35
Q138126.5
median40030.5
Q342065.75
95-th percentile89055.55
Maximum92032
Range59970
Interquartile range (IQR)3939.25

Descriptive statistics

Standard deviation12036.105
Coefficient of variation (CV)0.28266512
Kurtosis10.779949
Mean42580.794
Median Absolute Deviation (MAD)1980
Skewness3.4869296
Sum1.4520051 × 108
Variance1.4486783 × 108
MonotonicityNot monotonic
2023-12-13T02:11:59.185061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
38394 3
 
0.1%
39588 3
 
0.1%
41563 2
 
0.1%
41309 2
 
0.1%
41562 2
 
0.1%
41565 2
 
0.1%
37513 2
 
0.1%
42829 2
 
0.1%
41247 2
 
0.1%
42838 2
 
0.1%
Other values (3375) 3388
99.4%
ValueCountFrequency (%)
32062 1
< 0.1%
32063 1
< 0.1%
35001 1
< 0.1%
35002 1
< 0.1%
35003 1
< 0.1%
35004 1
< 0.1%
35005 1
< 0.1%
35009 1
< 0.1%
35010 1
< 0.1%
35015 1
< 0.1%
ValueCountFrequency (%)
92032 1
< 0.1%
92031 1
< 0.1%
92026 1
< 0.1%
92025 1
< 0.1%
92024 1
< 0.1%
92022 1
< 0.1%
92021 1
< 0.1%
92020 1
< 0.1%
92019 1
< 0.1%
92015 1
< 0.1%
Distinct2329
Distinct (%)68.3%
Missing0
Missing (%)0.0%
Memory size26.8 KiB
2023-12-13T02:11:59.542832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length8.0184751
Min length2

Characters and Unicode

Total characters27343
Distinct characters532
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1489 ?
Unique (%)43.7%

Sample

1st row아나지고개
2nd row북인천여자중학교
3rd row경남아파트
4th row북인천여자중학교
5th row효성사거리
ValueCountFrequency (%)
풍림아파트 8
 
0.2%
현대아파트 8
 
0.2%
한국아파트 8
 
0.2%
대동아파트 8
 
0.2%
부평역 8
 
0.2%
쌍용아파트 7
 
0.2%
광명아파트 7
 
0.2%
삼보아파트 7
 
0.2%
경남아파트 7
 
0.2%
올리브백화점 6
 
0.2%
Other values (2356) 3387
97.9%
2023-12-13T02:12:00.120342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 850
 
3.1%
( 847
 
3.1%
780
 
2.9%
768
 
2.8%
677
 
2.5%
3 601
 
2.2%
531
 
1.9%
507
 
1.9%
483
 
1.8%
8 475
 
1.7%
Other values (522) 20824
76.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21777
79.6%
Decimal Number 3451
 
12.6%
Close Punctuation 850
 
3.1%
Open Punctuation 847
 
3.1%
Uppercase Letter 195
 
0.7%
Other Punctuation 134
 
0.5%
Space Separator 52
 
0.2%
Lowercase Letter 28
 
0.1%
Connector Punctuation 6
 
< 0.1%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
780
 
3.6%
768
 
3.5%
677
 
3.1%
531
 
2.4%
507
 
2.3%
483
 
2.2%
442
 
2.0%
350
 
1.6%
332
 
1.5%
296
 
1.4%
Other values (476) 16611
76.3%
Uppercase Letter
ValueCountFrequency (%)
K 34
17.4%
S 31
15.9%
C 19
9.7%
L 16
8.2%
G 15
7.7%
T 15
7.7%
H 15
7.7%
A 12
 
6.2%
I 9
 
4.6%
B 7
 
3.6%
Other values (8) 22
11.3%
Decimal Number
ValueCountFrequency (%)
3 601
17.4%
8 475
13.8%
2 431
12.5%
1 361
10.5%
4 358
10.4%
5 332
9.6%
9 327
9.5%
0 201
 
5.8%
6 194
 
5.6%
7 171
 
5.0%
Lowercase Letter
ValueCountFrequency (%)
e 19
67.9%
s 2
 
7.1%
g 1
 
3.6%
y 1
 
3.6%
m 1
 
3.6%
f 1
 
3.6%
i 1
 
3.6%
t 1
 
3.6%
n 1
 
3.6%
Other Punctuation
ValueCountFrequency (%)
. 107
79.9%
· 16
 
11.9%
, 6
 
4.5%
/ 5
 
3.7%
Close Punctuation
ValueCountFrequency (%)
) 850
100.0%
Open Punctuation
ValueCountFrequency (%)
( 847
100.0%
Space Separator
ValueCountFrequency (%)
52
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21780
79.7%
Common 5340
 
19.5%
Latin 223
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
780
 
3.6%
768
 
3.5%
677
 
3.1%
531
 
2.4%
507
 
2.3%
483
 
2.2%
442
 
2.0%
350
 
1.6%
332
 
1.5%
296
 
1.4%
Other values (477) 16614
76.3%
Latin
ValueCountFrequency (%)
K 34
15.2%
S 31
13.9%
C 19
8.5%
e 19
8.5%
L 16
7.2%
G 15
 
6.7%
T 15
 
6.7%
H 15
 
6.7%
A 12
 
5.4%
I 9
 
4.0%
Other values (17) 38
17.0%
Common
ValueCountFrequency (%)
) 850
15.9%
( 847
15.9%
3 601
11.3%
8 475
8.9%
2 431
8.1%
1 361
6.8%
4 358
6.7%
5 332
 
6.2%
9 327
 
6.1%
0 201
 
3.8%
Other values (8) 557
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21777
79.6%
ASCII 5547
 
20.3%
None 19
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 850
15.3%
( 847
15.3%
3 601
10.8%
8 475
8.6%
2 431
7.8%
1 361
6.5%
4 358
6.5%
5 332
 
6.0%
9 327
 
5.9%
0 201
 
3.6%
Other values (34) 764
13.8%
Hangul
ValueCountFrequency (%)
780
 
3.6%
768
 
3.5%
677
 
3.1%
531
 
2.4%
507
 
2.3%
483
 
2.2%
442
 
2.0%
350
 
1.6%
332
 
1.5%
296
 
1.4%
Other values (476) 16611
76.3%
None
ValueCountFrequency (%)
· 16
84.2%
3
 
15.8%
Distinct58
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size26.8 KiB
2023-12-13T02:12:00.394273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length12
Mean length11.772434
Min length7

Characters and Unicode

Total characters40144
Distinct characters47
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.3%

Sample

1st rowLED 3단10열 독립
2nd rowLED 3단10열 부착
3rd rowLED 3단10열 부착
4th rowLED 3단10열 부착
5th rowLED 3단10열 부착
ValueCountFrequency (%)
led 2690
27.4%
부착 2347
23.9%
4단12열 1016
 
10.3%
2단8열 606
 
6.2%
독립 485
 
4.9%
4단 277
 
2.8%
16열 265
 
2.7%
3단10열 251
 
2.6%
8열 230
 
2.3%
2단 212
 
2.2%
Other values (45) 1453
14.8%
2023-12-13T02:12:00.780653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6512
16.2%
L 3410
 
8.5%
D 3410
 
8.5%
E 3035
 
7.6%
2812
 
7.0%
2809
 
7.0%
2351
 
5.9%
2351
 
5.9%
2 2105
 
5.2%
1 1987
 
4.9%
Other values (37) 9362
23.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14304
35.6%
Uppercase Letter 10236
25.5%
Decimal Number 8116
20.2%
Space Separator 6512
16.2%
Open Punctuation 351
 
0.9%
Close Punctuation 351
 
0.9%
Dash Punctuation 194
 
0.5%
Connector Punctuation 37
 
0.1%
Other Punctuation 33
 
0.1%
Lowercase Letter 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2812
19.7%
2809
19.6%
2351
16.4%
2351
16.4%
727
 
5.1%
574
 
4.0%
543
 
3.8%
537
 
3.8%
500
 
3.5%
227
 
1.6%
Other values (11) 873
 
6.1%
Decimal Number
ValueCountFrequency (%)
2 2105
25.9%
1 1987
24.5%
4 1403
17.3%
8 899
11.1%
6 613
 
7.6%
3 557
 
6.9%
0 522
 
6.4%
5 27
 
0.3%
9 2
 
< 0.1%
7 1
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
L 3410
33.3%
D 3410
33.3%
E 3035
29.7%
C 375
 
3.7%
B 2
 
< 0.1%
I 2
 
< 0.1%
T 2
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
. 15
45.5%
" 13
39.4%
/ 5
 
15.2%
Space Separator
ValueCountFrequency (%)
6512
100.0%
Open Punctuation
ValueCountFrequency (%)
( 351
100.0%
Close Punctuation
ValueCountFrequency (%)
) 351
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 194
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 37
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15594
38.8%
Hangul 14304
35.6%
Latin 10246
25.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2812
19.7%
2809
19.6%
2351
16.4%
2351
16.4%
727
 
5.1%
574
 
4.0%
543
 
3.8%
537
 
3.8%
500
 
3.5%
227
 
1.6%
Other values (11) 873
 
6.1%
Common
ValueCountFrequency (%)
6512
41.8%
2 2105
 
13.5%
1 1987
 
12.7%
4 1403
 
9.0%
8 899
 
5.8%
6 613
 
3.9%
3 557
 
3.6%
0 522
 
3.3%
( 351
 
2.3%
) 351
 
2.3%
Other values (8) 294
 
1.9%
Latin
ValueCountFrequency (%)
L 3410
33.3%
D 3410
33.3%
E 3035
29.6%
C 375
 
3.7%
m 10
 
0.1%
B 2
 
< 0.1%
I 2
 
< 0.1%
T 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25840
64.4%
Hangul 14304
35.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6512
25.2%
L 3410
13.2%
D 3410
13.2%
E 3035
11.7%
2 2105
 
8.1%
1 1987
 
7.7%
4 1403
 
5.4%
8 899
 
3.5%
6 613
 
2.4%
3 557
 
2.2%
Other values (16) 1909
 
7.4%
Hangul
ValueCountFrequency (%)
2812
19.7%
2809
19.6%
2351
16.4%
2351
16.4%
727
 
5.1%
574
 
4.0%
543
 
3.8%
537
 
3.8%
500
 
3.5%
227
 
1.6%
Other values (11) 873
 
6.1%

Interactions

2023-12-13T02:11:57.496460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:11:57.251901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:11:57.656075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:11:57.385858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:12:00.908227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순서(NO)설치지역설치년도정류소(ID)안내기유형(상세)
순서(NO)1.0000.9420.7360.6120.851
설치지역0.9421.0000.4960.9060.835
설치년도0.7360.4961.0000.3010.913
정류소(ID)0.6120.9060.3011.0000.859
안내기유형(상세)0.8510.8350.9130.8591.000
2023-12-13T02:12:01.042397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설치년도설치지역
설치년도1.0000.208
설치지역0.2081.000
2023-12-13T02:12:01.151008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순서(NO)정류소(ID)설치지역설치년도
순서(NO)1.000-0.0950.5970.379
정류소(ID)-0.0951.0000.7920.174
설치지역0.5970.7921.0000.208
설치년도0.3790.1740.2081.000

Missing values

2023-12-13T02:11:57.814656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:11:57.969071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순서(NO)설치지역설치년도정류소(ID)정류소명안내기유형(상세)
0140계양구2008년41001아나지고개LED 3단10열 독립
1141계양구2008년41008북인천여자중학교LED 3단10열 부착
2142계양구2008년41014경남아파트LED 3단10열 부착
3143계양구2008년41016북인천여자중학교LED 3단10열 부착
4144계양구2008년41017효성사거리LED 3단10열 부착
5145계양구2008년41018효성사거리LED 3단10열 부착
6147계양구2008년41023효성소방파출소LED 3단10열 부착
7149계양구2008년41027미도아파트LED 3단10열 독립
8151계양구2008년41035효성현대4차LED 3단10열 독립
9152계양구2008년41037e편한세상계양LED 3단10열 독립
순서(NO)설치지역설치년도정류소(ID)정류소명안내기유형(상세)
34003402중구2022년35646e편한세상영종하늘도시(35646)LED거치형 3단 10열
34013403중구2022년35647e편한세상영종하늘도시(35647)LED거치형 3단 10열
34023404중구2022년35648그린나래지하차도(35648)LED거치형 3단 10열
34033405중구2022년35649그린나래지하차도(35649)LED거치형 3단 10열
34043406중구2022년35727카페거리(35727)LED거치형 3단 10열
34053407중구2022년35728카페거리(35728)LED거치형 3단 10열
34063408중구2022년35755영종KCC스위첸 옆문(35755)LED거치형 3단 10열
34073409중구2022년35756영종KCC스위첸 옆문(35756)LED거치형 3단 10열
34083410중구2022년35761e편한세상영종오션하임(35761)LED거치형 3단 10열
34093411중구2022년35823인천하늘중학교(35823)LED거치형 3단 10열