Overview

Dataset statistics

Number of variables6
Number of observations1284
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory61.6 KiB
Average record size in memory49.1 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description충청북도 충주시 버스 승강장 현황에 관한 데이터를 제공합니다(연번, 읍면동명, 유형, 주소, 주변위치, 승강장 표시 등)
URLhttps://www.data.go.kr/data/15029314/fileData.do

Alerts

연번 is highly overall correlated with 읍면동명High correlation
읍면동명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:44:41.313763
Analysis finished2023-12-12 21:44:42.176067
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1284
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean642.5
Minimum1
Maximum1284
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.4 KiB
2023-12-13T06:44:42.275870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile65.15
Q1321.75
median642.5
Q3963.25
95-th percentile1219.85
Maximum1284
Range1283
Interquartile range (IQR)641.5

Descriptive statistics

Standard deviation370.80318
Coefficient of variation (CV)0.57712558
Kurtosis-1.2
Mean642.5
Median Absolute Deviation (MAD)321
Skewness0
Sum824970
Variance137495
MonotonicityStrictly increasing
2023-12-13T06:44:42.434631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
805 1
 
0.1%
863 1
 
0.1%
862 1
 
0.1%
861 1
 
0.1%
860 1
 
0.1%
859 1
 
0.1%
858 1
 
0.1%
857 1
 
0.1%
856 1
 
0.1%
Other values (1274) 1274
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1284 1
0.1%
1283 1
0.1%
1282 1
0.1%
1281 1
0.1%
1280 1
0.1%
1279 1
0.1%
1278 1
0.1%
1277 1
0.1%
1276 1
0.1%
1275 1
0.1%

읍면동명
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
대소원면
109 
주덕읍
91 
동량면
90 
중앙탑면
86 
앙성면
83 
Other values (20)
825 

Length

Max length5
Median length3
Mean length3.4742991
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주덕읍
2nd row주덕읍
3rd row주덕읍
4th row주덕읍
5th row주덕읍

Common Values

ValueCountFrequency (%)
대소원면 109
 
8.5%
주덕읍 91
 
7.1%
동량면 90
 
7.0%
중앙탑면 86
 
6.7%
앙성면 83
 
6.5%
금가면 68
 
5.3%
살미면 67
 
5.2%
신니면 64
 
5.0%
수안보면 60
 
4.7%
교현안림동 59
 
4.6%
Other values (15) 507
39.5%

Length

2023-12-13T06:44:42.603484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대소원면 109
 
8.5%
주덕읍 91
 
7.1%
동량면 90
 
7.0%
중앙탑면 86
 
6.7%
앙성면 83
 
6.5%
금가면 68
 
5.3%
살미면 67
 
5.2%
신니면 64
 
5.0%
수안보면 60
 
4.7%
교현안림동 59
 
4.6%
Other values (15) 507
39.5%

유형
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
유개형
733 
폴대형
531 
스마트형
 
17
데이터 미집계
 
3

Length

Max length7
Median length3
Mean length3.0225857
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유개형
2nd row유개형
3rd row유개형
4th row유개형
5th row유개형

Common Values

ValueCountFrequency (%)
유개형 733
57.1%
폴대형 531
41.4%
스마트형 17
 
1.3%
데이터 미집계 3
 
0.2%

Length

2023-12-13T06:44:43.089311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:44:43.205251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유개형 733
57.0%
폴대형 531
41.3%
스마트형 17
 
1.3%
데이터 3
 
0.2%
미집계 3
 
0.2%

주소
Text

Distinct1179
Distinct (%)91.8%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2023-12-13T06:44:43.586706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length8.6799065
Min length4

Characters and Unicode

Total characters11145
Distinct characters150
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1102 ?
Unique (%)85.8%

Sample

1st row화곡리 478-3
2nd row화곡리 516
3rd row화곡리 89-63
4th row사락리 882-1
5th row사락리 687
ValueCountFrequency (%)
연수동 46
 
1.8%
호암동 29
 
1.1%
안림동 28
 
1.1%
데이터 25
 
1.0%
미집계 25
 
1.0%
용전리 25
 
1.0%
만정리 22
 
0.9%
21
 
0.8%
칠금동 21
 
0.8%
용탄동 20
 
0.8%
Other values (1264) 2310
89.8%
2023-12-13T06:44:44.121123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1390
 
12.5%
- 952
 
8.5%
1 908
 
8.1%
902
 
8.1%
2 662
 
5.9%
3 570
 
5.1%
4 502
 
4.5%
5 447
 
4.0%
6 443
 
4.0%
406
 
3.6%
Other values (140) 3963
35.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4808
43.1%
Other Letter 3993
35.8%
Space Separator 1390
 
12.5%
Dash Punctuation 952
 
8.5%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
902
22.6%
406
 
10.2%
140
 
3.5%
130
 
3.3%
81
 
2.0%
70
 
1.8%
64
 
1.6%
54
 
1.4%
54
 
1.4%
52
 
1.3%
Other values (127) 2040
51.1%
Decimal Number
ValueCountFrequency (%)
1 908
18.9%
2 662
13.8%
3 570
11.9%
4 502
10.4%
5 447
9.3%
6 443
9.2%
7 380
7.9%
8 325
 
6.8%
0 304
 
6.3%
9 267
 
5.6%
Space Separator
ValueCountFrequency (%)
1390
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 952
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7152
64.2%
Hangul 3993
35.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
902
22.6%
406
 
10.2%
140
 
3.5%
130
 
3.3%
81
 
2.0%
70
 
1.8%
64
 
1.6%
54
 
1.4%
54
 
1.4%
52
 
1.3%
Other values (127) 2040
51.1%
Common
ValueCountFrequency (%)
1390
19.4%
- 952
13.3%
1 908
12.7%
2 662
9.3%
3 570
8.0%
4 502
 
7.0%
5 447
 
6.2%
6 443
 
6.2%
7 380
 
5.3%
8 325
 
4.5%
Other values (3) 573
8.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7152
64.2%
Hangul 3993
35.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1390
19.4%
- 952
13.3%
1 908
12.7%
2 662
9.3%
3 570
8.0%
4 502
 
7.0%
5 447
 
6.2%
6 443
 
6.2%
7 380
 
5.3%
8 325
 
4.5%
Other values (3) 573
8.0%
Hangul
ValueCountFrequency (%)
902
22.6%
406
 
10.2%
140
 
3.5%
130
 
3.3%
81
 
2.0%
70
 
1.8%
64
 
1.6%
54
 
1.4%
54
 
1.4%
52
 
1.3%
Other values (127) 2040
51.1%
Distinct1019
Distinct (%)79.4%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2023-12-13T06:44:44.451685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length7.5046729
Min length2

Characters and Unicode

Total characters9636
Distinct characters474
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique965 ?
Unique (%)75.2%

Sample

1st row계막마을 입구
2nd row화곡1리
3rd row화곡2리 마을회관 앞
4th row음동마을 앞
5th row매남마을 자랑비 옆
ValueCountFrequency (%)
353
 
13.7%
데이터 193
 
7.5%
미집계 193
 
7.5%
입구 152
 
5.9%
맞은편 143
 
5.6%
40
 
1.6%
건너편 34
 
1.3%
마을회관 29
 
1.1%
마을 19
 
0.7%
인근 16
 
0.6%
Other values (1028) 1401
54.5%
2023-12-13T06:44:44.956190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1295
 
13.4%
369
 
3.8%
341
 
3.5%
307
 
3.2%
243
 
2.5%
226
 
2.3%
225
 
2.3%
220
 
2.3%
213
 
2.2%
204
 
2.1%
Other values (464) 5993
62.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8126
84.3%
Space Separator 1295
 
13.4%
Decimal Number 114
 
1.2%
Uppercase Letter 36
 
0.4%
Close Punctuation 25
 
0.3%
Open Punctuation 25
 
0.3%
Dash Punctuation 9
 
0.1%
Other Punctuation 5
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
369
 
4.5%
341
 
4.2%
307
 
3.8%
243
 
3.0%
226
 
2.8%
225
 
2.8%
220
 
2.7%
213
 
2.6%
204
 
2.5%
196
 
2.4%
Other values (432) 5582
68.7%
Uppercase Letter
ValueCountFrequency (%)
C 5
13.9%
P 5
13.9%
G 5
13.9%
S 3
8.3%
T 3
8.3%
A 3
8.3%
I 2
 
5.6%
L 2
 
5.6%
D 2
 
5.6%
R 1
 
2.8%
Other values (5) 5
13.9%
Decimal Number
ValueCountFrequency (%)
1 36
31.6%
2 33
28.9%
3 12
 
10.5%
5 9
 
7.9%
6 6
 
5.3%
0 5
 
4.4%
9 4
 
3.5%
4 4
 
3.5%
8 3
 
2.6%
7 2
 
1.8%
Other Punctuation
ValueCountFrequency (%)
, 3
60.0%
. 2
40.0%
Space Separator
ValueCountFrequency (%)
1295
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8126
84.3%
Common 1473
 
15.3%
Latin 37
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
369
 
4.5%
341
 
4.2%
307
 
3.8%
243
 
3.0%
226
 
2.8%
225
 
2.8%
220
 
2.7%
213
 
2.6%
204
 
2.5%
196
 
2.4%
Other values (432) 5582
68.7%
Common
ValueCountFrequency (%)
1295
87.9%
1 36
 
2.4%
2 33
 
2.2%
) 25
 
1.7%
( 25
 
1.7%
3 12
 
0.8%
- 9
 
0.6%
5 9
 
0.6%
6 6
 
0.4%
0 5
 
0.3%
Other values (6) 18
 
1.2%
Latin
ValueCountFrequency (%)
C 5
13.5%
P 5
13.5%
G 5
13.5%
S 3
8.1%
T 3
8.1%
A 3
8.1%
I 2
 
5.4%
L 2
 
5.4%
D 2
 
5.4%
R 1
 
2.7%
Other values (6) 6
16.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8126
84.3%
ASCII 1510
 
15.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1295
85.8%
1 36
 
2.4%
2 33
 
2.2%
) 25
 
1.7%
( 25
 
1.7%
3 12
 
0.8%
- 9
 
0.6%
5 9
 
0.6%
6 6
 
0.4%
C 5
 
0.3%
Other values (22) 55
 
3.6%
Hangul
ValueCountFrequency (%)
369
 
4.5%
341
 
4.2%
307
 
3.8%
243
 
3.0%
226
 
2.8%
225
 
2.8%
220
 
2.7%
213
 
2.6%
204
 
2.5%
196
 
2.4%
Other values (432) 5582
68.7%
Distinct721
Distinct (%)56.2%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2023-12-13T06:44:45.248530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length35
Mean length9.9859813
Min length2

Characters and Unicode

Total characters12822
Distinct characters339
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique699 ?
Unique (%)54.4%

Sample

1st row주덕,충주-계막-노은
2nd row화곡2리-화곡1리-계막
3rd row사락,음동-화곡2리-주덕,화곡
4th row계막-음동-원사락
5th row음동(종점)-매남-충주
ValueCountFrequency (%)
데이터 542
28.8%
미집계 542
28.8%
시내 7
 
0.4%
충주 3
 
0.2%
어린이승강장 3
 
0.2%
건너편 3
 
0.2%
수안보-화천-연풍 2
 
0.1%
호암리버빌앞 2
 
0.1%
중앙탑면 2
 
0.1%
대소원 2
 
0.1%
Other values (749) 774
41.1%
2023-12-13T06:44:45.676898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1417
 
11.1%
646
 
5.0%
636
 
5.0%
604
 
4.7%
585
 
4.6%
577
 
4.5%
546
 
4.3%
543
 
4.2%
492
 
3.8%
356
 
2.8%
Other values (329) 6420
50.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10228
79.8%
Dash Punctuation 1417
 
11.1%
Space Separator 604
 
4.7%
Other Punctuation 329
 
2.6%
Decimal Number 120
 
0.9%
Close Punctuation 46
 
0.4%
Open Punctuation 46
 
0.4%
Uppercase Letter 32
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
646
 
6.3%
636
 
6.2%
585
 
5.7%
577
 
5.6%
546
 
5.3%
543
 
5.3%
492
 
4.8%
356
 
3.5%
195
 
1.9%
180
 
1.8%
Other values (300) 5472
53.5%
Uppercase Letter
ValueCountFrequency (%)
A 7
21.9%
T 4
12.5%
E 4
12.5%
P 4
12.5%
C 3
9.4%
V 2
 
6.2%
I 2
 
6.2%
R 1
 
3.1%
L 1
 
3.1%
H 1
 
3.1%
Other values (3) 3
9.4%
Decimal Number
ValueCountFrequency (%)
2 48
40.0%
1 44
36.7%
3 13
 
10.8%
4 4
 
3.3%
6 4
 
3.3%
5 3
 
2.5%
7 2
 
1.7%
8 1
 
0.8%
9 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 322
97.9%
. 6
 
1.8%
/ 1
 
0.3%
Dash Punctuation
ValueCountFrequency (%)
- 1417
100.0%
Space Separator
ValueCountFrequency (%)
604
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10228
79.8%
Common 2562
 
20.0%
Latin 32
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
646
 
6.3%
636
 
6.2%
585
 
5.7%
577
 
5.6%
546
 
5.3%
543
 
5.3%
492
 
4.8%
356
 
3.5%
195
 
1.9%
180
 
1.8%
Other values (300) 5472
53.5%
Common
ValueCountFrequency (%)
- 1417
55.3%
604
23.6%
, 322
 
12.6%
2 48
 
1.9%
) 46
 
1.8%
( 46
 
1.8%
1 44
 
1.7%
3 13
 
0.5%
. 6
 
0.2%
4 4
 
0.2%
Other values (6) 12
 
0.5%
Latin
ValueCountFrequency (%)
A 7
21.9%
T 4
12.5%
E 4
12.5%
P 4
12.5%
C 3
9.4%
V 2
 
6.2%
I 2
 
6.2%
R 1
 
3.1%
L 1
 
3.1%
H 1
 
3.1%
Other values (3) 3
9.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10228
79.8%
ASCII 2594
 
20.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1417
54.6%
604
23.3%
, 322
 
12.4%
2 48
 
1.9%
) 46
 
1.8%
( 46
 
1.8%
1 44
 
1.7%
3 13
 
0.5%
A 7
 
0.3%
. 6
 
0.2%
Other values (19) 41
 
1.6%
Hangul
ValueCountFrequency (%)
646
 
6.3%
636
 
6.2%
585
 
5.7%
577
 
5.6%
546
 
5.3%
543
 
5.3%
492
 
4.8%
356
 
3.5%
195
 
1.9%
180
 
1.8%
Other values (300) 5472
53.5%

Interactions

2023-12-13T06:44:41.892165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:44:45.790317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번읍면동명유형
연번1.0000.9930.215
읍면동명0.9931.0000.256
유형0.2150.2561.000
2023-12-13T06:44:45.884518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유형읍면동명
유형1.0000.136
읍면동명0.1361.000
2023-12-13T06:44:45.961943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번읍면동명유형
연번1.0000.9190.129
읍면동명0.9191.0000.136
유형0.1290.1361.000

Missing values

2023-12-13T06:44:42.031779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:44:42.133783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번읍면동명유형주소주변 위치승강장 표시
01주덕읍유개형화곡리 478-3계막마을 입구주덕,충주-계막-노은
12주덕읍유개형화곡리 516화곡1리화곡2리-화곡1리-계막
23주덕읍유개형화곡리 89-63화곡2리 마을회관 앞사락,음동-화곡2리-주덕,화곡
34주덕읍유개형사락리 882-1음동마을 앞계막-음동-원사락
45주덕읍유개형사락리 687매남마을 자랑비 옆음동(종점)-매남-충주
56주덕읍유개형사락리 578-1엄동마을 입구매남-음동(종점)
67주덕읍유개형제내리 625덕신교회 앞주덕-풍덕-노은
78주덕읍유개형제내리 210-5풍덕마을 입구노은-풍덕-주덕
89주덕읍유개형제내리 556-3방죽안(성동마을) 입구덕신초등학교-성동-주덕
910주덕읍유개형제내리 715성동마을 맞은편주덕-성동-창동
연번읍면동명유형주소주변 위치승강장 표시
12741275목행용탄동폴대형용탄동 778-24동화약품 정문 건너편데이터 미집계
12751276목행용탄동폴대형용탄동 1041-1소망공구데이터 미집계
12761277목행용탄동폴대형용탄동 1066-2충주로컬푸드스테이션데이터 미집계
12771278목행용탄동폴대형목행동 577-4목행초 맞은편데이터 미집계
12781279목행용탄동폴대형용탄동 643-5현대성우메탈 맞은편데이터 미집계
12791280목행용탄동폴대형용탄동 626현대성우메탈데이터 미집계
12801281칠금금릉동폴대형금릉동 255-26고가다리 아래데이터 미집계
12811282수안보면폴대형수안보로 129위담통합병원 입구데이터 미집계
12821283살미면폴대형충주호수로 2472해찬솔 바로앞데이터 미집계
12831284대소원면폴대형창현로 1030데이터 미집계데이터 미집계