Overview

Dataset statistics

Number of variables6
Number of observations539
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.4 KiB
Average record size in memory50.2 B

Variable types

Numeric2
Text2
Categorical2

Dataset

Description이 데이터는 2023년 8월 24일 기준으로 전라북도 남원시 버스승강장 현황에 대한 승강장 이름, 승강장 위치, 설치연도, 형태에 대한 데이터 입니다.
URLhttps://www.data.go.kr/data/15111760/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 형태High correlation
설치연도 is highly overall correlated with 형태High correlation
형태 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
형태 is highly imbalanced (59.1%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:26:26.857771
Analysis finished2023-12-12 03:26:28.109303
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct539
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean270
Minimum1
Maximum539
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.9 KiB
2023-12-12T12:26:28.220280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile27.9
Q1135.5
median270
Q3404.5
95-th percentile512.1
Maximum539
Range538
Interquartile range (IQR)269

Descriptive statistics

Standard deviation155.74017
Coefficient of variation (CV)0.57681544
Kurtosis-1.2
Mean270
Median Absolute Deviation (MAD)135
Skewness0
Sum145530
Variance24255
MonotonicityStrictly increasing
2023-12-12T12:26:28.451337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
356 1
 
0.2%
370 1
 
0.2%
369 1
 
0.2%
368 1
 
0.2%
367 1
 
0.2%
366 1
 
0.2%
365 1
 
0.2%
364 1
 
0.2%
363 1
 
0.2%
Other values (529) 529
98.1%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
539 1
0.2%
538 1
0.2%
537 1
0.2%
536 1
0.2%
535 1
0.2%
534 1
0.2%
533 1
0.2%
532 1
0.2%
531 1
0.2%
530 1
0.2%
Distinct444
Distinct (%)82.4%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-12-12T12:26:28.996242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length2
Mean length2.9183673
Min length2

Characters and Unicode

Total characters1573
Distinct characters244
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)68.6%

Sample

1st row공설시장
2nd row남원초교
3rd row남원초교
4th row공설시장(금동)
5th row노인복지회관
ValueCountFrequency (%)
신기 6
 
1.1%
신촌 5
 
0.9%
부동 3
 
0.6%
월산 3
 
0.6%
상신 3
 
0.6%
척동 3
 
0.6%
태평 3
 
0.6%
의지 3
 
0.6%
배산 3
 
0.6%
남창 3
 
0.6%
Other values (438) 510
93.6%
2023-12-12T12:26:29.791113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
75
 
4.8%
55
 
3.5%
40
 
2.5%
36
 
2.3%
) 31
 
2.0%
( 31
 
2.0%
30
 
1.9%
26
 
1.7%
25
 
1.6%
23
 
1.5%
Other values (234) 1201
76.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1481
94.2%
Close Punctuation 31
 
2.0%
Open Punctuation 31
 
2.0%
Decimal Number 17
 
1.1%
Space Separator 7
 
0.4%
Uppercase Letter 4
 
0.3%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
5.1%
55
 
3.7%
40
 
2.7%
36
 
2.4%
30
 
2.0%
26
 
1.8%
25
 
1.7%
23
 
1.6%
21
 
1.4%
21
 
1.4%
Other values (221) 1129
76.2%
Decimal Number
ValueCountFrequency (%)
2 8
47.1%
1 5
29.4%
3 2
 
11.8%
4 1
 
5.9%
5 1
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 2
50.0%
S 1
25.0%
C 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
. 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1481
94.2%
Common 88
 
5.6%
Latin 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
5.1%
55
 
3.7%
40
 
2.7%
36
 
2.4%
30
 
2.0%
26
 
1.8%
25
 
1.7%
23
 
1.6%
21
 
1.4%
21
 
1.4%
Other values (221) 1129
76.2%
Common
ValueCountFrequency (%)
) 31
35.2%
( 31
35.2%
2 8
 
9.1%
7
 
8.0%
1 5
 
5.7%
3 2
 
2.3%
4 1
 
1.1%
, 1
 
1.1%
5 1
 
1.1%
. 1
 
1.1%
Latin
ValueCountFrequency (%)
A 2
50.0%
S 1
25.0%
C 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1481
94.2%
ASCII 92
 
5.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
75
 
5.1%
55
 
3.7%
40
 
2.7%
36
 
2.4%
30
 
2.0%
26
 
1.8%
25
 
1.7%
23
 
1.6%
21
 
1.4%
21
 
1.4%
Other values (221) 1129
76.2%
ASCII
ValueCountFrequency (%)
) 31
33.7%
( 31
33.7%
2 8
 
8.7%
7
 
7.6%
1 5
 
5.4%
A 2
 
2.2%
3 2
 
2.2%
4 1
 
1.1%
S 1
 
1.1%
C 1
 
1.1%
Other values (3) 3
 
3.3%
Distinct533
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-12-12T12:26:30.205226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length30
Mean length21.211503
Min length15

Characters and Unicode

Total characters11433
Distinct characters176
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique527 ?
Unique (%)97.8%

Sample

1st row전라북도 남원시 금동 197-39
2nd row전라북도 남원시 금동 326-1
3rd row전라북도 남원시 금동 326
4th row전라북도 남원시 금동 566-47
5th row전라북도 남원시 금동 356-1
ValueCountFrequency (%)
전라북도 538
20.7%
남원시 536
20.6%
송동면 36
 
1.4%
아영면 35
 
1.3%
운봉읍 34
 
1.3%
주천면 30
 
1.2%
이백면 29
 
1.1%
보절면 28
 
1.1%
대강면 27
 
1.0%
금지면 26
 
1.0%
Other values (723) 1286
49.4%
2023-12-12T12:26:30.772365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2076
18.2%
561
 
4.9%
542
 
4.7%
541
 
4.7%
540
 
4.7%
538
 
4.7%
538
 
4.7%
537
 
4.7%
427
 
3.7%
- 421
 
3.7%
Other values (166) 4712
41.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6841
59.8%
Space Separator 2076
 
18.2%
Decimal Number 2048
 
17.9%
Dash Punctuation 421
 
3.7%
Open Punctuation 23
 
0.2%
Close Punctuation 23
 
0.2%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
561
 
8.2%
542
 
7.9%
541
 
7.9%
540
 
7.9%
538
 
7.9%
538
 
7.9%
537
 
7.8%
427
 
6.2%
391
 
5.7%
197
 
2.9%
Other values (151) 2029
29.7%
Decimal Number
ValueCountFrequency (%)
1 364
17.8%
2 276
13.5%
3 226
11.0%
5 212
10.4%
4 210
10.3%
6 183
8.9%
7 159
7.8%
8 147
7.2%
9 142
 
6.9%
0 129
 
6.3%
Space Separator
ValueCountFrequency (%)
2076
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 421
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6841
59.8%
Common 4592
40.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
561
 
8.2%
542
 
7.9%
541
 
7.9%
540
 
7.9%
538
 
7.9%
538
 
7.9%
537
 
7.8%
427
 
6.2%
391
 
5.7%
197
 
2.9%
Other values (151) 2029
29.7%
Common
ValueCountFrequency (%)
2076
45.2%
- 421
 
9.2%
1 364
 
7.9%
2 276
 
6.0%
3 226
 
4.9%
5 212
 
4.6%
4 210
 
4.6%
6 183
 
4.0%
7 159
 
3.5%
8 147
 
3.2%
Other values (5) 318
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6841
59.8%
ASCII 4592
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2076
45.2%
- 421
 
9.2%
1 364
 
7.9%
2 276
 
6.0%
3 226
 
4.9%
5 212
 
4.6%
4 210
 
4.6%
6 183
 
4.0%
7 159
 
3.5%
8 147
 
3.2%
Other values (5) 318
 
6.9%
Hangul
ValueCountFrequency (%)
561
 
8.2%
542
 
7.9%
541
 
7.9%
540
 
7.9%
538
 
7.9%
538
 
7.9%
537
 
7.8%
427
 
6.2%
391
 
5.7%
197
 
2.9%
Other values (151) 2029
29.7%

설치연도
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2015.3618
Minimum1987
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.9 KiB
2023-12-12T12:26:30.948184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1987
5-th percentile2005
Q12012
median2016
Q32020
95-th percentile2022
Maximum2022
Range35
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.3741867
Coefficient of variation (CV)0.0026666114
Kurtosis1.4734155
Mean2015.3618
Median Absolute Deviation (MAD)4
Skewness-1.0204382
Sum1086280
Variance28.881882
MonotonicityNot monotonic
2023-12-12T12:26:31.092793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
2020 72
13.4%
2012 58
10.8%
2019 48
8.9%
2014 48
8.9%
2022 46
 
8.5%
2021 45
 
8.3%
2013 35
 
6.5%
2017 27
 
5.0%
2015 25
 
4.6%
2018 20
 
3.7%
Other values (16) 115
21.3%
ValueCountFrequency (%)
1987 1
 
0.2%
1996 1
 
0.2%
1999 2
 
0.4%
2000 3
0.6%
2001 4
0.7%
2002 6
1.1%
2003 3
0.6%
2004 4
0.7%
2005 4
0.7%
2006 7
1.3%
ValueCountFrequency (%)
2022 46
8.5%
2021 45
8.3%
2020 72
13.4%
2019 48
8.9%
2018 20
 
3.7%
2017 27
 
5.0%
2016 20
 
3.7%
2015 25
 
4.6%
2014 48
8.9%
2013 35
6.5%

형태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
농촌형(쉘타)
453 
도시형
85 
농촌형(벽돌)
 
1

Length

Max length7
Median length7
Mean length6.3692022
Min length3

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row도시형
2nd row도시형
3rd row도시형
4th row도시형
5th row도시형

Common Values

ValueCountFrequency (%)
농촌형(쉘타) 453
84.0%
도시형 85
 
15.8%
농촌형(벽돌) 1
 
0.2%

Length

2023-12-12T12:26:31.261931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:26:31.402560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농촌형(쉘타 453
84.0%
도시형 85
 
15.8%
농촌형(벽돌 1
 
0.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-08-24
539 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-24
2nd row2023-08-24
3rd row2023-08-24
4th row2023-08-24
5th row2023-08-24

Common Values

ValueCountFrequency (%)
2023-08-24 539
100.0%

Length

2023-12-12T12:26:31.546533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:26:31.681170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-24 539
100.0%

Interactions

2023-12-12T12:26:27.588163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:26:27.315249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:26:27.726134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:26:27.451600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:26:31.764671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설치연도형태
연번1.0000.2890.702
설치연도0.2891.0000.815
형태0.7020.8151.000
2023-12-12T12:26:32.212627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설치연도형태
연번1.0000.0810.556
설치연도0.0811.0000.743
형태0.5560.7431.000

Missing values

2023-12-12T12:26:27.911371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:26:28.056034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번승강장명승강장 위치설치연도형태데이터기준일자
01공설시장전라북도 남원시 금동 197-392019도시형2023-08-24
12남원초교전라북도 남원시 금동 326-12013도시형2023-08-24
23남원초교전라북도 남원시 금동 3262013도시형2023-08-24
34공설시장(금동)전라북도 남원시 금동 566-472020도시형2023-08-24
45노인복지회관전라북도 남원시 금동 356-12013도시형2023-08-24
56메카센트럴A전라북도 남원시 조산동 163-22013도시형2023-08-24
67메카센트럴A전라북도 남원시 조산동 295-42013도시형2023-08-24
78미륭장전라북도 남원시 금동 566-512015도시형2023-08-24
89보건소전라북도 남원시 조산동 4552020농촌형(쉘타)2023-08-24
910시외버스정류장전라북도 남원시 금동 292-262020도시형2023-08-24
연번승강장명승강장 위치설치연도형태데이터기준일자
529530하주전라북도 남원시 주천면 주천리 539-52000농촌형(쉘타)2023-08-24
530531회덕전라북도 남원시 주천면 덕치리 168-32006농촌형(쉘타)2023-08-24
531532제바위전라북도 남원시 주천면 용담리 350-352014농촌형(쉘타)2023-08-24
532533배촌전라북도 남원시 주천면 배덕리 517-12022농촌형(쉘타)2023-08-24
533534은송(행정)전라북도 남원시 주천면 호기리 산 64-12018농촌형(쉘타)2023-08-24
534535호경전라북도 남원시 주천면 호경리 4782018농촌형(쉘타)2023-08-24
535536호곡전라북도 남원시 주천면 호기리 482-102019농촌형(쉘타)2023-08-24
536537고기리전라북도 남원시 주천면 고기리 산 4-22020농촌형(쉘타)2023-08-24
537538육모정전라북도 남원시 주천면 호경리 16-62020도시형2023-08-24
538539외용궁전라북도 남원시 주천면 용궁리 산31-22021농촌형(쉘타)2023-08-24