Overview

Dataset statistics

Number of variables7
Number of observations5341
Missing cells214
Missing cells (%)0.6%
Duplicate rows12
Duplicate rows (%)0.2%
Total size in memory292.2 KiB
Average record size in memory56.0 B

Variable types

Categorical3
Text3
DateTime1

Dataset

Description경기도 수원시 도시계획시설정보로, 시설종류, 시설 상세 종류, 시설명, 최종변경일, 최종변경고시번호 등에 대한 데이터를 포함합니다.
URLhttps://www.data.go.kr/data/15119087/fileData.do

Alerts

시군구명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 12 (0.2%) duplicate rowsDuplicates
시설종류 is highly imbalanced (52.3%)Imbalance
시설상세 has 214 (4.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 10:13:29.651933
Analysis finished2023-12-12 10:13:30.411992
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size41.9 KiB
수원시
5341 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수원시
2nd row수원시
3rd row수원시
4th row수원시
5th row수원시

Common Values

ValueCountFrequency (%)
수원시 5341
100.0%

Length

2023-12-12T19:13:30.504282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:13:30.639085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수원시 5341
100.0%

시설종류
Categorical

IMBALANCE 

Distinct36
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size41.9 KiB
일반도로
3030 
녹지
604 
공원
416 
학교
 
227
공공공지
 
212
Other values (31)
852 

Length

Max length13
Median length4
Mean length3.7079199
Min length2

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row일반철도
2nd row일반철도
3rd row일반철도
4th row일반철도
5th row일반철도

Common Values

ValueCountFrequency (%)
일반도로 3030
56.7%
녹지 604
 
11.3%
공원 416
 
7.8%
학교 227
 
4.3%
공공공지 212
 
4.0%
보행자전용도로 201
 
3.8%
노외주차장 189
 
3.5%
공공청사 122
 
2.3%
광장 53
 
1.0%
하천 32
 
0.6%
Other values (26) 255
 
4.8%

Length

2023-12-12T19:13:30.791184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반도로 3030
56.6%
녹지 604
 
11.3%
공원 416
 
7.8%
학교 227
 
4.2%
공공공지 212
 
4.0%
보행자전용도로 201
 
3.8%
노외주차장 189
 
3.5%
공공청사 122
 
2.3%
광장 53
 
1.0%
하천 32
 
0.6%
Other values (28) 269
 
5.0%

시설상세
Text

MISSING 

Distinct66
Distinct (%)1.3%
Missing214
Missing (%)4.0%
Memory size41.9 KiB
2023-12-12T19:13:31.009280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length4
Mean length4.2535596
Min length3

Characters and Unicode

Total characters21808
Distinct characters110
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.2%

Sample

1st row주간선도로
2nd row국지도로
3rd row주간선도로
4th row보조간선도로
5th row집산도로
ValueCountFrequency (%)
국지도로 2297
44.8%
집산도로 486
 
9.5%
완충녹지 485
 
9.5%
특수도로 231
 
4.5%
어린이공원 221
 
4.3%
공공공지 212
 
4.1%
보조간선도로 144
 
2.8%
자치단체청사 108
 
2.1%
주간선도로 106
 
2.1%
초등학교 102
 
2.0%
Other values (57) 736
 
14.4%
2023-12-12T19:13:31.364590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3281
15.0%
3274
15.0%
3167
14.5%
2316
 
10.6%
1106
 
5.1%
604
 
2.8%
488
 
2.2%
486
 
2.2%
485
 
2.2%
485
 
2.2%
Other values (100) 6116
28.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21793
99.9%
Close Punctuation 7
 
< 0.1%
Open Punctuation 7
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3281
15.1%
3274
15.0%
3167
14.5%
2316
 
10.6%
1106
 
5.1%
604
 
2.8%
488
 
2.2%
486
 
2.2%
485
 
2.2%
485
 
2.2%
Other values (97) 6101
28.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21793
99.9%
Common 15
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3281
15.1%
3274
15.0%
3167
14.5%
2316
 
10.6%
1106
 
5.1%
604
 
2.8%
488
 
2.2%
486
 
2.2%
485
 
2.2%
485
 
2.2%
Other values (97) 6101
28.0%
Common
ValueCountFrequency (%)
) 7
46.7%
( 7
46.7%
1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21793
99.9%
ASCII 15
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3281
15.1%
3274
15.0%
3167
14.5%
2316
 
10.6%
1106
 
5.1%
604
 
2.8%
488
 
2.2%
486
 
2.2%
485
 
2.2%
485
 
2.2%
Other values (97) 6101
28.0%
ASCII
ValueCountFrequency (%)
) 7
46.7%
( 7
46.7%
1
 
6.7%
Distinct5261
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size41.9 KiB
2023-12-12T19:13:31.625997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length10.079199
Min length3

Characters and Unicode

Total characters53833
Distinct characters316
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5221 ?
Unique (%)97.8%

Sample

1st row철도9(매교역)
2nd row철도8(시청역)
3rd row철도7(매탄역)
4th row철도6(방죽역)
5th row철도5(영통역)
ValueCountFrequency (%)
소매시장 23
 
0.4%
11
 
0.2%
10
 
0.2%
6
 
0.1%
배수지 5
 
0.1%
전기공급설비 5
 
0.1%
5
 
0.1%
4
 
0.1%
4
 
0.1%
cablehead부지 4
 
0.1%
Other values (5294) 5357
98.6%
2023-12-12T19:13:32.114379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 5234
 
9.7%
1 3978
 
7.4%
3437
 
6.4%
2 3419
 
6.4%
3331
 
6.2%
3290
 
6.1%
3270
 
6.1%
2552
 
4.7%
3 2482
 
4.6%
1350
 
2.5%
Other values (306) 21490
39.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25197
46.8%
Decimal Number 17744
33.0%
Dash Punctuation 5234
 
9.7%
Space Separator 3331
 
6.2%
Close Punctuation 1128
 
2.1%
Open Punctuation 1127
 
2.1%
Uppercase Letter 51
 
0.1%
Other Punctuation 20
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3437
13.6%
3290
13.1%
3270
13.0%
2552
 
10.1%
1350
 
5.4%
933
 
3.7%
766
 
3.0%
751
 
3.0%
604
 
2.4%
545
 
2.2%
Other values (278) 7699
30.6%
Decimal Number
ValueCountFrequency (%)
1 3978
22.4%
2 3419
19.3%
3 2482
14.0%
4 1338
 
7.5%
5 1135
 
6.4%
9 1108
 
6.2%
0 1086
 
6.1%
6 1076
 
6.1%
7 1072
 
6.0%
8 1050
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 8
15.7%
B 8
15.7%
E 8
15.7%
C 5
9.8%
D 5
9.8%
S 5
9.8%
H 4
7.8%
L 4
7.8%
K 3
 
5.9%
R 1
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 17
85.0%
. 2
 
10.0%
& 1
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 5234
100.0%
Space Separator
ValueCountFrequency (%)
3331
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1128
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1127
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 28585
53.1%
Hangul 25197
46.8%
Latin 51
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3437
13.6%
3290
13.1%
3270
13.0%
2552
 
10.1%
1350
 
5.4%
933
 
3.7%
766
 
3.0%
751
 
3.0%
604
 
2.4%
545
 
2.2%
Other values (278) 7699
30.6%
Common
ValueCountFrequency (%)
- 5234
18.3%
1 3978
13.9%
2 3419
12.0%
3331
11.7%
3 2482
8.7%
4 1338
 
4.7%
5 1135
 
4.0%
) 1128
 
3.9%
( 1127
 
3.9%
9 1108
 
3.9%
Other values (8) 4305
15.1%
Latin
ValueCountFrequency (%)
A 8
15.7%
B 8
15.7%
E 8
15.7%
C 5
9.8%
D 5
9.8%
S 5
9.8%
H 4
7.8%
L 4
7.8%
K 3
 
5.9%
R 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 28636
53.2%
Hangul 25197
46.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 5234
18.3%
1 3978
13.9%
2 3419
11.9%
3331
11.6%
3 2482
8.7%
4 1338
 
4.7%
5 1135
 
4.0%
) 1128
 
3.9%
( 1127
 
3.9%
9 1108
 
3.9%
Other values (18) 4356
15.2%
Hangul
ValueCountFrequency (%)
3437
13.6%
3290
13.1%
3270
13.0%
2552
 
10.1%
1350
 
5.4%
933
 
3.7%
766
 
3.0%
751
 
3.0%
604
 
2.4%
545
 
2.2%
Other values (278) 7699
30.6%
Distinct399
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Memory size41.9 KiB
Minimum1969-06-11 00:00:00
Maximum2021-11-18 00:00:00
2023-12-12T19:13:32.301209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:13:32.452703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct436
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size41.9 KiB
2023-12-12T19:13:32.818581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length14.171129
Min length3

Characters and Unicode

Total characters75688
Distinct characters36
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique251 ?
Unique (%)4.7%

Sample

1st row국토교통부고시제2014-238호
2nd row국토교통부고시제2014-238호
3rd row국토교통부고시제2014-238호
4th row수원시고시제2011-61호
5th row수원시고시제2011-61호
ValueCountFrequency (%)
수원시고시제2011-61호 3466
64.6%
수원시고시제2009-106호 182
 
3.4%
국토해양부고시제2011-609호 152
 
2.8%
확인중 152
 
2.8%
국토교통부고시제2014-373호 119
 
2.2%
국토해양부고시제2011-962호 96
 
1.8%
국토해양부고시제2011-869호 92
 
1.7%
수원시고시제2014-117호 62
 
1.2%
수원시고시제2011-243호 25
 
0.5%
수원시고시제2011-201호 23
 
0.4%
Other values (428) 995
 
18.5%
2023-12-12T19:13:33.334301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 13077
17.3%
9791
12.9%
0 6003
7.9%
2 5959
7.9%
- 5190
 
6.9%
5187
 
6.9%
5184
 
6.8%
5181
 
6.8%
4607
 
6.1%
4607
 
6.1%
Other values (26) 10902
14.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37867
50.0%
Decimal Number 32607
43.1%
Dash Punctuation 5190
 
6.9%
Space Separator 23
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9791
25.9%
5187
13.7%
5184
13.7%
5181
13.7%
4607
12.2%
4607
12.2%
555
 
1.5%
552
 
1.5%
552
 
1.5%
363
 
1.0%
Other values (13) 1288
 
3.4%
Decimal Number
ValueCountFrequency (%)
1 13077
40.1%
0 6003
18.4%
2 5959
18.3%
6 4225
 
13.0%
9 831
 
2.5%
3 699
 
2.1%
7 553
 
1.7%
4 532
 
1.6%
8 414
 
1.3%
5 314
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 5190
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Other Punctuation
ValueCountFrequency (%)
? 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37867
50.0%
Common 37821
50.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9791
25.9%
5187
13.7%
5184
13.7%
5181
13.7%
4607
12.2%
4607
12.2%
555
 
1.5%
552
 
1.5%
552
 
1.5%
363
 
1.0%
Other values (13) 1288
 
3.4%
Common
ValueCountFrequency (%)
1 13077
34.6%
0 6003
15.9%
2 5959
15.8%
- 5190
 
13.7%
6 4225
 
11.2%
9 831
 
2.2%
3 699
 
1.8%
7 553
 
1.5%
4 532
 
1.4%
8 414
 
1.1%
Other values (3) 338
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37867
50.0%
ASCII 37821
50.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 13077
34.6%
0 6003
15.9%
2 5959
15.8%
- 5190
 
13.7%
6 4225
 
11.2%
9 831
 
2.2%
3 699
 
1.8%
7 553
 
1.5%
4 532
 
1.4%
8 414
 
1.1%
Other values (3) 338
 
0.9%
Hangul
ValueCountFrequency (%)
9791
25.9%
5187
13.7%
5184
13.7%
5181
13.7%
4607
12.2%
4607
12.2%
555
 
1.5%
552
 
1.5%
552
 
1.5%
363
 
1.0%
Other values (13) 1288
 
3.4%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size41.9 KiB
2023-08-09
5341 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-09
2nd row2023-08-09
3rd row2023-08-09
4th row2023-08-09
5th row2023-08-09

Common Values

ValueCountFrequency (%)
2023-08-09 5341
100.0%

Length

2023-12-12T19:13:33.493918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:13:33.604991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-09 5341
100.0%

Correlations

2023-12-12T19:13:33.683503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설종류시설상세
시설종류1.0000.999
시설상세0.9991.000

Missing values

2023-12-12T19:13:30.186390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:13:30.339337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군구명시설종류시설상세시설명최종변경일최종변경고시번호데이터기준일자
0수원시일반철도<NA>철도9(매교역)2014-05-08국토교통부고시제2014-238호2023-08-09
1수원시일반철도<NA>철도8(시청역)2014-05-08국토교통부고시제2014-238호2023-08-09
2수원시일반철도<NA>철도7(매탄역)2014-05-08국토교통부고시제2014-238호2023-08-09
3수원시일반철도<NA>철도6(방죽역)2011-05-16수원시고시제2011-61호2023-08-09
4수원시일반철도<NA>철도5(영통역)2011-05-16수원시고시제2011-61호2023-08-09
5수원시일반철도<NA>(광교지구)철도4(신분당 연장선 역사부지)2011-10-27국토해양부고시제2011-609호2023-08-09
6수원시일반철도<NA>철도4(영덕역)2011-05-16수원시고시제2011-61호2023-08-09
7수원시일반철도<NA>철도3(신분당선 연장)2014-05-08국토교통부고시제2014-238호2023-08-09
8수원시일반철도<NA>철도10(일반철도-수인선)2011-05-16수원시고시제2011-61호2023-08-09
9수원시일반철도<NA>철도2(수인선)2014-09-23국토교통부고시제2014-554호2023-08-09
시군구명시설종류시설상세시설명최종변경일최종변경고시번호데이터기준일자
5331수원시하수도공공하수처리시설황구지천공공하수처리시설(하수도시설4)2019-06-07수원시고시제2019-179호2023-08-09
5332수원시폐기물처리 및 재활용시설최종처분시설쓰레기적환장(폐기물처리시설8)2017-08-29수원시고시제2017-236호2023-08-09
5333수원시폐기물처리 및 재활용시설최종처분시설쓰레기소각장(폐기물처리시설2)2011-05-16수원시고시제2011-61호2023-08-09
5334수원시폐기물처리 및 재활용시설최종처분시설음식물쓰레기처리시설(폐기물처리시설7)2018-05-28수원시고시제2018-153호2023-08-09
5335수원시폐기물처리 및 재활용시설최종처분시설쓰레기적환장(폐기물처리시설8)2011-05-16수원시고시제2011-61호2023-08-09
5336수원시폐기물처리 및 재활용시설재활용시설재활용품보관창고(폐기물처리시설3)2011-05-16수원시고시제2011-61호2023-08-09
5337수원시폐기물처리 및 재활용시설건설폐기물시설건설폐기물처리시설(폐기물처리시설5)2011-05-16수원시고시제2011-61호2023-08-09
5338수원시폐기물처리 및 재활용시설건설폐기물시설건설폐기물 처리시설(폐기물처리시설6)2011-05-16수원시고시제2011-61호2023-08-09
5339수원시수질오염방지시설공공폐수처리시설폐수종말처리장(수질오염방지시설9)2011-05-16수원시고시제2011-61호2023-08-09
5340수원시수질오염방지시설분뇨처리시설수원분뇨처리장(수질오염방지시설1)2013-06-05수원시고시제2013-159호2023-08-09

Duplicate rows

Most frequently occurring

시군구명시설종류시설상세시설명최종변경일최종변경고시번호데이터기준일자# duplicates
3수원시시장대규모점포및임시시장소매시장2011-05-16수원시고시제2011-61호2023-08-0922
7수원시전기공급설비배전사업소전기공급설비2011-05-16수원시고시제2011-61호2023-08-095
10수원시전기공급설비송전선로송전시설2011-05-16수원시고시제2011-61호2023-08-094
1수원시수도공급설비배수시설배수지2011-05-16수원시고시제2011-61호2023-08-093
8수원시전기공급설비송전선로CABLEHEAD부지2011-05-16수원시고시제2011-61호2023-08-093
9수원시전기공급설비송전선로송전선로2011-05-16수원시고시제2011-61호2023-08-093
11수원시전기공급설비송전선로송전철탑2011-05-16수원시고시제2011-61호2023-08-093
0수원시노외주차장<NA>주차장-1342011-05-16수원시고시제2011-61호2023-08-092
2수원시수도공급설비배수시설배수지2011-12-29국토해양부고시제2011-869호2023-08-092
4수원시일반도로국지도로소로2-1399호선2018-07-16수원시고시제2018-215호2023-08-092