Overview

Dataset statistics

Number of variables5
Number of observations93
Missing cells3
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory41.4 B

Variable types

Categorical2
Text3

Dataset

Description대전광역시 유성구 관내에 있는 특정토양오염관리대상시설에 현황으로 시도명,시군구명, 상호명, 소재지도로명주소, 소재지지번주소 등의 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15118310/fileData.do

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
소재지도로명주소 has 3 (3.2%) missing valuesMissing

Reproduction

Analysis started2023-12-12 19:41:22.743094
Analysis finished2023-12-12 19:41:23.853441
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size876.0 B
대전광역시
93 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대전광역시
2nd row대전광역시
3rd row대전광역시
4th row대전광역시
5th row대전광역시

Common Values

ValueCountFrequency (%)
대전광역시 93
100.0%

Length

2023-12-13T04:41:23.937497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:41:24.046122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대전광역시 93
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size876.0 B
유성구
93 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유성구
2nd row유성구
3rd row유성구
4th row유성구
5th row유성구

Common Values

ValueCountFrequency (%)
유성구 93
100.0%

Length

2023-12-13T04:41:24.151361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:41:24.245664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유성구 93
100.0%
Distinct92
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size876.0 B
2023-12-13T04:41:24.439066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length9.2150538
Min length4

Characters and Unicode

Total characters857
Distinct characters175
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique91 ?
Unique (%)97.8%

Sample

1st row한전원자력연료주식회사
2nd row위성주유소
3rd row현대오일뱅크㈜ 진잠셀프주유소
4th row㈜구암상사주유소
5th row의료법인성전의료재단 신생병원
ValueCountFrequency (%)
현대오일뱅크㈜ 3
 
2.7%
육군정보통신학교 2
 
1.8%
현대오일뱅크㈜직영 2
 
1.8%
sk에너지㈜ 2
 
1.8%
국립대전현충원 1
 
0.9%
지에스칼텍스㈜정다운주유소 1
 
0.9%
제7867부대 1
 
0.9%
육군 1
 
0.9%
제이에너지㈜ 1
 
0.9%
광장석유 1
 
0.9%
Other values (95) 95
86.4%
2023-12-13T04:41:25.117052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
63
 
7.4%
58
 
6.8%
58
 
6.8%
41
 
4.8%
32
 
3.7%
23
 
2.7%
22
 
2.6%
19
 
2.2%
17
 
2.0%
14
 
1.6%
Other values (165) 510
59.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 749
87.4%
Other Symbol 41
 
4.8%
Uppercase Letter 25
 
2.9%
Decimal Number 18
 
2.1%
Space Separator 17
 
2.0%
Lowercase Letter 3
 
0.4%
Close Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
 
8.4%
58
 
7.7%
58
 
7.7%
32
 
4.3%
23
 
3.1%
22
 
2.9%
19
 
2.5%
14
 
1.9%
12
 
1.6%
12
 
1.6%
Other values (143) 436
58.2%
Decimal Number
ValueCountFrequency (%)
1 3
16.7%
3 3
16.7%
7 2
11.1%
9 2
11.1%
6 2
11.1%
5 2
11.1%
0 2
11.1%
8 1
 
5.6%
2 1
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
S 10
40.0%
K 8
32.0%
I 2
 
8.0%
C 2
 
8.0%
G 2
 
8.0%
H 1
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
l 1
33.3%
e 1
33.3%
f 1
33.3%
Other Symbol
ValueCountFrequency (%)
41
100.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 790
92.2%
Common 39
 
4.6%
Latin 28
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
63
 
8.0%
58
 
7.3%
58
 
7.3%
41
 
5.2%
32
 
4.1%
23
 
2.9%
22
 
2.8%
19
 
2.4%
14
 
1.8%
12
 
1.5%
Other values (144) 448
56.7%
Common
ValueCountFrequency (%)
17
43.6%
1 3
 
7.7%
3 3
 
7.7%
) 2
 
5.1%
7 2
 
5.1%
9 2
 
5.1%
6 2
 
5.1%
( 2
 
5.1%
5 2
 
5.1%
0 2
 
5.1%
Other values (2) 2
 
5.1%
Latin
ValueCountFrequency (%)
S 10
35.7%
K 8
28.6%
I 2
 
7.1%
C 2
 
7.1%
G 2
 
7.1%
l 1
 
3.6%
e 1
 
3.6%
H 1
 
3.6%
f 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 749
87.4%
ASCII 67
 
7.8%
None 41
 
4.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
63
 
8.4%
58
 
7.7%
58
 
7.7%
32
 
4.3%
23
 
3.1%
22
 
2.9%
19
 
2.5%
14
 
1.9%
12
 
1.6%
12
 
1.6%
Other values (143) 436
58.2%
None
ValueCountFrequency (%)
41
100.0%
ASCII
ValueCountFrequency (%)
17
25.4%
S 10
14.9%
K 8
11.9%
1 3
 
4.5%
3 3
 
4.5%
) 2
 
3.0%
7 2
 
3.0%
9 2
 
3.0%
6 2
 
3.0%
( 2
 
3.0%
Other values (11) 16
23.9%
Distinct86
Distinct (%)95.6%
Missing3
Missing (%)3.2%
Memory size876.0 B
2023-12-13T04:41:25.375342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length30
Mean length23.833333
Min length20

Characters and Unicode

Total characters2145
Distinct characters111
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)92.2%

Sample

1st row대전광역시 유성구 대덕대로989번길 242(덕진동)
2nd row대전광역시 유성구 현충원로 489(장대동)
3rd row대전광역시 유성구 계백로 899(원내동)
4th row대전광역시 유성구 유성대로654번길 32(구암동)
5th row대전광역시 유성구 진잠옛로135번길 87(학하동)
ValueCountFrequency (%)
대전광역시 90
24.7%
유성구 90
24.7%
현충원로 10
 
2.7%
유성대로 9
 
2.5%
엑스포로 8
 
2.2%
대덕대로 7
 
1.9%
계백로 7
 
1.9%
북유성대로 4
 
1.1%
사서함 3
 
0.8%
가정북로 3
 
0.8%
Other values (120) 134
36.7%
2023-12-13T04:41:25.777033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
275
 
12.8%
141
 
6.6%
115
 
5.4%
110
 
5.1%
98
 
4.6%
91
 
4.2%
91
 
4.2%
90
 
4.2%
90
 
4.2%
90
 
4.2%
Other values (101) 954
44.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1369
63.8%
Decimal Number 319
 
14.9%
Space Separator 275
 
12.8%
Open Punctuation 86
 
4.0%
Close Punctuation 86
 
4.0%
Dash Punctuation 9
 
0.4%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
141
 
10.3%
115
 
8.4%
110
 
8.0%
98
 
7.2%
91
 
6.6%
91
 
6.6%
90
 
6.6%
90
 
6.6%
90
 
6.6%
87
 
6.4%
Other values (86) 366
26.7%
Decimal Number
ValueCountFrequency (%)
1 54
16.9%
2 45
14.1%
5 38
11.9%
8 37
11.6%
9 30
9.4%
3 29
9.1%
4 29
9.1%
7 23
7.2%
6 22
6.9%
0 12
 
3.8%
Space Separator
ValueCountFrequency (%)
275
100.0%
Open Punctuation
ValueCountFrequency (%)
( 86
100.0%
Close Punctuation
ValueCountFrequency (%)
) 86
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1369
63.8%
Common 776
36.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
141
 
10.3%
115
 
8.4%
110
 
8.0%
98
 
7.2%
91
 
6.6%
91
 
6.6%
90
 
6.6%
90
 
6.6%
90
 
6.6%
87
 
6.4%
Other values (86) 366
26.7%
Common
ValueCountFrequency (%)
275
35.4%
( 86
 
11.1%
) 86
 
11.1%
1 54
 
7.0%
2 45
 
5.8%
5 38
 
4.9%
8 37
 
4.8%
9 30
 
3.9%
3 29
 
3.7%
4 29
 
3.7%
Other values (5) 67
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1369
63.8%
ASCII 776
36.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
275
35.4%
( 86
 
11.1%
) 86
 
11.1%
1 54
 
7.0%
2 45
 
5.8%
5 38
 
4.9%
8 37
 
4.8%
9 30
 
3.9%
3 29
 
3.7%
4 29
 
3.7%
Other values (5) 67
 
8.6%
Hangul
ValueCountFrequency (%)
141
 
10.3%
115
 
8.4%
110
 
8.0%
98
 
7.2%
91
 
6.6%
91
 
6.6%
90
 
6.6%
90
 
6.6%
90
 
6.6%
87
 
6.4%
Other values (86) 366
26.7%
Distinct88
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size876.0 B
2023-12-13T04:41:26.123304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length25
Mean length18.322581
Min length6

Characters and Unicode

Total characters1704
Distinct characters76
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)91.4%

Sample

1st row대전광역시 유성구 덕진동 493
2nd row대전광역시 유성구 장대동 224-14
3rd row대전광역시 유성구 원내동 402-7
4th row대전광역시 유성구 구암동 624-2
5th row대전광역시 유성구 학하동 682
ValueCountFrequency (%)
대전광역시 93
25.1%
유성구 90
24.3%
원촌동 8
 
2.2%
장대동 8
 
2.2%
추목동 7
 
1.9%
사서함 6
 
1.6%
구암동 5
 
1.3%
화암동 5
 
1.3%
신성동 4
 
1.1%
원내동 4
 
1.1%
Other values (118) 141
38.0%
2023-12-13T04:41:26.594265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
282
16.5%
101
 
5.9%
95
 
5.6%
94
 
5.5%
94
 
5.5%
93
 
5.5%
93
 
5.5%
93
 
5.5%
91
 
5.3%
90
 
5.3%
Other values (66) 578
33.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1020
59.9%
Decimal Number 335
 
19.7%
Space Separator 282
 
16.5%
Dash Punctuation 67
 
3.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
9.9%
95
9.3%
94
9.2%
94
9.2%
93
9.1%
93
9.1%
93
9.1%
91
8.9%
90
8.8%
12
 
1.2%
Other values (54) 164
16.1%
Decimal Number
ValueCountFrequency (%)
1 71
21.2%
2 41
12.2%
4 40
11.9%
3 38
11.3%
5 30
9.0%
7 26
 
7.8%
6 24
 
7.2%
0 24
 
7.2%
8 22
 
6.6%
9 19
 
5.7%
Space Separator
ValueCountFrequency (%)
282
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 67
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1020
59.9%
Common 684
40.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
9.9%
95
9.3%
94
9.2%
94
9.2%
93
9.1%
93
9.1%
93
9.1%
91
8.9%
90
8.8%
12
 
1.2%
Other values (54) 164
16.1%
Common
ValueCountFrequency (%)
282
41.2%
1 71
 
10.4%
- 67
 
9.8%
2 41
 
6.0%
4 40
 
5.8%
3 38
 
5.6%
5 30
 
4.4%
7 26
 
3.8%
6 24
 
3.5%
0 24
 
3.5%
Other values (2) 41
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1020
59.9%
ASCII 684
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
282
41.2%
1 71
 
10.4%
- 67
 
9.8%
2 41
 
6.0%
4 40
 
5.8%
3 38
 
5.6%
5 30
 
4.4%
7 26
 
3.8%
6 24
 
3.5%
0 24
 
3.5%
Other values (2) 41
 
6.0%
Hangul
ValueCountFrequency (%)
101
9.9%
95
9.3%
94
9.2%
94
9.2%
93
9.1%
93
9.1%
93
9.1%
91
8.9%
90
8.8%
12
 
1.2%
Other values (54) 164
16.1%

Correlations

2023-12-13T04:41:26.712491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상호명소재지도로명주소소재지지번주소
상호명1.0000.9980.998
소재지도로명주소0.9981.0000.999
소재지지번주소0.9980.9991.000

Missing values

2023-12-13T04:41:23.685274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:41:23.808058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명상호명소재지도로명주소소재지지번주소
0대전광역시유성구한전원자력연료주식회사대전광역시 유성구 대덕대로989번길 242(덕진동)대전광역시 유성구 덕진동 493
1대전광역시유성구위성주유소대전광역시 유성구 현충원로 489(장대동)대전광역시 유성구 장대동 224-14
2대전광역시유성구현대오일뱅크㈜ 진잠셀프주유소대전광역시 유성구 계백로 899(원내동)대전광역시 유성구 원내동 402-7
3대전광역시유성구㈜구암상사주유소대전광역시 유성구 유성대로654번길 32(구암동)대전광역시 유성구 구암동 624-2
4대전광역시유성구의료법인성전의료재단 신생병원대전광역시 유성구 진잠옛로135번길 87(학하동)대전광역시 유성구 학하동 682
5대전광역시유성구대전광역시시립정신병원대전광역시 유성구 진잠옛로135번길 87(학하동)대전광역시 유성구 학하동 682
6대전광역시유성구한국항공우주연구원대전광역시 유성구 과학로 169-84(어은동), 인프라종합지원실대전광역시 유성구 어은동 45
7대전광역시유성구한밭대로주유소대전광역시 유성구 한밭대로 305(장대동)대전광역시 유성구 장대동 38-7
8대전광역시유성구㈜유성하나로주유소대전광역시 유성구 한밭대로 295(장대동)대전광역시 유성구 장대동 20-21
9대전광역시유성구㈜상익 유성주유소대전광역시 유성구 현충원로 342(구암동)대전광역시 유성구 구암동 672
시도명시군구명상호명소재지도로명주소소재지지번주소
83대전광역시유성구㈜이맥솔루션대전광역시 유성구 유성대로1184번길 38(신성동)대전광역시 유성구 신성동 490
84대전광역시유성구롯데마트 서대전점 주유소대전광역시 유성구 유성대로 26-37(원내동)대전광역시 유성구 원내동 33-13
85대전광역시유성구자운대근무지원단<NA>대전광역시 유성구 추목동 사서함 78-24호
86대전광역시유성구한국과학기술정보연구원대전광역시 유성구 대학로 245(어은동)대전광역시 유성구 어은동 52-11
87대전광역시유성구㈜한수도로산업대전광역시 유성구 유성대로1205번길 6-20(자운동)대전광역시 유성구 자운동 556
88대전광역시유성구예성주유소대전광역시 유성구 도안동로 488(봉명동)대전광역시 유성구 봉명동 1040-2
89대전광역시유성구지에스칼텍스㈜대덕밸리주유소대전광역시 유성구 대덕대로 938(화암동)대전광역시 유성구 화암동 119-2
90대전광역시유성구신성주유소대전광역시 유성구 가정북로 163(신성동)대전광역시 유성구 신성동 479
91대전광역시유성구한밭대앞Self주유소대전광역시 유성구 학하서로 111(덕명동)대전광역시 유성구 덕명동 613-1
92대전광역시유성구육군정보통신학교대전광역시 유성구 자운로97번길 265(신봉동)대전광역시 유성구 추목동 사서함 78-301호