Overview

Dataset statistics

Number of variables6
Number of observations80
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory50.6 B

Variable types

Text2
DateTime3
Categorical1

Dataset

Description경상남도 양산시의 토양오염도 검사 대상 현황에 대한 정보로 상호명, 도로명주소, 소방법상 완공검사일, 토양오염도검사연도, 토양오염도 검사 예정시작일, 토양오염도 검사 예정종료일 의 항목을 제공합니다.
Author경상남도 양산시
URLhttps://www.data.go.kr/data/15105438/fileData.do

Alerts

토양오염도검사연도 has constant value ""Constant
상호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:13:38.404645
Analysis finished2023-12-12 12:13:39.406537
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

UNIQUE 

Distinct80
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size772.0 B
2023-12-12T21:13:39.626557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length17
Mean length8.95
Min length4

Characters and Unicode

Total characters716
Distinct characters161
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)100.0%

Sample

1st row(주)화인에너지 호포주유소
2nd row극동유화(주) 제1공장
3rd row지에스칼텍스정유(주)부성주유소
4th row비케이에너지(주) 석산주유소
5th row남양산IC주유소
ValueCountFrequency (%)
극동유화(주 2
 
2.1%
동아타이어공업(주 2
 
2.1%
주)화인에너지 1
 
1.0%
광신리소스(주 1
 
1.0%
성광주유소 1
 
1.0%
세종석유(주 1
 
1.0%
경성화학공업(주 1
 
1.0%
한영세탁재료상사 1
 
1.0%
주)기센에너지 1
 
1.0%
동헌산업(주 1
 
1.0%
Other values (84) 84
87.5%
2023-12-12T21:13:40.117868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
86
 
12.0%
) 55
 
7.7%
( 54
 
7.5%
43
 
6.0%
35
 
4.9%
20
 
2.8%
19
 
2.7%
14
 
2.0%
12
 
1.7%
12
 
1.7%
Other values (151) 366
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 568
79.3%
Close Punctuation 55
 
7.7%
Open Punctuation 54
 
7.5%
Space Separator 19
 
2.7%
Uppercase Letter 17
 
2.4%
Decimal Number 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
86
 
15.1%
43
 
7.6%
35
 
6.2%
20
 
3.5%
14
 
2.5%
12
 
2.1%
12
 
2.1%
12
 
2.1%
10
 
1.8%
9
 
1.6%
Other values (135) 315
55.5%
Uppercase Letter
ValueCountFrequency (%)
I 3
17.6%
C 3
17.6%
D 2
11.8%
T 2
11.8%
G 1
 
5.9%
A 1
 
5.9%
P 1
 
5.9%
R 1
 
5.9%
S 1
 
5.9%
O 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 55
100.0%
Open Punctuation
ValueCountFrequency (%)
( 54
100.0%
Space Separator
ValueCountFrequency (%)
19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 568
79.3%
Common 131
 
18.3%
Latin 17
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
86
 
15.1%
43
 
7.6%
35
 
6.2%
20
 
3.5%
14
 
2.5%
12
 
2.1%
12
 
2.1%
12
 
2.1%
10
 
1.8%
9
 
1.6%
Other values (135) 315
55.5%
Latin
ValueCountFrequency (%)
I 3
17.6%
C 3
17.6%
D 2
11.8%
T 2
11.8%
G 1
 
5.9%
A 1
 
5.9%
P 1
 
5.9%
R 1
 
5.9%
S 1
 
5.9%
O 1
 
5.9%
Common
ValueCountFrequency (%)
) 55
42.0%
( 54
41.2%
19
 
14.5%
2 2
 
1.5%
1 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 568
79.3%
ASCII 148
 
20.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
86
 
15.1%
43
 
7.6%
35
 
6.2%
20
 
3.5%
14
 
2.5%
12
 
2.1%
12
 
2.1%
12
 
2.1%
10
 
1.8%
9
 
1.6%
Other values (135) 315
55.5%
ASCII
ValueCountFrequency (%)
) 55
37.2%
( 54
36.5%
19
 
12.8%
I 3
 
2.0%
C 3
 
2.0%
2 2
 
1.4%
D 2
 
1.4%
T 2
 
1.4%
G 1
 
0.7%
1 1
 
0.7%
Other values (6) 6
 
4.1%
Distinct79
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size772.0 B
2023-12-12T21:13:40.454758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24.5
Mean length20.75
Min length15

Characters and Unicode

Total characters1660
Distinct characters87
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique78 ?
Unique (%)97.5%

Sample

1st row경상남도 양산시 동면 양산대로 21
2nd row경상남도 양산시 양산시 어실로 101(유산동)
3rd row경상남도 양산시 양산대로 1024
4th row경상남도 양산시 동면 양산대로 602
5th row경상남도 양산시 동면 양산대로 538
ValueCountFrequency (%)
양산시 81
23.7%
경상남도 80
23.4%
양산대로 18
 
5.3%
상북면 9
 
2.6%
하북면 8
 
2.3%
동면 6
 
1.8%
어실로 4
 
1.2%
웅상대로 3
 
0.9%
소주공단2길 2
 
0.6%
78 2
 
0.6%
Other values (123) 129
37.7%
2023-12-12T21:13:40.972464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
269
16.2%
126
 
7.6%
99
 
6.0%
93
 
5.6%
87
 
5.2%
81
 
4.9%
80
 
4.8%
80
 
4.8%
1 58
 
3.5%
41
 
2.5%
Other values (77) 646
38.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1062
64.0%
Space Separator 269
 
16.2%
Decimal Number 258
 
15.5%
Open Punctuation 31
 
1.9%
Close Punctuation 31
 
1.9%
Dash Punctuation 9
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
126
11.9%
99
 
9.3%
93
 
8.8%
87
 
8.2%
81
 
7.6%
80
 
7.5%
80
 
7.5%
41
 
3.9%
41
 
3.9%
39
 
3.7%
Other values (63) 295
27.8%
Decimal Number
ValueCountFrequency (%)
1 58
22.5%
2 39
15.1%
3 32
12.4%
5 27
10.5%
0 25
9.7%
7 18
 
7.0%
4 16
 
6.2%
9 15
 
5.8%
8 15
 
5.8%
6 13
 
5.0%
Space Separator
ValueCountFrequency (%)
269
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1062
64.0%
Common 598
36.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
126
11.9%
99
 
9.3%
93
 
8.8%
87
 
8.2%
81
 
7.6%
80
 
7.5%
80
 
7.5%
41
 
3.9%
41
 
3.9%
39
 
3.7%
Other values (63) 295
27.8%
Common
ValueCountFrequency (%)
269
45.0%
1 58
 
9.7%
2 39
 
6.5%
3 32
 
5.4%
( 31
 
5.2%
) 31
 
5.2%
5 27
 
4.5%
0 25
 
4.2%
7 18
 
3.0%
4 16
 
2.7%
Other values (4) 52
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1062
64.0%
ASCII 598
36.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
269
45.0%
1 58
 
9.7%
2 39
 
6.5%
3 32
 
5.4%
( 31
 
5.2%
) 31
 
5.2%
5 27
 
4.5%
0 25
 
4.2%
7 18
 
3.0%
4 16
 
2.7%
Other values (4) 52
 
8.7%
Hangul
ValueCountFrequency (%)
126
11.9%
99
 
9.3%
93
 
8.8%
87
 
8.2%
81
 
7.6%
80
 
7.5%
80
 
7.5%
41
 
3.9%
41
 
3.9%
39
 
3.7%
Other values (63) 295
27.8%
Distinct77
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
Minimum1981-12-29 00:00:00
Maximum2017-09-06 00:00:00
2023-12-12T21:13:41.133271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:13:41.286806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

토양오염도검사연도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
2022
80 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 80
100.0%

Length

2023-12-12T21:13:41.445233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:13:41.558788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 80
100.0%
Distinct72
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Memory size772.0 B
Minimum2021-07-18 00:00:00
Maximum2022-12-31 00:00:00
2023-12-12T21:13:41.690101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:13:41.892749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct73
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size772.0 B
Minimum2021-10-16 00:00:00
Maximum2023-03-31 00:00:00
2023-12-12T21:13:42.337444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:13:42.531916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Correlations

2023-12-12T21:13:42.674657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상호도로명주소소방법상 완공검사일토양오염도 검사 예정시작일토양오염도 검사 예정종료일
상호1.0001.0001.0001.0001.000
도로명주소1.0001.0001.0001.0001.000
소방법상 완공검사일1.0001.0001.0001.0000.999
토양오염도 검사 예정시작일1.0001.0001.0001.0000.998
토양오염도 검사 예정종료일1.0001.0000.9990.9981.000

Missing values

2023-12-12T21:13:39.193148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:13:39.341982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호도로명주소소방법상 완공검사일토양오염도검사연도토양오염도 검사 예정시작일토양오염도 검사 예정종료일
0(주)화인에너지 호포주유소경상남도 양산시 동면 양산대로 211995-12-0820222021-12-082022-03-08
1극동유화(주) 제1공장경상남도 양산시 양산시 어실로 101(유산동)1985-07-1920222022-07-192022-10-17
2지에스칼텍스정유(주)부성주유소경상남도 양산시 양산대로 10242004-12-3120222021-12-312022-03-31
3비케이에너지(주) 석산주유소경상남도 양산시 동면 양산대로 6021986-11-0620222021-11-062022-02-04
4남양산IC주유소경상남도 양산시 동면 양산대로 5381987-01-1520222022-01-152022-04-15
5영신주유소경상남도 양산시 하북면 양산대로 19371995-12-0120222021-12-012022-03-01
6(주)DRB동일경상남도 양산시 산막공단북2길 392002-11-1920222021-11-192022-02-17
7넥센타이어(주)경상남도 양산시 충렬로 355(유산동)1985-11-1320222022-11-132023-02-11
8(주)동일리조트)경상남도 양산시 하북면 신평남부길78-1301983-12-2920222022-12-292023-03-29
9새인산업(주)경상남도 양산시 명곡로 217(신기동)2012-12-2820222022-12-282023-03-28
상호도로명주소소방법상 완공검사일토양오염도검사연도토양오염도 검사 예정시작일토양오염도 검사 예정종료일
70대진주유소경상남도 양산시 웅상대로 7451997-07-2620222022-07-262022-10-23
71(주)지석산업경상남도 양산시 소주공단2길 782003-09-1920222022-09-192022-12-17
72(주)진보아스콘경상남도 양산시 소주공단2길 782003-09-1920222022-09-192022-12-17
73송학제지(주)경상남도 양산시 주남로 501981-12-2920222022-12-292023-03-28
74(주)부일산업개발경상남도 양산시 신덕계1길 242012-03-0220222022-03-022022-05-30
75신우씨앤씨경상남도 양산시 소주공단5길732003-09-0520222022-09-052022-12-03
76비엔철강(주)케미칼경상남도 양산시 소주공단1길63(주남동)2012-04-3020222022-04-302022-07-28
77진광(주)경상남도 양산시 그린공단3길 25-35(매곡동)2017-09-0620222022-09-062022-12-04
78도림통산(주)경상남도 양산시 웅상농공단지길 481999-07-2820222022-07-282022-10-25
79스피드케미칼(주)경상남도 양산시 장기터1길 382007-11-0520222022-11-052023-02-02