Overview

Dataset statistics

Number of variables: 7
Number of observations: 35
Missing cells: 16
Missing cells (%): 6.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 KiB
Average record size in memory: 61.8 B
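
The missing-cell percentage follows from the counts above. A minimal sketch, assuming the percentage is the number of missing cells divided by variables times observations (the function name is illustrative and not part of ydata-profiling):

```python
def missing_cell_percentage(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Share of empty cells in the variables x observations grid, in percent."""
    total_cells = n_variables * n_observations
    return 100.0 * missing_cells / total_cells


# Counts taken from the dataset statistics of this report.
pct = missing_cell_percentage(missing_cells=16, n_variables=7, n_observations=35)
print(f"{pct:.1f}%")  # 16 of 245 cells
```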

Variable types

Text: 3
DateTime: 1
Numeric: 2
Categorical: 1

Dataset

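Description: Provides information on the status of disinfection businesses in Geoje-si, Gyeongsangnam-do (report date, business name, location, latitude, longitude, phone number, reference date).
Author: Geoje-si, Gyeongsangnam-do (경상남도 거제시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3079316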

Alerts

기준일자 has constant value "2023-09-15" (Constant)
전화번호 has 16 (45.7%) missing values (Missing)
소독업소명칭 has unique values (Unique)
신고일자 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 23:21:47.350204
Analysis finished: 2023-12-10 23:21:48.077403
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

소독업소명칭 (disinfection business name)
Text

UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 17
Median length: 12
Mean length: 7.0571429
Min length: 3

Characters and Unicode

Total characters: 247
Distinct characters: 103
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
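
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses only the five sample rows listed for this variable, so the counts cover that sample rather than the full column. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Ps` is Open Punctuation and `Pe` is Close Punctuation. The helper name is illustrative.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories (e.g. 'Lo', 'Zs') over all characters."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# The five sample rows listed in this report for 소독업소명칭.
sample = ["주식회사 서휘바이오", "클린스쿨경남", "바르다방역", "에코유", "(주) 거영"]
counts = category_counts(sample)
for code, n in counts.most_common():
    print(code, n)
```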

Unique

Unique: 35
Unique (%): 100.0%

Sample

1st row: 주식회사 서휘바이오
2nd row: 클린스쿨경남
3rd row: 바르다방역
4th row: 에코유
5th row: (주) 거영
Value  Count  Frequency (%)
주식회사  3  6.1%
벌레  1  2.0%
우리환경  1  2.0%
주)에코크린  1  2.0%
에코유방역  1  2.0%
에코유환경  1  2.0%
새마을김반장  1  2.0%
유)참성실한기업  1  2.0%
주)지타운  1  2.0%
키즈119  1  2.0%
Other values (37)  37  75.5%

Most occurring characters

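Value  Count  Frequency (%)
(space)  14  5.7%
(Hangul character lost)  12  4.9%
(  11  4.5%
)  11  4.5%
(Hangul character lost)  10  4.0%
(Hangul character lost)  7  2.8%
(Hangul character lost)  6  2.4%
(Hangul character lost)  5  2.0%
(Hangul character lost)  5  2.0%
(Hangul character lost)  5  2.0%
Other values (93)  161  65.2%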

Most occurring categories

Value  Count  Frequency (%)
Other Letter  196  79.4%
Space Separator  14  5.7%
Open Punctuation  11  4.5%
Close Punctuation  11  4.5%
Uppercase Letter  6  2.4%
Lowercase Letter  4  1.6%
Decimal Number  3  1.2%
Other Punctuation  2  0.8%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(character lost)  12  6.1%
(character lost)  10  5.1%
(character lost)  7  3.6%
(character lost)  6  3.1%
(character lost)  5  2.6%
(character lost)  5  2.6%
(character lost)  5  2.6%
(character lost)  4  2.0%
(character lost)  4  2.0%
(character lost)  4  2.0%
Other values (78)  134  68.4%

Uppercase Letter
Value  Count  Frequency (%)
C  2  33.3%
N  1  16.7%
I  1  16.7%
E  1  16.7%
Z  1  16.7%

Lowercase Letter
Value  Count  Frequency (%)
p  1  25.0%
k  1  25.0%
i  1  25.0%
m  1  25.0%

Decimal Number
Value  Count  Frequency (%)
1  2  66.7%
9  1  33.3%

Space Separator
Value  Count  Frequency (%)
(space)  14  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  11  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  11  100.0%

Other Punctuation
Value  Count  Frequency (%)
.  2  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  196  79.4%
Common  41  16.6%
Latin  10  4.0%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
(character lost)  12  6.1%
(character lost)  10  5.1%
(character lost)  7  3.6%
(character lost)  6  3.1%
(character lost)  5  2.6%
(character lost)  5  2.6%
(character lost)  5  2.6%
(character lost)  4  2.0%
(character lost)  4  2.0%
(character lost)  4  2.0%
Other values (78)  134  68.4%

Latin
Value  Count  Frequency (%)
C  2  20.0%
p  1  10.0%
k  1  10.0%
i  1  10.0%
m  1  10.0%
N  1  10.0%
I  1  10.0%
E  1  10.0%
Z  1  10.0%

Common
Value  Count  Frequency (%)
(space)  14  34.1%
(  11  26.8%
)  11  26.8%
1  2  4.9%
.  2  4.9%
9  1  2.4%

Most occurring blocks

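Value  Count  Frequency (%)
Hangul  196  79.4%
ASCII  51  20.6%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
(space)  14  27.5%
(  11  21.6%
)  11  21.6%
1  2  3.9%
.  2  3.9%
C  2  3.9%
p  1  2.0%
9  1  2.0%
k  1  2.0%
i  1  2.0%
Other values (5)  5  9.8%

Hangul
Value  Count  Frequency (%)
(character lost)  12  6.1%
(character lost)  10  5.1%
(character lost)  7  3.6%
(character lost)  6  3.1%
(character lost)  5  2.6%
(character lost)  5  2.6%
(character lost)  5  2.6%
(character lost)  4  2.0%
(character lost)  4  2.0%
(character lost)  4  2.0%
Other values (78)  134  68.4%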

신고일자 (report date)
Date

UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B
Minimum: 1985-06-12 00:00:00
Maximum: 2023-08-07 00:00:00
Histogram with fixed size bins (bins=35)
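
A minimal sketch of fixed-size binning for dates, assuming equal-width bins between the earliest and latest value (the report does not state how its bin edges are computed, and the function name is illustrative). It uses a handful of 신고일자 values from the sample rows of this report, not the full column, and assumes at least two distinct dates so that the bin width is non-zero.

```python
from datetime import date


def fixed_size_bins(dates, bins):
    """Count dates in `bins` equal-width intervals spanning min(dates)..max(dates)."""
    lo, hi = min(dates).toordinal(), max(dates).toordinal()
    width = (hi - lo) / bins
    counts = [0] * bins
    for d in dates:
        # The maximum sits exactly on the right edge, so clamp it into the last bin.
        idx = min(int((d.toordinal() - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


# A few 신고일자 values taken from the sample rows of this report.
dates = [date(1985, 6, 12), date(1985, 7, 2), date(2002, 4, 2),
         date(2015, 11, 17), date(2022, 10, 13), date(2023, 8, 7)]
counts = fixed_size_bins(dates, bins=35)
print(counts)
```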

주소 (address)
Text

Distinct: 33
Distinct (%): 94.3%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 47
Median length: 35
Mean length: 27.971429
Min length: 21

Characters and Unicode

Total characters: 979
Distinct characters: 94
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 31
Unique (%): 88.6%

Sample

1st row: 경상남도 거제시 장평로 16-12, 14층 1401호 (장평동)
2nd row: 경상남도 거제시 거제면 읍내로7길 22-1
3rd row: 경상남도 거제시 연초면 연하해안로 436, 단독주택 1층
4th row: 경상남도 거제시 계룡로 126, 배진빌딩 2층 (고현동)
5th row: 경상남도 거제시 서문로 19, 3층 (고현동)
Value  Count  Frequency (%)
경상남도  35  16.7%
거제시  35  16.7%
고현동  8  3.8%
1층  7  3.3%
15  4  1.9%
장평동  4  1.9%
연초면  3  1.4%
101호  3  1.4%
옥포동  3  1.4%
상동동  3  1.4%
Other values (89)  104  49.8%

Most occurring characters

ValueCountFrequency (%)
174
 
17.8%
1 55
 
5.6%
43
 
4.4%
41
 
4.2%
41
 
4.2%
41
 
4.2%
35
 
3.6%
35
 
3.6%
35
 
3.6%
35
 
3.6%
Other values (84) 444
45.4%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  550  56.2%
Space Separator  174  17.8%
Decimal Number  174  17.8%
Open Punctuation  27  2.8%
Close Punctuation  25  2.6%
Other Punctuation  24  2.5%
Dash Punctuation  3  0.3%
Uppercase Letter  2  0.2%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(character lost)  43  7.8%
(character lost)  41  7.5%
(character lost)  41  7.5%
(character lost)  41  7.5%
(character lost)  35  6.4%
(character lost)  35  6.4%
(character lost)  35  6.4%
(character lost)  35  6.4%
(character lost)  33  6.0%
(character lost)  16  2.9%
Other values (67)  195  35.5%

Decimal Number
Value  Count  Frequency (%)
1  55  31.6%
2  29  16.7%
0  20  11.5%
3  13  7.5%
6  12  6.9%
4  10  5.7%
9  9  5.2%
7  9  5.2%
8  9  5.2%
5  8  4.6%

Other Punctuation
Value  Count  Frequency (%)
,  22  91.7%
.  2  8.3%

Space Separator
Value  Count  Frequency (%)
(space)  174  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  27  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  25  100.0%

Dash Punctuation
Value  Count  Frequency (%)
-  3  100.0%

Uppercase Letter
Value  Count  Frequency (%)
B  2  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  550  56.2%
Common  427  43.6%
Latin  2  0.2%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
(character lost)  43  7.8%
(character lost)  41  7.5%
(character lost)  41  7.5%
(character lost)  41  7.5%
(character lost)  35  6.4%
(character lost)  35  6.4%
(character lost)  35  6.4%
(character lost)  35  6.4%
(character lost)  33  6.0%
(character lost)  16  2.9%
Other values (67)  195  35.5%

Common
Value  Count  Frequency (%)
(space)  174  40.7%
1  55  12.9%
2  29  6.8%
(  27  6.3%
)  25  5.9%
,  22  5.2%
0  20  4.7%
3  13  3.0%
6  12  2.8%
4  10  2.3%
Other values (6)  40  9.4%

Latin
Value  Count  Frequency (%)
B  2  100.0%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  550  56.2%
ASCII  429  43.8%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
(space)  174  40.6%
1  55  12.8%
2  29  6.8%
(  27  6.3%
)  25  5.8%
,  22  5.1%
0  20  4.7%
3  13  3.0%
6  12  2.8%
4  10  2.3%
Other values (7)  42  9.8%

Hangul
Value  Count  Frequency (%)
(character lost)  43  7.8%
(character lost)  41  7.5%
(character lost)  41  7.5%
(character lost)  41  7.5%
(character lost)  35  6.4%
(character lost)  35  6.4%
(character lost)  35  6.4%
(character lost)  35  6.4%
(character lost)  33  6.0%
(character lost)  16  2.9%
Other values (67)  195  35.5%

위도 (latitude)
Real number (ℝ)

Distinct: 33
Distinct (%): 94.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 128.63861
Minimum: 128.51651
Maximum: 128.73068
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 128.51651
5-th percentile: 128.57401
Q1: 128.61748
Median: 128.6269
Q3: 128.6728
95-th percentile: 128.72181
Maximum: 128.73068
Range: 0.214161
Interquartile range (IQR): 0.055313

Descriptive statistics

Standard deviation: 0.046216082
Coefficient of variation (CV): 0.00035927069
Kurtosis: 0.48699187
Mean: 128.63861
Median Absolute Deviation (MAD): 0.013476
Skewness: 0.11167455
Sum: 4502.3513
Variance: 0.0021359262
Monotonicity: Not monotonic
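
Several of these figures are related by simple identities. The sketch below recomputes the coefficient of variation, the variance and the sum from the reported mean, standard deviation and observation count; it is a consistency check on the rounded figures shown in this report, not the library's own computation, and the variable names are illustrative.

```python
# Values copied from the 위도 statistics in this report.
n = 35
mean = 128.63861
std = 0.046216082

cv = std / mean        # coefficient of variation: standard deviation over mean
variance = std ** 2    # variance is the squared standard deviation
total = mean * n       # sum recovered from the mean and the observation count

print(f"CV={cv:.8g} variance={variance:.8g} sum={total:.8g}")
```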
Histogram with fixed size bins (bins=33)
Common values
Value  Count  Frequency (%)
128.574008  2  5.7%
128.632386  2  5.7%
128.616665  1  2.9%
128.62168  1  2.9%
128.648894  1  2.9%
128.699493  1  2.9%
128.6895  1  2.9%
128.612587  1  2.9%
128.622398  1  2.9%
128.640372  1  2.9%
Other values (23)  23  65.7%

Minimum 10 values
Value  Count  Frequency (%)
128.516515  1  2.9%
128.574008  2  5.7%
128.590727  1  2.9%
128.60812  1  2.9%
128.612587  1  2.9%
128.614731  1  2.9%
128.614761  1  2.9%
128.616665  1  2.9%
128.618305  1  2.9%
128.619386  1  2.9%

Maximum 10 values
Value  Count  Frequency (%)
128.730676  1  2.9%
128.722811  1  2.9%
128.721379  1  2.9%
128.699493  1  2.9%
128.696404  1  2.9%
128.695727  1  2.9%
128.690007  1  2.9%
128.6895  1  2.9%
128.689194  1  2.9%
128.656402  1  2.9%

경도 (longitude)
Real number (ℝ)

Distinct: 33
Distinct (%): 94.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 34.884741
Minimum: 34.822327
Maximum: 34.922062
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 34.822327
5-th percentile: 34.857217
Q1: 34.875287
Median: 34.888211
Q3: 34.893409
95-th percentile: 34.912887
Maximum: 34.922062
Range: 0.0997356
Interquartile range (IQR): 0.0181221

Descriptive statistics

Standard deviation: 0.01893267
Coefficient of variation (CV): 0.00054272066
Kurtosis: 2.6264243
Mean: 34.884741
Median Absolute Deviation (MAD): 0.0062263
Skewness: -0.97285223
Sum: 1220.9659
Variance: 0.00035844598
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=33)
Common values
Value  Count  Frequency (%)
34.883726  2  5.7%
34.8718351  2  5.7%
34.8916116  1  2.9%
34.8901839  1  2.9%
34.8655379  1  2.9%
34.9093097  1  2.9%
34.8973541  1  2.9%
34.8933961  1  2.9%
34.8827569  1  2.9%
34.8934212  1  2.9%
Other values (23)  23  65.7%

Minimum 10 values
Value  Count  Frequency (%)
34.8223267  1  2.9%
34.8476919  1  2.9%
34.8612985  1  2.9%
34.8649479  1  2.9%
34.8655379  1  2.9%
34.8661146  1  2.9%
34.8701827  1  2.9%
34.8718351  2  5.7%
34.878738  1  2.9%
34.8810177  1  2.9%

Maximum 10 values
Value  Count  Frequency (%)
34.9220623  1  2.9%
34.9143518  1  2.9%
34.9122593  1  2.9%
34.9093097  1  2.9%
34.8997358  1  2.9%
34.8973541  1  2.9%
34.8944373  1  2.9%
34.8942047  1  2.9%
34.8934212  1  2.9%
34.8933961  1  2.9%
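
Latitude is bounded by ±90 degrees, so the 위도 (latitude) values near 128.6 cannot be latitudes, while the 경도 (longitude) values near 34.9 could be. This suggests the two columns may be swapped in the source data. A minimal sketch of that range check, using only the minima and maxima reported above (the helper name is illustrative):

```python
def looks_like_latitude(lo: float, hi: float) -> bool:
    """Latitude is defined on [-90, 90] degrees."""
    return -90.0 <= lo <= hi <= 90.0


# Minimum and maximum of each column as reported in this profile.
ranges = {"위도": (128.51651, 128.73068), "경도": (34.822327, 34.922062)}
for name, (lo, hi) in ranges.items():
    print(name, looks_like_latitude(lo, hi))
```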

전화번호 (phone number)
Text

MISSING 

Distinct: 19
Distinct (%): 100.0%
Missing: 16
Missing (%): 45.7%
Memory size: 412.0 B

Length

Max length: 12
Median length: 12
Mean length: 11.842105
Min length: 9
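
These length statistics are consistent with the two formats visible in this variable's sample rows: 12-character numbers such as 055-635-6163 and 9-character numbers such as 1522-1871. With 19 non-missing values, a mean of 11.842105 corresponds to 225 characters in total, which is 18 values of length 12 plus one of length 9. The sketch below classifies values by format; the two patterns are assumptions inferred from the sample rows only, and the function name is illustrative.

```python
import re

# Patterns inferred from the sample rows; other formats may exist in the column.
AREA_CODE = re.compile(r"^0\d{2}-\d{3}-\d{4}$")  # e.g. 055-635-6163 (12 characters)
SHORT = re.compile(r"^\d{4}-\d{4}$")             # e.g. 1522-1871 (9 characters)


def classify(phone):
    """Label a phone value as 'missing', 'area_code', 'short' or 'other'."""
    if phone is None:
        return "missing"
    if AREA_CODE.match(phone):
        return "area_code"
    if SHORT.match(phone):
        return "short"
    return "other"


sample = ["055-635-6163", "055-634-6161", None, "055-687-3161", "055-636-3755", "1522-1871"]
labels = [classify(p) for p in sample]
print(labels)
```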

Characters and Unicode

Total characters: 225
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 19
Unique (%): 100.0%

Sample

1st row: 055-635-6163
2nd row: 055-634-6161
3rd row: 055-687-3161
4th row: 055-636-3755
5th row: 1522-1871
Value  Count  Frequency (%)
055-635-6163  1  5.3%
055-637-6701  1  5.3%
055-688-5252  1  5.3%
055-689-0115  1  5.3%
055-688-0111  1  5.3%
055-633-3204  1  5.3%
055-633-2088  1  5.3%
055-688-5760  1  5.3%
055-638-5114  1  5.3%
055-687-7772  1  5.3%
Other values (9)  9  47.4%

Most occurring characters

Value  Count  Frequency (%)
5  49  21.8%
-  37  16.4%
6  30  13.3%
0  26  11.6%
1  21  9.3%
3  17  7.6%
8  15  6.7%
7  12  5.3%
2  11  4.9%
9  4  1.8%

Most occurring categories

Value  Count  Frequency (%)
Decimal Number  188  83.6%
Dash Punctuation  37  16.4%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
5  49  26.1%
6  30  16.0%
0  26  13.8%
1  21  11.2%
3  17  9.0%
8  15  8.0%
7  12  6.4%
2  11  5.9%
9  4  2.1%
4  3  1.6%

Dash Punctuation
Value  Count  Frequency (%)
-  37  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Common  225  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
5  49  21.8%
-  37  16.4%
6  30  13.3%
0  26  11.6%
1  21  9.3%
3  17  7.6%
8  15  6.7%
7  12  5.3%
2  11  4.9%
9  4  1.8%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  225  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
5  49  21.8%
-  37  16.4%
6  30  13.3%
0  26  11.6%
1  21  9.3%
3  17  7.6%
8  15  6.7%
7  12  5.3%
2  11  4.9%
9  4  1.8%

기준일자 (reference date)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Value  Count
2023-09-15  35

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-09-15
2nd row: 2023-09-15
3rd row: 2023-09-15
4th row: 2023-09-15
5th row: 2023-09-15

Common Values

Value  Count  Frequency (%)
2023-09-15  35  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
2023-09-15  35  100.0%

Interactions


Correlations

Variable  소독업소명칭  신고일자  주소  위도  경도  전화번호
소독업소명칭  1.000  1.000  1.000  1.000  1.000  1.000
신고일자  1.000  1.000  1.000  1.000  1.000  1.000
주소  1.000  1.000  1.000  1.000  1.000  1.000
위도  1.000  1.000  1.000  1.000  0.697  1.000
경도  1.000  1.000  1.000  0.697  1.000  1.000
전화번호  1.000  1.000  1.000  1.000  1.000  1.000

Variable  위도  경도
위도  1.000  -0.084
경도  -0.084  1.000
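
The matrices above do not state which coefficient they report. As one common choice, the sketch below computes the Pearson correlation coefficient in plain Python. It uses only the first five sample rows of this report, so its result is not expected to match either matrix, and the function and variable names are illustrative.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# 위도 and 경도 from the first five sample rows only (not the full column).
wido = [128.616665, 128.590727, 128.619411, 128.622375, 128.618305]
gyeongdo = [34.891612, 34.847692, 34.912259, 34.881018, 34.888211]
r = pearson(wido, gyeongdo)
print(round(r, 3))
```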

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
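
A minimal sketch of both views in plain Python: a per-column count of missing values and a one-character-per-row nullity strip. It uses the 전화번호 values of the first ten sample rows of this report, with `None` standing in for `<NA>`; the function name is illustrative.

```python
def nullity_by_column(rows):
    """Return {column: number of missing values}; None stands for <NA>."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + (value is None)
    return counts


# 전화번호 for the first ten sample rows of this report.
phones = ["055-635-6163", "055-634-6161", None, None, "055-687-3161",
          None, None, "055-636-3755", None, None]
rows = [{"row": i, "전화번호": p} for i, p in enumerate(phones)]

missing = nullity_by_column(rows)
# One character per row: '#' for a present value, '.' for a missing one.
matrix_column = "".join("." if p is None else "#" for p in phones)
print(missing, matrix_column)
```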

Sample

First rows
Index  소독업소명칭  신고일자  주소  위도  경도  전화번호  기준일자
0  주식회사 서휘바이오  2023-08-07  경상남도 거제시 장평로 16-12, 14층 1401호 (장평동)  128.616665  34.891612  055-635-6163  2023-09-15
1  클린스쿨경남  2023-03-02  경상남도 거제시 거제면 읍내로7길 22-1  128.590727  34.847692  055-634-6161  2023-09-15
2  바르다방역  2022-10-13  경상남도 거제시 연초면 연하해안로 436, 단독주택 1층  128.619411  34.912259  <NA>  2023-09-15
3  에코유  2022-04-22  경상남도 거제시 계룡로 126, 배진빌딩 2층 (고현동)  128.622375  34.881018  <NA>  2023-09-15
4  (주) 거영  2022-01-21  경상남도 거제시 서문로 19, 3층 (고현동)  128.618305  34.888211  055-687-3161  2023-09-15
5  장승포 마을관리 사회적 협동조합  2021-02-22  경상남도 거제시 장승포로7길 12, 4층 402호 (장승포동)  128.730676  34.866115  <NA>  2023-09-15
6  (주)클린  2021-02-01  경상남도 거제시 고현천로 74, 1층 (고현동)  128.626896  34.885662  <NA>  2023-09-15
7  세프로경남  2020-12-18  경상남도 거제시 상동11길 15, 1층 (상동동)  128.632386  34.871835  055-636-3755  2023-09-15
8  제우스 클린  2020-10-26  경상남도 거제시 옥포대첩로2길 2 (옥포동)  128.689194  34.893068  <NA>  2023-09-15
9  (주)참성실한기업  2020-08-14  경상남도 거제시 고현로14길 15, 1층 (고현동)  128.627467  34.886889  <NA>  2023-09-15

Last rows
Index  소독업소명칭  신고일자  주소  위도  경도  전화번호  기준일자
25  벌레 잡는 사람들  2015-11-17  경상남도 거제시 옥포로26길 3 (옥포동)  128.6895  34.897354  055-688-5252  2023-09-15
26  (주)에코크린  2015-03-17  경상남도 거제시 장평로6길 23, B동 103호 (장평동, 대한아파트 상가동)  128.612587  34.893396  055-637-6701  2023-09-15
27  에코유방역. 에코유환경  2007-05-23  경상남도 거제시 고현로2길 36 (고현동)  128.622398  34.882757  055-638-5114  2023-09-15
28  대진환경  2007-04-25  경상남도 거제시 거제대로 4537-1 (수월동)  128.640372  34.893421  <NA>  2023-09-15
29  (주)맑은환경산업  2005-06-03  경상남도 거제시 연초면 거제북로 130  128.656402  34.922062  055-688-5760  2023-09-15
30  토탈환경산업  2004-12-04  경상남도 거제시 중곡2로2길 14 (고현동)  128.625642  34.894205  055-633-2088  2023-09-15
31  거제방역  2003-10-21  경상남도 거제시 중곡1로2길 18 (고현동)  128.627123  34.894437  055-633-3204  2023-09-15
32  제일환경산업  2002-04-02  경상남도 거제시 사등면 성포로1길 15  128.516515  34.914352  055-688-0111  2023-09-15
33  (주)웰리브  1985-07-02  경상남도 거제시 옥포로 122, 5층 (옥포동)  128.695727  34.888435  055-689-0115  2023-09-15
34  주식회사 티에스  1985-06-12  경상남도 거제시 연초면 연하해안로 271, B동 101호  128.624493  34.899736  055-636-1117  2023-09-15