Overview

Dataset statistics

Number of variables: 4
Number of observations: 84
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 34.6 B

Variable types

Numeric: 1
Text: 2
Categorical: 1

Dataset

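Description: Data on disinfection businesses located in Yuseong-gu, Daejeon Metropolitan City, providing fields such as business name, business address, and business telephone number.
Author: Yuseong-gu, Daejeon Metropolitan City
URL: https://www.data.go.kr/data/15080693/fileData.do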

Alerts

연번 has unique values (Unique)
업소소재지 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 14:35:09.194468
Analysis finished: 2023-12-12 14:35:09.756397
Duration: 0.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
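
A report of this kind can be regenerated roughly as follows. This is a minimal sketch rather than the configuration actually used: the report does not record the source file name or its encoding, so `csv_path`, the `cp949` default, the title, and the helper name `build_report` are all assumptions made for illustration.

```python
def build_report(csv_path, out_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report.

    csv_path, out_path and encoding are illustrative assumptions; the
    report does not record how the source file was loaded.
    """
    # Imported lazily so the function can be defined without
    # pandas / ydata-profiling installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(
        df,
        title="Disinfection businesses in Yuseong-gu",  # hypothetical title
        dataset={
            "description": "Disinfection businesses in Yuseong-gu, Daejeon",
            "url": "https://www.data.go.kr/data/15080693/fileData.do",
        },
    )
    profile.to_file(out_path)
    return profile
```

Calling `build_report("some_file.csv")` with a real path would write `report.html` next to the script; the timings above will differ between runs and machines.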

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 84
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 42.5
Minimum: 1
Maximum: 84
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 888.0 B

Quantile statistics

Minimum: 1
5-th percentile: 5.15
Q1: 21.75
Median: 42.5
Q3: 63.25
95-th percentile: 79.85
Maximum: 84
Range: 83
Interquartile range (IQR): 41.5

Descriptive statistics

Standard deviation: 24.392622
Coefficient of variation (CV): 0.57394404
Kurtosis: -1.2
Mean: 42.5
Median Absolute Deviation (MAD): 21
Skewness: 0
Sum: 3570
Variance: 595
Monotonicity: Strictly increasing
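
Because 연번 is strictly increasing with 84 distinct values between 1 and 84, the figures above are what one would expect from the plain sequence 1..84. The sketch below checks this with the standard library only. It assumes the column is exactly 1, 2, ..., 84 and uses linear-interpolation percentiles and the sample (n-1) variance; `percentile` is a helper written for this example, not part of any library. Kurtosis and skewness are left out.

```python
import statistics

# Assumption: the 연번 column is exactly the integers 1..84.
values = list(range(1, 85))


def percentile(sorted_vals, q):
    """Percentile by linear interpolation between neighbouring ranks."""
    pos = (len(sorted_vals) - 1) * q
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (pos - lo)


mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
mad = statistics.median(abs(v - median) for v in values)

q05 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
q95 = percentile(values, 0.95)
iqr = q3 - q1

print(f"mean={mean} median={median} sum={sum(values)} variance={variance}")
print(f"std={std:.6f} cv={cv:.8f} mad={mad}")
print(f"p5={q05:.2f} q1={q1} q3={q3} p95={q95:.2f} iqr={iqr}")
```

The agreement is consistent with 연번 being a simple row counter, which would also explain why it carries no analytical information of its own.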
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
1 | 1 | 1.2%
55 | 1 | 1.2%
63 | 1 | 1.2%
62 | 1 | 1.2%
61 | 1 | 1.2%
60 | 1 | 1.2%
59 | 1 | 1.2%
58 | 1 | 1.2%
57 | 1 | 1.2%
56 | 1 | 1.2%
Other values (74) | 74 | 88.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.2%
2 | 1 | 1.2%
3 | 1 | 1.2%
4 | 1 | 1.2%
5 | 1 | 1.2%
6 | 1 | 1.2%
7 | 1 | 1.2%
8 | 1 | 1.2%
9 | 1 | 1.2%
10 | 1 | 1.2%

Maximum 10 values

Value | Count | Frequency (%)
84 | 1 | 1.2%
83 | 1 | 1.2%
82 | 1 | 1.2%
81 | 1 | 1.2%
80 | 1 | 1.2%
79 | 1 | 1.2%
78 | 1 | 1.2%
77 | 1 | 1.2%
76 | 1 | 1.2%
75 | 1 | 1.2%

업소명 (business name)
Text

Distinct: 83
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 804.0 B

Length

Max length: 17
Median length: 13
Mean length: 7.7738095
Min length: 2

Characters and Unicode

Total characters: 653
Distinct characters: 180
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
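
As an illustration of how such per-category counts can be derived, the sketch below tallies Unicode general categories with Python's `unicodedata` module over the five sample rows of 업소명 shown in this report. It covers only those five values, so its totals are much smaller than the report's, and the `CATEGORY_NAMES` mapping from short codes to long names is written here for readability; it is not taken from ydata-profiling.

```python
import unicodedata
from collections import Counter

# The five sample rows of 업소명 shown in the report.
names = [
    "주식회사 홈밸런스주택관리",
    "주식회사 그린이앤씨",
    "수빈환경주택관리",
    "블루가드",
    "(주)베가",
]

# Short general-category codes mapped to the long names the report prints.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Po": "Other Punctuation",
}

category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for name in names
    for ch in name
)

for category, count in category_counts.most_common():
    print(f"{category}: {count}")
```

Hangul syllables fall under "Other Letter" (`Lo`), which is why that category dominates in both this small sample and the full column.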

Unique

Unique: 82
Unique (%): 97.6%

Sample

1st row: 주식회사 홈밸런스주택관리
2nd row: 주식회사 그린이앤씨
3rd row: 수빈환경주택관리
4th row: 블루가드
5th row: (주)베가

Value | Count | Frequency (%)
주식회사 | 17 | 15.7%
그린f5(대전유성본부 | 2 | 1.9%
합자회사 | 2 | 1.9%
주)백경 | 1 | 0.9%
주)진화 | 1 | 0.9%
대전방역 | 1 | 0.9%
주)신성엠에스 | 1 | 0.9%
성진기업(주 | 1 | 0.9%
합)신성기업 | 1 | 0.9%
신우이레산업 | 1 | 0.9%
Other values (80) | 80 | 74.1%

Most occurring characters

ValueCountFrequency (%)
57
 
8.7%
( 38
 
5.8%
) 38
 
5.8%
24
 
3.7%
23
 
3.5%
20
 
3.1%
19
 
2.9%
18
 
2.8%
15
 
2.3%
14
 
2.1%
Other values (170) 387
59.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 542 | 83.0%
Open Punctuation | 38 | 5.8%
Close Punctuation | 38 | 5.8%
Space Separator | 24 | 3.7%
Decimal Number | 5 | 0.8%
Uppercase Letter | 5 | 0.8%
Other Punctuation | 1 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
10.5%
23
 
4.2%
20
 
3.7%
19
 
3.5%
18
 
3.3%
15
 
2.8%
14
 
2.6%
11
 
2.0%
10
 
1.8%
10
 
1.8%
Other values (159) 345
63.7%
Uppercase Letter
ValueCountFrequency (%)
F 2
40.0%
G 1
20.0%
R 1
20.0%
E 1
20.0%
Decimal Number
ValueCountFrequency (%)
5 3
60.0%
3 1
 
20.0%
6 1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Space Separator
ValueCountFrequency (%)
24
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 542 | 83.0%
Common | 106 | 16.2%
Latin | 5 | 0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
10.5%
23
 
4.2%
20
 
3.7%
19
 
3.5%
18
 
3.3%
15
 
2.8%
14
 
2.6%
11
 
2.0%
10
 
1.8%
10
 
1.8%
Other values (159) 345
63.7%
Common
ValueCountFrequency (%)
( 38
35.8%
) 38
35.8%
24
22.6%
5 3
 
2.8%
· 1
 
0.9%
3 1
 
0.9%
6 1
 
0.9%
Latin
ValueCountFrequency (%)
F 2
40.0%
G 1
20.0%
R 1
20.0%
E 1
20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 542 | 83.0%
ASCII | 110 | 16.8%
None | 1 | 0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
57
 
10.5%
23
 
4.2%
20
 
3.7%
19
 
3.5%
18
 
3.3%
15
 
2.8%
14
 
2.6%
11
 
2.0%
10
 
1.8%
10
 
1.8%
Other values (159) 345
63.7%
ASCII
ValueCountFrequency (%)
( 38
34.5%
) 38
34.5%
24
21.8%
5 3
 
2.7%
F 2
 
1.8%
3 1
 
0.9%
G 1
 
0.9%
R 1
 
0.9%
E 1
 
0.9%
6 1
 
0.9%
None
ValueCountFrequency (%)
· 1
100.0%

업소소재지 (business address)
Text

UNIQUE 

Distinct: 84
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 804.0 B

Length

Max length: 50
Median length: 39
Mean length: 33.690476
Min length: 22

Characters and Unicode

Total characters: 2830
Distinct characters: 140
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 84
Unique (%): 100.0%

Sample

1st row: 대전광역시 유성구 월드컵대로275번길 53, 103호 (구암동)
2nd row: 대전광역시 유성구 신성남로 115, 1층 102호 (신성동)
3rd row: 대전광역시 유성구 계룡로26번길 35, 1층 102호 (구암동)
4th row: 대전광역시 유성구 은구비남로7번길 19, 3층 301호 (지족동)
5th row: 대전광역시 유성구 학하로 33, 제상가동 1층 103호 (계산동, 학하리슈빌 학의뜰아파트)

Value | Count | Frequency (%)
대전광역시 | 84 | 15.6%
유성구 | 84 | 15.6%
1층 | 14 | 2.6%
봉명동 | 12 | 2.2%
구암동 | 11 | 2.0%
지상1층 | 10 | 1.9%
2층 | 9 | 1.7%
원내동 | 8 | 1.5%
도룡동 | 6 | 1.1%
102호 | 6 | 1.1%
Other values (198) | 294 | 54.6%

Most occurring characters

ValueCountFrequency (%)
455
 
16.1%
1 132
 
4.7%
126
 
4.5%
107
 
3.8%
98
 
3.5%
96
 
3.4%
94
 
3.3%
86
 
3.0%
, 86
 
3.0%
) 84
 
3.0%
Other values (130) 1466
51.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1603 | 56.6%
Decimal Number | 494 | 17.5%
Space Separator | 455 | 16.1%
Other Punctuation | 86 | 3.0%
Close Punctuation | 84 | 3.0%
Open Punctuation | 84 | 3.0%
Dash Punctuation | 18 | 0.6%
Uppercase Letter | 6 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
126
 
7.9%
107
 
6.7%
98
 
6.1%
96
 
6.0%
94
 
5.9%
86
 
5.4%
84
 
5.2%
84
 
5.2%
84
 
5.2%
84
 
5.2%
Other values (109) 660
41.2%
Decimal Number
ValueCountFrequency (%)
1 132
26.7%
2 74
15.0%
0 51
 
10.3%
3 51
 
10.3%
5 51
 
10.3%
7 36
 
7.3%
6 28
 
5.7%
4 27
 
5.5%
8 23
 
4.7%
9 21
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
F 1
16.7%
T 1
16.7%
S 1
16.7%
I 1
16.7%
A 1
16.7%
K 1
16.7%
Space Separator
ValueCountFrequency (%)
455
100.0%
Other Punctuation
ValueCountFrequency (%)
, 86
100.0%
Close Punctuation
ValueCountFrequency (%)
) 84
100.0%
Open Punctuation
ValueCountFrequency (%)
( 84
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1603 | 56.6%
Common | 1221 | 43.1%
Latin | 6 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
126
 
7.9%
107
 
6.7%
98
 
6.1%
96
 
6.0%
94
 
5.9%
86
 
5.4%
84
 
5.2%
84
 
5.2%
84
 
5.2%
84
 
5.2%
Other values (109) 660
41.2%
Common
ValueCountFrequency (%)
455
37.3%
1 132
 
10.8%
, 86
 
7.0%
) 84
 
6.9%
( 84
 
6.9%
2 74
 
6.1%
0 51
 
4.2%
3 51
 
4.2%
5 51
 
4.2%
7 36
 
2.9%
Other values (5) 117
 
9.6%
Latin
ValueCountFrequency (%)
F 1
16.7%
T 1
16.7%
S 1
16.7%
I 1
16.7%
A 1
16.7%
K 1
16.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1603 | 56.6%
ASCII | 1227 | 43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
455
37.1%
1 132
 
10.8%
, 86
 
7.0%
) 84
 
6.8%
( 84
 
6.8%
2 74
 
6.0%
0 51
 
4.2%
3 51
 
4.2%
5 51
 
4.2%
7 36
 
2.9%
Other values (11) 123
 
10.0%
Hangul
ValueCountFrequency (%)
126
 
7.9%
107
 
6.7%
98
 
6.1%
96
 
6.0%
94
 
5.9%
86
 
5.4%
84
 
5.2%
84
 
5.2%
84
 
5.2%
84
 
5.2%
Other values (109) 660
41.2%

전화번호 (telephone number)
Categorical

Distinct: 29
Distinct (%): 34.5%
Missing: 0
Missing (%): 0.0%
Memory size: 804.0 B

Value | Count
042-000-0000 | 56
042-478-6697 | 1
042-826-5836 | 1
042-826-1182 | 1
1588-2071 | 1
Other values (24) | 24

Length

Max length: 12
Median length: 12
Mean length: 11.964286
Min length: 9

Unique

Unique: 28
Unique (%): 33.3%
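
The gap between Distinct (29) and Unique (28) is consistent with "distinct" counting the different values and "unique" counting the values that occur exactly once: one value repeats 56 times and the other 28 appear once each, which sums to 84 rows. A minimal sketch of that reading, using only the ten 전화번호 values visible in the last rows of the sample (so its counts are smaller than the report's):

```python
from collections import Counter

# 전화번호 values from the last ten rows shown in the report's sample.
phones = [
    "042-822-2130", "042-000-0000", "042-825-1142", "042-543-8703",
    "042-822-3610", "042-000-0000", "042-536-2650", "042-000-0000",
    "042-545-1688", "042-861-5050",
]

counts = Counter(phones)
n_distinct = len(counts)                              # different values
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(f"distinct={n_distinct} unique={n_unique}")
```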

Sample

1st row: 042-000-0000
2nd row: 042-000-0000
3rd row: 042-000-0000
4th row: 042-000-0000
5th row: 042-000-0000

Common Values

Value | Count | Frequency (%)
042-000-0000 | 56 | 66.7%
042-478-6697 | 1 | 1.2%
042-826-5836 | 1 | 1.2%
042-826-1182 | 1 | 1.2%
1588-2071 | 1 | 1.2%
042-252-3900 | 1 | 1.2%
042-253-5000 | 1 | 1.2%
042-365-1736 | 1 | 1.2%
042-931-4377 | 1 | 1.2%
042-531-8250 | 1 | 1.2%
Other values (19) | 19 | 22.6%
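
The value 042-000-0000 accounts for 56 of the 84 rows, which suggests it may be a placeholder rather than a real number. The report itself does not say so, so treating it as "not provided" is an assumption. Under that assumption, a sketch of masking it before further analysis, again using only the ten phone values visible in the last rows of the sample:

```python
# Assumption: this value marks a masked or not-provided phone number.
PLACEHOLDER = "042-000-0000"

# 전화번호 values from the last ten rows shown in the report's sample.
phones = [
    "042-822-2130", "042-000-0000", "042-825-1142", "042-543-8703",
    "042-822-3610", "042-000-0000", "042-536-2650", "042-000-0000",
    "042-545-1688", "042-861-5050",
]

# Replace the placeholder with None so it is counted as missing.
cleaned = [None if p == PLACEHOLDER else p for p in phones]
sample_share = cleaned.count(None) / len(cleaned)

# Share implied by the report's own counts for the full column.
report_share = 56 / 84

print(f"sample: {sample_share:.1%}, full column: {report_share:.1%}")
```

If the assumption holds, the "Missing: 0" figure for this column understates how many rows actually lack a usable phone number.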

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
042-000-0000 | 56 | 65.9%
042-478-6697 | 1 | 1.2%
042-545-1688 | 1 | 1.2%
042-536-2650 | 1 | 1.2%
042-822-3610 | 1 | 1.2%
042-543-8703 | 1 | 1.2%
042-825-1142 | 1 | 1.2%
042-822-2130 | 1 | 1.2%
042-826-1081 | 1 | 1.2%
042-383-4436 | 1 | 1.2%
Other values (20) | 20 | 23.5%

Interactions


Correlations

 | 연번 | 업소명 | 업소소재지 | 전화번호
연번 | 1.000 | 0.947 | 1.000 | 0.333
업소명 | 0.947 | 1.000 | 1.000 | 0.946
업소소재지 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 0.333 | 0.946 | 1.000 | 1.000

 | 연번 | 전화번호
연번 | 1.000 | 0.084
전화번호 | 0.084 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index) | 연번 | 업소명 | 업소소재지 | 전화번호
0 | 1 | 주식회사 홈밸런스주택관리 | 대전광역시 유성구 월드컵대로275번길 53, 103호 (구암동) | 042-000-0000
1 | 2 | 주식회사 그린이앤씨 | 대전광역시 유성구 신성남로 115, 1층 102호 (신성동) | 042-000-0000
2 | 3 | 수빈환경주택관리 | 대전광역시 유성구 계룡로26번길 35, 1층 102호 (구암동) | 042-000-0000
3 | 4 | 블루가드 | 대전광역시 유성구 은구비남로7번길 19, 3층 301호 (지족동) | 042-000-0000
4 | 5 | (주)베가 | 대전광역시 유성구 학하로 33, 제상가동 1층 103호 (계산동, 학하리슈빌 학의뜰아파트) | 042-000-0000
5 | 6 | (주)원메디 | 대전광역시 유성구 복용남로 55, 지상1층 (복용동) | 042-000-0000
6 | 7 | 그린F5(대전유성본부) | 대전광역시 유성구 대학로81번길 32-19, 102호 (궁동) | 042-000-0000
7 | 8 | 누리헬스케어 | 대전광역시 유성구 교촌로10번길 9(교촌동) | 042-000-0000
8 | 9 | 크린앤해피 | 대전광역시 유성구 동서대로5번길 21-5, 삼성빌리지 1층 (구암동) | 042-000-0000
9 | 10 | 주식회사 테크노월드 | 대전광역시 유성구 테크노중앙로 54, 3층 306호 (관평동) | 042-000-0000

Last rows

(index) | 연번 | 업소명 | 업소소재지 | 전화번호
74 | 75 | 밝은미래환경 | 대전광역시 유성구 계룡로105번길 15 (봉명동, 한진오피스텔 309호) | 042-822-2130
75 | 76 | (주)중부환경엔지니어링 | 대전광역시 유성구 진잠로124번길 15-15, 2층 (원내동) | 042-000-0000
76 | 77 | 세기위생방역공사 | 대전광역시 유성구 노은동로75번길 12 (노은동) | 042-825-1142
77 | 78 | 청솔환경 | 대전광역시 유성구 진잠로 97 (교촌동) | 042-543-8703
78 | 79 | (주)그린존 | 대전광역시 유성구 유성대로821번길 31 (장대동,101호) | 042-822-3610
79 | 80 | 합자회사 지엔지 | 대전광역시 유성구 신성남로 115 (신성동) | 042-000-0000
80 | 81 | 합자회사 세진밀레니엄 | 대전광역시 유성구 대덕대로590번길 12-13, 지상1층 107호 (도룡동) | 042-536-2650
81 | 82 | 미소방역 | 대전광역시 유성구 계룡로46번길 70, 1층 (구암동) | 042-000-0000
82 | 83 | 고려용역 | 대전광역시 유성구 용계로41번길 23-17 (용계동) | 042-545-1688
83 | 84 | (주)글로벌종합환경 | 대전광역시 유성구 어은로48번길 1 (어은동) | 042-861-5050