Overview

Dataset statistics

Number of variables: 5
Number of observations: 71
Missing cells: 4
Missing cells (%): 1.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.0 KiB
Average record size in memory: 42.9 B
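The percentages and sizes above follow from simple arithmetic on the counts. The sketch below re-derives them; it assumes the missing-cell percentage is taken over all cells (variables × observations) and that the total size is the average record size times the number of rows, which is consistent with the reported figures but is not confirmed by the report itself.

```python
# Counts copied from the dataset statistics above.
n_variables = 5
n_observations = 71
missing_cells = 4
avg_record_bytes = 42.9

# Assumption: the percentage is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size = average record size * number of rows.
total_kib = avg_record_bytes * n_observations / 1024

print(f"{total_cells} cells, {missing_pct:.1f}% missing, {total_kib:.1f} KiB")
```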

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

Description: Provides information on disinfection companies in Suseong-gu, Daegu Metropolitan City (business division, disinfection business name, office location, contact information).
URL: https://www.data.go.kr/data/15080732/fileData.do

Alerts

영업구분 has constant value "영업중" (Constant)
전화번호 has 4 (5.6%) missing values (Missing)
순번 has unique values (Unique)
소독업소명칭 has unique values (Unique)
소재지(도로명) has unique values (Unique)
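As a rough illustration of how alerts like these can be derived from a column, here is a minimal sketch. The function `column_alerts` and its rules are assumptions made for this example, not ydata-profiling's actual implementation. In particular, it flags a column as Unique only when every row (missing cells included) holds a distinct value, which is consistent with 전화번호 being listed above as Missing but not Unique.

```python
def column_alerts(values):
    """Return alert labels for one column; None marks a missing cell."""
    n = len(values)
    present = [v for v in values if v is not None]
    distinct = len(set(present))
    alerts = []
    if len(present) < n:
        alerts.append("Missing")
    if n > 1 and distinct == 1:
        alerts.append("Constant")
    elif n > 1 and distinct == n:
        # Assumed rule: a column with missing cells cannot be Unique.
        alerts.append("Unique")
    return alerts

# First five rows of the report's sample table.
status = ["영업중"] * 5
phones = ["0507-1318-8118", "053-767-4542", "1522-7657", "053-716-3549", None]
seq = [1, 2, 3, 4, 5]

for name, col in [("영업구분", status), ("전화번호", phones), ("순번", seq)]:
    print(name, column_alerts(col))
```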

Reproduction

Analysis started: 2023-12-12 15:38:24.049862
Analysis finished: 2023-12-12 15:38:24.642835
Duration: 0.59 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
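A report of this kind can be regenerated along the following lines. This is a sketch under assumptions: the helper name `build_report`, the report title, and the `cp949` encoding (common for Korean public-data CSV files) are not taken from the report, and the original `config.json` is not applied here. It needs `pandas` and `ydata-profiling` to be installed.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so the function can be defined even when the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is a CSV download from the dataset URL.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Suseong-gu disinfection companies")
    report.to_file(html_path)
    return report
```

A call would look like `build_report("suseong_disinfection.csv", "report.html")`, where both file names are hypothetical.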

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 71
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36
Minimum: 1
Maximum: 71
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 771.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.5
Q1: 18.5
Median: 36
Q3: 53.5
95-th percentile: 67.5
Maximum: 71
Range: 70
Interquartile range (IQR): 35
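Because 순번 is simply the sequence 1 to 71, the quantile statistics above can be checked with the standard library. The sketch assumes linear interpolation between order statistics, which is what `statistics.quantiles(..., method="inclusive")` computes; that choice reproduces the reported values.

```python
import statistics

seq = list(range(1, 72))  # 순번 runs from 1 to 71

# Quartiles and 5%-steps, interpolating linearly between data points.
q1, median, q3 = statistics.quantiles(seq, n=4, method="inclusive")
twentieths = statistics.quantiles(seq, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

value_range = max(seq) - min(seq)
iqr = q3 - q1

print(p5, q1, median, q3, p95, value_range, iqr)
```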

Descriptive statistics

Standard deviation: 20.639767
Coefficient of variation (CV): 0.57332687
Kurtosis: -1.2
Mean: 36
Median Absolute Deviation (MAD): 18
Skewness: 0
Sum: 2556
Variance: 426
Monotonicity: Strictly increasing
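Most of the descriptive statistics above can be reproduced for the sequence 1 to 71 with the standard library. The sketch assumes the sample variance (n - 1 denominator) and a MAD defined as the median of absolute deviations from the median; both assumptions match the reported numbers. Kurtosis and skewness are left out.

```python
import math
import statistics

seq = list(range(1, 72))  # 순번 runs from 1 to 71

total = sum(seq)
mean = statistics.mean(seq)
variance = statistics.variance(seq)   # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                       # coefficient of variation
median = statistics.median(seq)
mad = statistics.median(abs(x - median) for x in seq)
strictly_increasing = all(a < b for a, b in zip(seq, seq[1:]))

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```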
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.4%
2 | 1 | 1.4%
53 | 1 | 1.4%
52 | 1 | 1.4%
51 | 1 | 1.4%
50 | 1 | 1.4%
49 | 1 | 1.4%
48 | 1 | 1.4%
47 | 1 | 1.4%
46 | 1 | 1.4%
Other values (61) | 61 | 85.9%
ValueCountFrequency (%)
1 1
1.4%
2 1
1.4%
3 1
1.4%
4 1
1.4%
5 1
1.4%
6 1
1.4%
7 1
1.4%
8 1
1.4%
9 1
1.4%
10 1
1.4%
ValueCountFrequency (%)
71 1
1.4%
70 1
1.4%
69 1
1.4%
68 1
1.4%
67 1
1.4%
66 1
1.4%
65 1
1.4%
64 1
1.4%
63 1
1.4%
62 1
1.4%

영업구분
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 700.0 B
영업중
71

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업중
2nd row: 영업중
3rd row: 영업중
4th row: 영업중
5th row: 영업중

Common Values

ValueCountFrequency (%)
영업중 71
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
영업중 71
100.0%

소독업소명칭
Text

UNIQUE 

Distinct: 71
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 700.0 B

Length

Max length: 17
Median length: 11
Mean length: 7.0704225
Min length: 2

Characters and Unicode

Total characters: 502
Distinct characters: 172
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
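To make these character properties concrete, the sketch below counts Unicode general categories and blocks for the first sample value of this variable. It uses Python's `unicodedata` module; the helper names and the two-block lookup (`block_of`) are simplifications made for this example, not the profiler's own code. In the category codes, `Lo` is Other Letter, `Lu` is Uppercase Letter, `Ps` is Open Punctuation and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'Lu') in text."""
    return Counter(unicodedata.category(ch) for ch in text)

def block_of(ch):
    """Crude block lookup covering only the two blocks seen in this report."""
    cp = ord(ch)
    if cp < 0x80:
        return "ASCII"
    if 0xAC00 <= cp <= 0xD7AF:  # Hangul Syllables block
        return "Hangul"
    return "Other"

name = "푸레자(PUREZA)"  # 1st sample row of 소독업소명칭
cats = category_counts(name)
blocks = Counter(block_of(ch) for ch in name)

print(dict(cats))
print(dict(blocks))
```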

Unique

Unique: 71
Unique (%): 100.0%

Sample

1st row: 푸레자(PUREZA)
2nd row: 주식회사 디지티모빌리티
3rd row: 화담주식회사
4th row: 미르크린파워
5th row: 원스톱 방역
Value | Count | Frequency (%)
주식회사 | 7 | 8.1%
푸레자(pureza | 1 | 1.2%
ak환경 | 1 | 1.2%
대구수성시니어클럽 | 1 | 1.2%
주)동영솔루션 | 1 | 1.2%
주)원방역공사 | 1 | 1.2%
월드그린 | 1 | 1.2%
하나씨앤에스 | 1 | 1.2%
제이앤에스개발 | 1 | 1.2%
주)청소하는마을 | 1 | 1.2%
Other values (70) | 70 | 81.4%

Most occurring characters

ValueCountFrequency (%)
30
 
6.0%
( 25
 
5.0%
) 25
 
5.0%
17
 
3.4%
15
 
3.0%
15
 
3.0%
12
 
2.4%
10
 
2.0%
9
 
1.8%
9
 
1.8%
Other values (162) 335
66.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 418
83.3%
Open Punctuation 25
 
5.0%
Close Punctuation 25
 
5.0%
Uppercase Letter 18
 
3.6%
Space Separator 15
 
3.0%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
7.2%
17
 
4.1%
15
 
3.6%
12
 
2.9%
10
 
2.4%
9
 
2.2%
9
 
2.2%
8
 
1.9%
8
 
1.9%
8
 
1.9%
Other values (145) 292
69.9%
Uppercase Letter
ValueCountFrequency (%)
E 3
16.7%
S 3
16.7%
A 2
11.1%
Z 1
 
5.6%
R 1
 
5.6%
U 1
 
5.6%
K 1
 
5.6%
P 1
 
5.6%
G 1
 
5.6%
I 1
 
5.6%
Other values (3) 3
16.7%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 418
83.3%
Common 66
 
13.1%
Latin 18
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
7.2%
17
 
4.1%
15
 
3.6%
12
 
2.9%
10
 
2.4%
9
 
2.2%
9
 
2.2%
8
 
1.9%
8
 
1.9%
8
 
1.9%
Other values (145) 292
69.9%
Latin
ValueCountFrequency (%)
E 3
16.7%
S 3
16.7%
A 2
11.1%
Z 1
 
5.6%
R 1
 
5.6%
U 1
 
5.6%
K 1
 
5.6%
P 1
 
5.6%
G 1
 
5.6%
I 1
 
5.6%
Other values (3) 3
16.7%
Common
ValueCountFrequency (%)
( 25
37.9%
) 25
37.9%
15
22.7%
, 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 418
83.3%
ASCII 84
 
16.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
30
 
7.2%
17
 
4.1%
15
 
3.6%
12
 
2.9%
10
 
2.4%
9
 
2.2%
9
 
2.2%
8
 
1.9%
8
 
1.9%
8
 
1.9%
Other values (145) 292
69.9%
ASCII
ValueCountFrequency (%)
( 25
29.8%
) 25
29.8%
15
17.9%
E 3
 
3.6%
S 3
 
3.6%
A 2
 
2.4%
Z 1
 
1.2%
R 1
 
1.2%
U 1
 
1.2%
K 1
 
1.2%
Other values (7) 7
 
8.3%

소재지(도로명)
Text

UNIQUE

Distinct: 71
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 700.0 B

Length

Max length: 41
Median length: 32
Mean length: 27.197183
Min length: 18

Characters and Unicode

Total characters: 1931
Distinct characters: 84
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 71
Unique (%): 100.0%

Sample

1st row: 대구광역시 수성구 들안로81길37(수성동4가)
2nd row: 대구광역시 수성구 무학로 196(지산동)
3rd row: 대구광역시 수성구 달구벌대로627길 22-19(매호동)
4th row: 대구광역시 수성구 국채보상로 960, 1층 (범어동)
5th row: 대구광역시 수성구 동대구로 386, 킹덤오피스텔 1509동 1호 (범어동)
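The sample addresses end with the neighbourhood (dong) in parentheses, sometimes preceded by a space. A minimal sketch for separating that part is shown below. The function name `split_address` and the regular expression are assumptions for illustration. Addresses without a trailing parenthesis return `None` for the dong; the category counts for this variable report 69 opening parentheses across 71 rows, so such rows appear to exist.

```python
import re

# A trailing "(...)" holds the dong; optional whitespace may follow it.
DONG_RE = re.compile(r"\(([^()]+)\)\s*$")

def split_address(address):
    """Return (road_part, dong); dong is None without trailing parentheses."""
    match = DONG_RE.search(address)
    if match is None:
        return address.strip(), None
    return address[:match.start()].strip(), match.group(1)

samples = [
    "대구광역시 수성구 들안로81길37(수성동4가)",
    "대구광역시 수성구 무학로 196(지산동)",
    "대구광역시 수성구 국채보상로 960, 1층 (범어동)",
]
for address in samples:
    print(split_address(address))
```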
Value | Count | Frequency (%)
대구광역시 | 71 | 18.2%
수성구 | 69 | 17.7%
1층 | 16 | 4.1%
만촌동 | 12 | 3.1%
지산동 | 11 | 2.8%
2층 | 10 | 2.6%
3층 | 7 | 1.8%
범어동 | 6 | 1.5%
중동 | 6 | 1.5%
상동 | 6 | 1.5%
Other values (141) | 176 | 45.1%

Most occurring characters

ValueCountFrequency (%)
321
 
16.6%
147
 
7.6%
88
 
4.6%
82
 
4.2%
79
 
4.1%
78
 
4.0%
78
 
4.0%
1 75
 
3.9%
71
 
3.7%
71
 
3.7%
Other values (74) 841
43.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1113
57.6%
Space Separator 321
 
16.6%
Decimal Number 297
 
15.4%
Open Punctuation 69
 
3.6%
Close Punctuation 69
 
3.6%
Other Punctuation 47
 
2.4%
Dash Punctuation 15
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
147
13.2%
88
 
7.9%
82
 
7.4%
79
 
7.1%
78
 
7.0%
78
 
7.0%
71
 
6.4%
71
 
6.4%
69
 
6.2%
39
 
3.5%
Other values (59) 311
27.9%
Decimal Number
ValueCountFrequency (%)
1 75
25.3%
2 44
14.8%
4 38
12.8%
3 34
11.4%
6 26
 
8.8%
7 21
 
7.1%
5 18
 
6.1%
9 15
 
5.1%
0 14
 
4.7%
8 12
 
4.0%
Space Separator
ValueCountFrequency (%)
321
100.0%
Open Punctuation
ValueCountFrequency (%)
( 69
100.0%
Close Punctuation
ValueCountFrequency (%)
) 69
100.0%
Other Punctuation
ValueCountFrequency (%)
, 47
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1113
57.6%
Common 818
42.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
147
13.2%
88
 
7.9%
82
 
7.4%
79
 
7.1%
78
 
7.0%
78
 
7.0%
71
 
6.4%
71
 
6.4%
69
 
6.2%
39
 
3.5%
Other values (59) 311
27.9%
Common
ValueCountFrequency (%)
321
39.2%
1 75
 
9.2%
( 69
 
8.4%
) 69
 
8.4%
, 47
 
5.7%
2 44
 
5.4%
4 38
 
4.6%
3 34
 
4.2%
6 26
 
3.2%
7 21
 
2.6%
Other values (5) 74
 
9.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1113
57.6%
ASCII 818
42.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
321
39.2%
1 75
 
9.2%
( 69
 
8.4%
) 69
 
8.4%
, 47
 
5.7%
2 44
 
5.4%
4 38
 
4.6%
3 34
 
4.2%
6 26
 
3.2%
7 21
 
2.6%
Other values (5) 74
 
9.0%
Hangul
ValueCountFrequency (%)
147
13.2%
88
 
7.9%
82
 
7.4%
79
 
7.1%
78
 
7.0%
78
 
7.0%
71
 
6.4%
71
 
6.4%
69
 
6.2%
39
 
3.5%
Other values (59) 311
27.9%

전화번호
Text

MISSING 

Distinct: 67
Distinct (%): 100.0%
Missing: 4
Missing (%): 5.6%
Memory size: 700.0 B

Length

Max length: 14
Median length: 12
Mean length: 11.671642
Min length: 9

Characters and Unicode

Total characters: 782
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 67
Unique (%): 100.0%

Sample

1st row: 0507-1318-8118
2nd row: 053-767-4542
3rd row: 1522-7657
4th row: 053-716-3549
5th row: 1670-1559
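The sample values show two shapes of phone number: three dash-separated groups starting with 0 (for example 053-767-4542) and two groups starting with 1 (for example 1522-7657). The sketch below classifies a value by shape. The patterns and the labels `three_part` and `two_part` are assumptions based only on the samples shown here, not a full specification of Korean phone numbers.

```python
import re

THREE_PART_RE = re.compile(r"^0\d{1,3}-\d{3,4}-\d{4}$")  # e.g. 053-767-4542
TWO_PART_RE = re.compile(r"^1\d{3}-\d{4}$")              # e.g. 1522-7657

def phone_shape(value):
    """Classify a phone string by its dash-separated shape."""
    if value is None:
        return "missing"
    if THREE_PART_RE.match(value):
        return "three_part"
    if TWO_PART_RE.match(value):
        return "two_part"
    return "other"

samples = ["0507-1318-8118", "053-767-4542", "1522-7657", None]
for value in samples:
    print(value, phone_shape(value))
```

The shortest and longest samples here, `1522-7657` (9 characters) and `0507-1318-8118` (14 characters), match the reported minimum and maximum lengths.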
Value | Count | Frequency (%)
1599-1854 | 1 | 1.5%
053-766-8234 | 1 | 1.5%
053-751-3112 | 1 | 1.5%
053-784-6080 | 1 | 1.5%
053-286-9212 | 1 | 1.5%
053-765-3114 | 1 | 1.5%
053-752-8255 | 1 | 1.5%
053-958-5670 | 1 | 1.5%
053-765-4288 | 1 | 1.5%
053-755-9328 | 1 | 1.5%
Other values (57) | 57 | 85.1%

Most occurring characters

ValueCountFrequency (%)
- 125
16.0%
5 106
13.6%
3 99
12.7%
0 82
10.5%
7 75
9.6%
6 63
8.1%
1 60
7.7%
2 51
6.5%
4 49
 
6.3%
8 40
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 657
84.0%
Dash Punctuation 125
 
16.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 106
16.1%
3 99
15.1%
0 82
12.5%
7 75
11.4%
6 63
9.6%
1 60
9.1%
2 51
7.8%
4 49
7.5%
8 40
 
6.1%
9 32
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 125
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 782
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 125
16.0%
5 106
13.6%
3 99
12.7%
0 82
10.5%
7 75
9.6%
6 63
8.1%
1 60
7.7%
2 51
6.5%
4 49
 
6.3%
8 40
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 782
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 125
16.0%
5 106
13.6%
3 99
12.7%
0 82
10.5%
7 75
9.6%
6 63
8.1%
1 60
7.7%
2 51
6.5%
4 49
 
6.3%
8 40
 
5.1%

Interactions


Correlations

Variable | 순번 | 소독업소명칭 | 소재지(도로명) | 전화번호
순번 | 1.000 | 1.000 | 1.000 | 1.000
소독업소명칭 | 1.000 | 1.000 | 1.000 | 1.000
소재지(도로명) | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 1.000 | 1.000 | 1.000 | 1.000
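A matrix of all 1.000 is what one would expect when every column holds a distinct value in each row: any association measure for categorical data then sees a perfect one-to-one mapping. The sketch below illustrates this with a plain (uncorrected) Cramér's V on synthetic all-unique columns. It is an illustration of the effect only; the profiler's own correlation method may differ (for instance by applying a bias correction or binning numeric values), and the column values here are made up.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row = Counter(xs)
    col = Counter(ys)
    chi2 = 0.0
    for x in row:
        for y in col:
            expected = row[x] * col[y] / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else float("nan")

# Synthetic stand-ins for two all-unique columns of 71 rows.
names = [f"name-{i}" for i in range(1, 72)]
addresses = [f"addr-{i}" for i in range(1, 72)]

v = cramers_v(names, addresses)
print(round(v, 3))
```

With real data this means the correlation table carries little information for this dataset: the value 1.000 reflects uniqueness rather than a meaningful relationship.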

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
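The two plots summarise where values are present or absent. As a small sketch of the underlying computation, the code below builds a per-column missing count and a 0/1 nullity matrix for two columns of the first ten sample rows, with `None` standing in for `<NA>`. The variable names are chosen for this example only.

```python
# Two columns from the first ten sample rows; None stands for <NA>.
columns = {
    "순번": list(range(1, 11)),
    "전화번호": [
        "0507-1318-8118", "053-767-4542", "1522-7657", "053-716-3549", None,
        "1670-1559", "053-761-3114", "053-745-0529", "053-566-6979", "1599-1854",
    ],
}

# Bar-chart view: number of missing cells per column.
missing_per_column = {
    name: sum(value is None for value in values)
    for name, values in columns.items()
}

# Matrix view: one row per record, 1 = present, 0 = missing.
nullity_matrix = [
    [0 if value is None else 1 for value in row]
    for row in zip(*columns.values())
]

# Whole dataset, from the report: 4 of 71 phone numbers are missing.
full_missing_pct = round(100 * 4 / 71, 1)

print(missing_per_column, full_missing_pct)
```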

Sample

First rows

Index | 순번 | 영업구분 | 소독업소명칭 | 소재지(도로명) | 전화번호
0 | 1 | 영업중 | 푸레자(PUREZA) | 대구광역시 수성구 들안로81길37(수성동4가) | 0507-1318-8118
1 | 2 | 영업중 | 주식회사 디지티모빌리티 | 대구광역시 수성구 무학로 196(지산동) | 053-767-4542
2 | 3 | 영업중 | 화담주식회사 | 대구광역시 수성구 달구벌대로627길 22-19(매호동) | 1522-7657
3 | 4 | 영업중 | 미르크린파워 | 대구광역시 수성구 국채보상로 960, 1층 (범어동) | 053-716-3549
4 | 5 | 영업중 | 원스톱 방역 | 대구광역시 수성구 동대구로 386, 킹덤오피스텔 1509동 1호 (범어동) | <NA>
5 | 6 | 영업중 | 안방 | 대구광역시 수성구 범어로34길 8(범어동) | 1670-1559
6 | 7 | 영업중 | 주식회사 한영 | 대구광역시 수성구 동대구로12길 75, 2층 (지산동) | 053-761-3114
7 | 8 | 영업중 | 대호 | 대구광역시 수성구 지범로 43-11, 1층 (지산동) | 053-745-0529
8 | 9 | 영업중 | 준크린서비스 | 대구광역시 수성구 무학로 173, 3층 (지산동) | 053-566-6979
9 | 10 | 영업중 | 청춘클린 | 대구광역시 수성구 범안로 21-1, 1층 (범물동) | 1599-1854

Last rows

Index | 순번 | 영업구분 | 소독업소명칭 | 소재지(도로명) | 전화번호
61 | 62 | 영업중 | 금호엔지니어링 | 대구광역시 수성구 시지로 50-1, 수성알파빌딩 (시지동) | 053-746-2044
62 | 63 | 영업중 | (주)고신 | 대구광역시 수성구 상화로4길 30 (상동) | 053-252-1421
63 | 64 | 영업중 | 주식회사 대성에코홀딩스 | 대구광역시 수성구 만촌로4길 10 (만촌동) | 053-741-3318
64 | 65 | 영업중 | (주)티지환경 | 대구광역시 수성구 희망로 229, 황금스퀘어 602호 (황금동) | 053-753-7125
65 | 66 | 영업중 | 동우씨엠(주) | 대구광역시 수성구 화랑로32길 67, 1층 (만촌동) | 053-742-3344
66 | 67 | 영업중 | 영풍방역공사 | 대구광역시 수성구 공경로 49 (만촌동) | 053-768-6375
67 | 68 | 영업중 | 미래환경 | 대구광역시 수성구 국채보상로 924 (범어동) | 053-742-8233
68 | 69 | 영업중 | 푸른방역공사 | 대구광역시 수성구 청수로45길 41 (황금동) | 053-766-4703
69 | 70 | 영업중 | 대경기업 | 대구광역시 수성구 욱수천로 15-2 (욱수동) | 053-768-2152
70 | 71 | 영업중 | 방역의 실력 | 대구광역시 수성구 청호로 240 (황금동) | <NA>