Overview

Dataset statistics

Number of variables6
Number of observations40
Missing cells45
Missing cells (%)18.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory52.3 B

Variable types

Numeric1
Text5

Dataset

Description대구광역시_동구_특화사업_20200428
Author대구광역시 동구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=3054443&dataSetDetailId=30544432a9689432b372&provdMethod=FILE

Alerts

Unnamed: 5 has constant value ""Constant
전화 has 6 (15.0%) missing valuesMissing
Unnamed: 5 has 39 (97.5%) missing valuesMissing
번호 has unique valuesUnique
사업체명 has unique valuesUnique

Reproduction

Analysis started2024-04-19 05:41:20.252178
Analysis finished2024-04-19 05:41:20.937014
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.5
Minimum1
Maximum40
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2024-04-19T14:41:21.016098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.95
Q110.75
median20.5
Q330.25
95-th percentile38.05
Maximum40
Range39
Interquartile range (IQR)19.5

Descriptive statistics

Standard deviation11.690452
Coefficient of variation (CV)0.57026595
Kurtosis-1.2
Mean20.5
Median Absolute Deviation (MAD)10
Skewness0
Sum820
Variance136.66667
MonotonicityStrictly increasing
2024-04-19T14:41:21.161763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
1 1
 
2.5%
22 1
 
2.5%
24 1
 
2.5%
25 1
 
2.5%
26 1
 
2.5%
27 1
 
2.5%
28 1
 
2.5%
29 1
 
2.5%
30 1
 
2.5%
31 1
 
2.5%
Other values (30) 30
75.0%
ValueCountFrequency (%)
1 1
2.5%
2 1
2.5%
3 1
2.5%
4 1
2.5%
5 1
2.5%
6 1
2.5%
7 1
2.5%
8 1
2.5%
9 1
2.5%
10 1
2.5%
ValueCountFrequency (%)
40 1
2.5%
39 1
2.5%
38 1
2.5%
37 1
2.5%
36 1
2.5%
35 1
2.5%
34 1
2.5%
33 1
2.5%
32 1
2.5%
31 1
2.5%

사업체명
Text

UNIQUE 

Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
2024-04-19T14:41:21.444296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length4.925
Min length3

Characters and Unicode

Total characters197
Distinct characters95
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)100.0%

Sample

1st row유산수단
2nd row불로국악기제작소
3rd row아름다운공예
4th row예목공방
5th row인목전통창호연구소
ValueCountFrequency (%)
유산수단 1
 
2.5%
불로국악기제작소 1
 
2.5%
삼성공예사 1
 
2.5%
홍익차 1
 
2.5%
새벽공예 1
 
2.5%
상주요(부부도예 1
 
2.5%
기천도자문화원 1
 
2.5%
옹크씨 1
 
2.5%
함월도예 1
 
2.5%
왕산악국악기 1
 
2.5%
Other values (30) 30
75.0%
2024-04-19T14:41:21.886998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
12.2%
23
 
11.7%
9
 
4.6%
4
 
2.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
4
 
2.0%
Other values (85) 113
57.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 195
99.0%
Open Punctuation 1
 
0.5%
Close Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
12.3%
23
 
11.8%
9
 
4.6%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (83) 111
56.9%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 195
99.0%
Common 2
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
12.3%
23
 
11.8%
9
 
4.6%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (83) 111
56.9%
Common
ValueCountFrequency (%)
( 1
50.0%
) 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 195
99.0%
ASCII 2
 
1.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
12.3%
23
 
11.8%
9
 
4.6%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (83) 111
56.9%
ASCII
ValueCountFrequency (%)
( 1
50.0%
) 1
50.0%
Distinct37
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
2024-04-19T14:41:22.105630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length8
Mean length5.4
Min length2

Characters and Unicode

Total characters216
Distinct characters76
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)87.5%

Sample

1st row장롱셋트외50여종
2nd row농악기,전품목
3rd row옻칠,자개
4th row괴목
5th row창호문,사찰,주택
ValueCountFrequency (%)
서각 3
 
6.8%
2
 
4.5%
2
 
4.5%
장롱셋트외50여종 2
 
4.5%
목공예제품 1
 
2.3%
조각류 1
 
2.3%
다기류 1
 
2.3%
목검류 1
 
2.3%
불자함 1
 
2.3%
국악기 1
 
2.3%
Other values (29) 29
65.9%
2024-04-19T14:41:22.445462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
7.9%
, 15
 
6.9%
14
 
6.5%
11
 
5.1%
11
 
5.1%
7
 
3.2%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (66) 120
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 187
86.6%
Other Punctuation 15
 
6.9%
Decimal Number 10
 
4.6%
Space Separator 4
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
9.1%
14
 
7.5%
11
 
5.9%
11
 
5.9%
7
 
3.7%
6
 
3.2%
5
 
2.7%
5
 
2.7%
5
 
2.7%
5
 
2.7%
Other values (61) 101
54.0%
Decimal Number
ValueCountFrequency (%)
0 5
50.0%
5 3
30.0%
2 2
 
20.0%
Other Punctuation
ValueCountFrequency (%)
, 15
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 187
86.6%
Common 29
 
13.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
9.1%
14
 
7.5%
11
 
5.9%
11
 
5.9%
7
 
3.7%
6
 
3.2%
5
 
2.7%
5
 
2.7%
5
 
2.7%
5
 
2.7%
Other values (61) 101
54.0%
Common
ValueCountFrequency (%)
, 15
51.7%
0 5
 
17.2%
4
 
13.8%
5 3
 
10.3%
2 2
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 187
86.6%
ASCII 29
 
13.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
 
9.1%
14
 
7.5%
11
 
5.9%
11
 
5.9%
7
 
3.7%
6
 
3.2%
5
 
2.7%
5
 
2.7%
5
 
2.7%
5
 
2.7%
Other values (61) 101
54.0%
ASCII
ValueCountFrequency (%)
, 15
51.7%
0 5
 
17.2%
4
 
13.8%
5 3
 
10.3%
2 2
 
6.9%
Distinct39
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
2024-04-19T14:41:22.717948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length22
Mean length19.75
Min length13

Characters and Unicode

Total characters790
Distinct characters95
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)95.0%

Sample

1st row대구 동구 용천로 480(신무동)
2nd row대구 동구 팔공로 28길 30(불로동)
3rd row대구 동구 사복로 138 (숙천동)
4th row대구 동구 팔공로 975 (미곡동)
5th row대구 동구 안심로 475-1 (괴전동)
ValueCountFrequency (%)
대구 24
 
13.3%
동구 21
 
11.6%
영천시 8
 
4.4%
팔공로 8
 
4.4%
28길 4
 
2.2%
불로동 4
 
2.2%
팔공로28길 3
 
1.7%
경산시 3
 
1.7%
도동 3
 
1.7%
경북 2
 
1.1%
Other values (93) 101
55.8%
2024-04-19T14:41:23.121308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
151
19.1%
47
 
5.9%
46
 
5.8%
1 45
 
5.7%
35
 
4.4%
25
 
3.2%
2 23
 
2.9%
( 23
 
2.9%
) 23
 
2.9%
21
 
2.7%
Other values (85) 351
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 405
51.3%
Decimal Number 170
21.5%
Space Separator 151
 
19.1%
Open Punctuation 23
 
2.9%
Close Punctuation 23
 
2.9%
Dash Punctuation 16
 
2.0%
Other Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
11.6%
46
 
11.4%
35
 
8.6%
25
 
6.2%
21
 
5.2%
17
 
4.2%
16
 
4.0%
14
 
3.5%
14
 
3.5%
11
 
2.7%
Other values (70) 159
39.3%
Decimal Number
ValueCountFrequency (%)
1 45
26.5%
2 23
13.5%
3 17
 
10.0%
6 14
 
8.2%
7 13
 
7.6%
0 13
 
7.6%
4 13
 
7.6%
5 13
 
7.6%
8 12
 
7.1%
9 7
 
4.1%
Space Separator
ValueCountFrequency (%)
151
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 405
51.3%
Common 385
48.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
11.6%
46
 
11.4%
35
 
8.6%
25
 
6.2%
21
 
5.2%
17
 
4.2%
16
 
4.0%
14
 
3.5%
14
 
3.5%
11
 
2.7%
Other values (70) 159
39.3%
Common
ValueCountFrequency (%)
151
39.2%
1 45
 
11.7%
2 23
 
6.0%
( 23
 
6.0%
) 23
 
6.0%
3 17
 
4.4%
- 16
 
4.2%
6 14
 
3.6%
7 13
 
3.4%
0 13
 
3.4%
Other values (5) 47
 
12.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 405
51.3%
ASCII 385
48.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
151
39.2%
1 45
 
11.7%
2 23
 
6.0%
( 23
 
6.0%
) 23
 
6.0%
3 17
 
4.4%
- 16
 
4.2%
6 14
 
3.6%
7 13
 
3.4%
0 13
 
3.4%
Other values (5) 47
 
12.2%
Hangul
ValueCountFrequency (%)
47
 
11.6%
46
 
11.4%
35
 
8.6%
25
 
6.2%
21
 
5.2%
17
 
4.2%
16
 
4.0%
14
 
3.5%
14
 
3.5%
11
 
2.7%
Other values (70) 159
39.3%

전화
Text

MISSING 

Distinct34
Distinct (%)100.0%
Missing6
Missing (%)15.0%
Memory size452.0 B
2024-04-19T14:41:23.352123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters408
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row053-981-1917
2nd row053-984-7721
3rd row053-963-7155
4th row053-983-9842
5th row053-961-0800
ValueCountFrequency (%)
054-336-1172 1
 
2.9%
053-984-0213 1
 
2.9%
053-983-6803 1
 
2.9%
054-976-5000 1
 
2.9%
053-981-8889 1
 
2.9%
054-332-9605 1
 
2.9%
054-534-7910 1
 
2.9%
053-425-0015 1
 
2.9%
053-982-5782 1
 
2.9%
054-336-3071 1
 
2.9%
Other values (24) 24
70.6%
2024-04-19T14:41:23.680891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 68
16.7%
0 55
13.5%
3 55
13.5%
5 54
13.2%
9 35
8.6%
8 34
8.3%
1 30
7.4%
7 23
 
5.6%
4 20
 
4.9%
6 17
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 340
83.3%
Dash Punctuation 68
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 55
16.2%
3 55
16.2%
5 54
15.9%
9 35
10.3%
8 34
10.0%
1 30
8.8%
7 23
6.8%
4 20
 
5.9%
6 17
 
5.0%
2 17
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 68
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 408
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 68
16.7%
0 55
13.5%
3 55
13.5%
5 54
13.2%
9 35
8.6%
8 34
8.3%
1 30
7.4%
7 23
 
5.6%
4 20
 
4.9%
6 17
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 408
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 68
16.7%
0 55
13.5%
3 55
13.5%
5 54
13.2%
9 35
8.6%
8 34
8.3%
1 30
7.4%
7 23
 
5.6%
4 20
 
4.9%
6 17
 
4.2%

Unnamed: 5
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing39
Missing (%)97.5%
Memory size452.0 B
2024-04-19T14:41:23.850516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters12
Distinct characters8
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row053-981-5836
ValueCountFrequency (%)
053-981-5836 1
100.0%
2024-04-19T14:41:24.123904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 2
16.7%
3 2
16.7%
- 2
16.7%
8 2
16.7%
0 1
8.3%
9 1
8.3%
1 1
8.3%
6 1
8.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10
83.3%
Dash Punctuation 2
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 2
20.0%
3 2
20.0%
8 2
20.0%
0 1
10.0%
9 1
10.0%
1 1
10.0%
6 1
10.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 2
16.7%
3 2
16.7%
- 2
16.7%
8 2
16.7%
0 1
8.3%
9 1
8.3%
1 1
8.3%
6 1
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 2
16.7%
3 2
16.7%
- 2
16.7%
8 2
16.7%
0 1
8.3%
9 1
8.3%
1 1
8.3%
6 1
8.3%

Interactions

2024-04-19T14:41:20.567104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-19T14:41:24.229874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호사업체명주요생산품소재지 주소전화
번호1.0001.0000.7820.9321.000
사업체명1.0001.0001.0001.0001.000
주요생산품0.7821.0001.0000.9811.000
소재지 주소0.9321.0000.9811.0001.000
전화1.0001.0001.0001.0001.000

Missing values

2024-04-19T14:41:20.694398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T14:41:20.800050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-19T14:41:20.891730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호사업체명주요생산품소재지 주소전화Unnamed: 5
01유산수단장롱셋트외50여종대구 동구 용천로 480(신무동)053-981-1917<NA>
12불로국악기제작소농악기,전품목대구 동구 팔공로 28길 30(불로동)053-984-7721<NA>
23아름다운공예옻칠,자개대구 동구 사복로 138 (숙천동)053-963-7155<NA>
34예목공방괴목대구 동구 팔공로 975 (미곡동)053-983-9842<NA>
45인목전통창호연구소창호문,사찰,주택대구 동구 안심로 475-1 (괴전동)053-961-0800<NA>
56한성목공예목공예소품대구 동구 팔공로 28길 73-15 (도동)053-984-3189<NA>
67풍년국악기농악기경산시 와촌면 새터길 55 (신한리)053-984-9771<NA>
78대영공예사오동다기함영천시 북안면 유하큰길 112054-335-0977<NA>
89대혁농산서각영천시 청통면 금송로 1354054-336-0157<NA>
910참선목공예품목탁종류,다기류영천시 하이브리드로 774 (본촌동)054-336-1172<NA>
번호사업체명주요생산품소재지 주소전화Unnamed: 5
3031이가목검목검류대구 달성군 가창면 가창로 1011-17053-984-0213<NA>
3132한국조각공예서각대구 동구 팔공로28길 61-3(도동)053-985-0757<NA>
3233예인공방조각류대구 동구 팔공로27길 19-10(불로동)<NA>053-981-5836
3334휴천각실서각대구 동구 팔공로101길 55, 206-105(지묘동, 팔공보성2차)053-984-1717<NA>
3435청마공예목공예제품경산시 압량면 가일길34053-817-6831<NA>
3536대동목공소창호문,사찰대구 동구 팔공로27길 27-3(불로동)053-983-3372<NA>
3637신정공예찻상대구 동구 팔공로201길 13(백안동)053-983-1291<NA>
3738천안공예불자함대구 동구 팔공로28길 162(도동)053-981-0164<NA>
3839다해공방다기세트대구 동구 팔공로 851(미곡동)053-982-5782<NA>
3940명인당도장종류20여종대구 동구 불로동 192053-985-1551<NA>