Overview

Dataset statistics

Number of variables: 5
Number of observations: 113
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.5 KiB
Average record size in memory: 41.2 B

Variable types

Categorical: 2
Text: 2
DateTime: 1

Dataset

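Description: Provides data on the status of firefighting facility businesses within the jurisdiction of Jeju-si, Jeju Special Self-Governing Province.
Author: Jeju-si, Jeju Special Self-Governing Province
URL: https://www.data.go.kr/data/3082485/fileData.do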

Alerts

데이터기준일자 (data reference date) has a constant value, 2020-08-31 [Constant]
구분 (category) is highly overall correlated with 등록업종 (registered business type) [High correlation]
등록업종 is highly overall correlated with 구분 [High correlation]
등록업종 is highly imbalanced (56.6%) [Imbalance]

Reproduction

Analysis started: 2023-12-12 20:31:06.713345
Analysis finished: 2023-12-12 20:31:07.131479
Duration: 0.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
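
As a quick consistency check, the reported duration can be recomputed from the two timestamps above. This is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 20:31:06.713345")
finished = datetime.fromisoformat("2023-12-12 20:31:07.131479")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
duration_text = f"{duration:.2f} seconds"
print(duration_text)  # 0.42 seconds
```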

Variables

구분 (category)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

공사업 (construction): 88
설계업 (design): 13
감리업 (supervision): 12

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공사업
2nd row: 공사업
3rd row: 공사업
4th row: 공사업
5th row: 공사업

Common Values

Value | Count | Frequency (%)
공사업 | 88 | 77.9%
설계업 | 13 | 11.5%
감리업 | 12 | 10.6%
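
The Frequency (%) column appears to be each count divided by the number of observations (113). The sketch below recomputes the percentages from the counts in the table above; it is an illustration with hypothetical variable names, not the profiler's own code.

```python
# Counts copied from the Common Values table for 구분.
counts = {"공사업": 88, "설계업": 13, "감리업": 12}

# The three counts together cover all 113 observations.
total = sum(counts.values())

# Share of each value, formatted to one decimal place like the report.
frequencies = {value: f"{count / total:.1%}" for value, count in counts.items()}

for value, share in frequencies.items():
    print(value, share)
```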

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

상호 (business name)
Text

Distinct: 99
Distinct (%): 87.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 13
Median length: 12
Mean length: 8.3274336
Min length: 3

Characters and Unicode

Total characters: 941
Distinct characters: 121
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
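
To illustrate how such character properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The two business names are taken from the sample rows of this report; the mapping from category codes to the long names used in the report is written out by hand and covers only the five categories reported for this column.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this column.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
}

# Two values of 상호 taken from the report's sample rows.
names = ["주식회사 씨에이치피산업", "(주)태현전기"]

# Count the general category of every character in every name.
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for name in names
    for ch in name
)

for category, count in category_counts.most_common():
    print(category, count)
```

Hangul syllables fall under Other Letter, which is why that category dominates the counts for this column.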

Unique

Unique: 88
Unique (%): 77.9%

Sample

1st row: 주식회사 씨에이치피산업
2nd row: 주식회사 삼화방재
3rd row: 영보건설 주식회사
4th row: 주식회사 신성엔지니어링
5th row: 주식회사 한성이엔지

Value | Count | Frequency (%)
주식회사 | 50 | 30.7%
주)대경엔지니어링 | 3 | 1.8%
한성이엔지 | 3 | 1.8%
주)윤엔지니어링 | 3 | 1.8%
신아이엔씨 | 2 | 1.2%
다온엔지니어링 | 2 | 1.2%
퐁낭eng | 2 | 1.2%
대영기업 | 2 | 1.2%
아이앤 | 2 | 1.2%
주)종합전기기술사사무소 | 2 | 1.2%
Other values (90) | 92 | 56.4%

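Most occurring characters

Value | Count | Frequency (%)
[character missing] | 110 | 11.7%
( | 57 | 6.1%
) | 57 | 6.1%
[character missing] | 57 | 6.1%
[character missing] | 51 | 5.4%
[character missing] | 50 | 5.3%
[character missing] | 50 | 5.3%
[character missing] | 31 | 3.3%
[character missing] | 27 | 2.9%
[character missing] | 27 | 2.9%
Other values (111) | 424 | 45.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 771 | 81.9%
Open Punctuation | 57 | 6.1%
Close Punctuation | 57 | 6.1%
Space Separator | 50 | 5.3%
Uppercase Letter | 6 | 0.6%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[character missing] | 110 | 14.3%
[character missing] | 57 | 7.4%
[character missing] | 51 | 6.6%
[character missing] | 50 | 6.5%
[character missing] | 31 | 4.0%
[character missing] | 27 | 3.5%
[character missing] | 27 | 3.5%
[character missing] | 25 | 3.2%
[character missing] | 21 | 2.7%
[character missing] | 21 | 2.7%
Other values (105) | 351 | 45.5%

Uppercase Letter
Value | Count | Frequency (%)
N | 2 | 33.3%
E | 2 | 33.3%
G | 2 | 33.3%

Open Punctuation
Value | Count | Frequency (%)
( | 57 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 57 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 50 | 100.0%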

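Most occurring scripts

Value | Count | Frequency (%)
Hangul | 771 | 81.9%
Common | 164 | 17.4%
Latin | 6 | 0.6%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[character missing] | 110 | 14.3%
[character missing] | 57 | 7.4%
[character missing] | 51 | 6.6%
[character missing] | 50 | 6.5%
[character missing] | 31 | 4.0%
[character missing] | 27 | 3.5%
[character missing] | 27 | 3.5%
[character missing] | 25 | 3.2%
[character missing] | 21 | 2.7%
[character missing] | 21 | 2.7%
Other values (105) | 351 | 45.5%

Common
Value | Count | Frequency (%)
( | 57 | 34.8%
) | 57 | 34.8%
(space) | 50 | 30.5%

Latin
Value | Count | Frequency (%)
N | 2 | 33.3%
E | 2 | 33.3%
G | 2 | 33.3%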

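Most occurring blocks

Value | Count | Frequency (%)
Hangul | 771 | 81.9%
ASCII | 170 | 18.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[character missing] | 110 | 14.3%
[character missing] | 57 | 7.4%
[character missing] | 51 | 6.6%
[character missing] | 50 | 6.5%
[character missing] | 31 | 4.0%
[character missing] | 27 | 3.5%
[character missing] | 27 | 3.5%
[character missing] | 25 | 3.2%
[character missing] | 21 | 2.7%
[character missing] | 21 | 2.7%
Other values (105) | 351 | 45.5%

ASCII
Value | Count | Frequency (%)
( | 57 | 33.5%
) | 57 | 33.5%
(space) | 50 | 29.4%
N | 2 | 1.2%
E | 2 | 1.2%
G | 2 | 1.2%
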
전화번호 (phone number)
Text

Distinct: 99
Distinct (%): 87.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.026549
Min length: 12

Characters and Unicode

Total characters: 1359
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 88
Unique (%): 77.9%

Sample

1st row: 070-8119-4318
2nd row: 064-724-1193
3rd row: 064-746-8857
4th row: 064-751-2255
5th row: 064-751-0815

Value | Count | Frequency (%)
064-744-4112 | 3 | 2.7%
064-755-1864 | 3 | 2.7%
064-751-0815 | 3 | 2.7%
064-805-8483 | 2 | 1.8%
064-727-0466 | 2 | 1.8%
064-702-7287 | 2 | 1.8%
064-746-8229 | 2 | 1.8%
064-747-5597 | 2 | 1.8%
064-725-2002 | 2 | 1.8%
064-725-3690 | 2 | 1.8%
Other values (89) | 90 | 79.6%
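
The values shown above all follow a hyphen-separated pattern of digits with a length of 12 or 13 characters. The sketch below checks the five sample rows against a regular expression; the pattern is an assumption inferred from those values, not a rule stated by the dataset publisher.

```python
import re

# Assumed pattern: area or service prefix, exchange, four-digit line number.
PHONE_PATTERN = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")

# The five sample rows of 전화번호 from the report.
samples = [
    "070-8119-4318",
    "064-724-1193",
    "064-746-8857",
    "064-751-2255",
    "064-751-0815",
]

# fullmatch requires the whole string to fit the pattern.
valid = [bool(PHONE_PATTERN.fullmatch(s)) for s in samples]
lengths = [len(s) for s in samples]
dashes = [s.count("-") for s in samples]

print(valid, lengths, dashes)
```

Two hyphens per value is consistent with the 226 Dash Punctuation characters reported for 113 observations.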

Most occurring characters

Value | Count | Frequency (%)
- | 226 | 16.6%
4 | 204 | 15.0%
0 | 174 | 12.8%
7 | 163 | 12.0%
6 | 156 | 11.5%
1 | 107 | 7.9%
2 | 90 | 6.6%
5 | 80 | 5.9%
9 | 62 | 4.6%
3 | 57 | 4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1133 | 83.4%
Dash Punctuation | 226 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
4 | 204 | 18.0%
0 | 174 | 15.4%
7 | 163 | 14.4%
6 | 156 | 13.8%
1 | 107 | 9.4%
2 | 90 | 7.9%
5 | 80 | 7.1%
9 | 62 | 5.5%
3 | 57 | 5.0%
8 | 40 | 3.5%

Dash Punctuation
Value | Count | Frequency (%)
- | 226 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1359 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 226 | 16.6%
4 | 204 | 15.0%
0 | 174 | 12.8%
7 | 163 | 12.0%
6 | 156 | 11.5%
1 | 107 | 7.9%
2 | 90 | 6.6%
5 | 80 | 5.9%
9 | 62 | 4.6%
3 | 57 | 4.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1359 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 226 | 16.6%
4 | 204 | 15.0%
0 | 174 | 12.8%
7 | 163 | 12.0%
6 | 156 | 11.5%
1 | 107 | 7.9%
2 | 90 | 6.6%
5 | 80 | 5.9%
9 | 62 | 4.6%
3 | 57 | 4.2%

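등록업종 (registered business type)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 4
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

전문 (specialized): 90
일반(전기). 일반(기계) (general, electrical and mechanical): 20
일반(기계) (general, mechanical): 2
일반(전기) (general, electrical): 1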

Length

Max length: 14
Median length: 2
Mean length: 4.2300885
Min length: 2
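
These length statistics can be reproduced from the value counts reported for 등록업종. The sketch below rebuilds the list of string lengths from those counts and summarises it with the standard library; the variable names are illustrative.

```python
from statistics import mean, median

# Value counts for 등록업종 as reported in this section.
value_counts = {
    "전문": 90,
    "일반(전기). 일반(기계)": 20,
    "일반(기계)": 2,
    "일반(전기)": 1,
}

# One length entry per observation (113 in total).
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

max_length = max(lengths)
median_length = median(lengths)
mean_length = round(mean(lengths), 7)
min_length = min(lengths)

print(max_length, median_length, mean_length, min_length)  # 14 2 4.2300885 2
```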

Unique

Unique: 1
Unique (%): 0.9%
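
The report distinguishes Distinct (how many different values occur) from Unique (how many values occur exactly once). The sketch below shows that reading on a column rebuilt from the 등록업종 value counts; it is an illustration of the two definitions, not the profiler's implementation.

```python
from collections import Counter

# Column rebuilt from the reported value counts for 등록업종.
column = (
    ["전문"] * 90
    + ["일반(전기). 일반(기계)"] * 20
    + ["일반(기계)"] * 2
    + ["일반(전기)"] * 1
)

counts = Counter(column)
n = len(column)

# Distinct: number of different values.
distinct = len(counts)
# Unique: number of values that appear exactly once.
unique = sum(1 for c in counts.values() if c == 1)

distinct_pct = f"{distinct / n:.1%}"
unique_pct = f"{unique / n:.1%}"
print(distinct, distinct_pct, unique, unique_pct)  # 4 3.5% 1 0.9%
```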

Sample

1st row: 전문
2nd row: 전문
3rd row: 전문
4th row: 전문
5th row: 전문

Common Values

Value | Count | Frequency (%)
전문 | 90 | 79.6%
일반(전기). 일반(기계) | 20 | 17.7%
일반(기계) | 2 | 1.8%
일반(전기) | 1 | 0.9%
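
The Imbalance alert reports 56.6% for this column. One formula that reproduces that figure from the counts above is one minus the Shannon entropy of the value distribution, normalised by the maximum entropy for the number of classes. The sketch below applies it; treating this as the profiler's exact definition is an assumption, and the function name is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy in bits.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    Assumes at least two classes.
    """
    n = sum(counts)
    probabilities = [c / n for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probabilities)
    max_entropy = math.log2(len(counts))
    return 1 - entropy / max_entropy

# Counts from the Common Values table for 등록업종.
score = imbalance_score([90, 20, 2, 1])
score_text = f"{score:.1%}"
print(score_text)  # 56.6%
```

A perfectly balanced column scores 0 under this formula, so 56.6% reflects how strongly 전문 dominates the four values.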

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
전문 | 90 | 67.7%
일반(기계) | 22 | 16.5%
일반(전기) | 21 | 15.8%

데이터기준일자 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB
Minimum: 2020-08-31 00:00:00
Maximum: 2020-08-31 00:00:00

[Figure: histogram with fixed size bins (bins=1)]

Correlations

 | 구분 | 상호 | 전화번호 | 등록업종
구분 | 1.000 | 0.000 | 0.000 | 0.613
상호 | 0.000 | 1.000 | 1.000 | 0.822
전화번호 | 0.000 | 1.000 | 1.000 | 0.822
등록업종 | 0.613 | 0.822 | 0.822 | 1.000

 | 구분 | 등록업종
구분 | 1.000 | 0.627
등록업종 | 0.627 | 1.000
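
The extract does not say which coefficient produced the values above. For two categorical columns such as 구분 and 등록업종, Cramér's V is a common measure of association, so the sketch below shows how it is computed from a contingency table. The two tables in the example are hypothetical and are not taken from this dataset, and no bias correction is applied.

```python
import math

def cramers_v(table):
    """Cramér's V for a contingency table given as a list of rows of counts.

    Assumes every row and column total is positive and that the table
    has at least two rows and two columns.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical tables: perfect association and complete independence.
perfect = cramers_v([[10, 0], [0, 10]])
independent = cramers_v([[10, 10], [10, 10]])
print(perfect, independent)  # 1.0 0.0
```

Values near 0 indicate little association and values near 1 indicate that one column largely determines the other.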

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

 | 구분 | 상호 | 전화번호 | 등록업종 | 데이터기준일자
0 | 공사업 | 주식회사 씨에이치피산업 | 070-8119-4318 | 전문 | 2020-08-31
1 | 공사업 | 주식회사 삼화방재 | 064-724-1193 | 전문 | 2020-08-31
2 | 공사업 | 영보건설 주식회사 | 064-746-8857 | 전문 | 2020-08-31
3 | 공사업 | 주식회사 신성엔지니어링 | 064-751-2255 | 전문 | 2020-08-31
4 | 공사업 | 주식회사 한성이엔지 | 064-751-0815 | 전문 | 2020-08-31
5 | 공사업 | 주식회사 이룸이엔씨 | 064-759-9929 | 전문 | 2020-08-31
6 | 공사업 | 주식회사 소방마을 | 064-751-7119 | 일반(전기) | 2020-08-31
7 | 공사업 | (주)태현전기 | 064-747-6468 | 전문 | 2020-08-31
8 | 공사업 | 동성전력 주식회사 | 064-758-0377 | 전문 | 2020-08-31
9 | 공사업 | 경희방재 주식회사 | 064-752-0020 | 전문 | 2020-08-31

Last rows

 | 구분 | 상호 | 전화번호 | 등록업종 | 데이터기준일자
103 | 감리업 | 주식회사 현무엔지니어링 | 064-746-3360 | 일반(전기). 일반(기계) | 2020-08-31
104 | 감리업 | 신아이엔씨 주식회사 | 064-805-8483 | 일반(전기). 일반(기계) | 2020-08-31
105 | 감리업 | 주식회사 호승엔지니어링 | 064-702-3497 | 일반(전기). 일반(기계) | 2020-08-31
106 | 감리업 | 주식회사 태담엔지니어링 | 064-753-0363 | 일반(전기). 일반(기계) | 2020-08-31
107 | 감리업 | 주식회사 다온엔지니어링 | 064-746-8229 | 전문 | 2020-08-31
108 | 감리업 | (주)종합전기기술사사무소 | 064-725-2002 | 전문 | 2020-08-31
109 | 감리업 | (주)웅진엔지니어링 | 064-725-3690 | 전문 | 2020-08-31
110 | 감리업 | 주식회사 성지엔지니어링 | 064-723-1462 | 일반(전기). 일반(기계) | 2020-08-31
111 | 감리업 | (주)윤엔지니어링 | 064-755-1864 | 일반(전기). 일반(기계) | 2020-08-31
112 | 감리업 | (주)대경엔지니어링 | 064-744-4112 | 전문 | 2020-08-31