Overview

Dataset statistics

Number of variables: 4
Number of observations: 34
Missing cells: 3
Missing cells (%): 2.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.2 KiB
Average record size in memory: 35.9 B
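
The percentage figures above follow from simple ratios. The sketch below assumes that the missing-cell percentage is the number of missing cells divided by the total cell count (observations times variables). The helper name `pct` is hypothetical and not part of the profiling tool.

```python
def pct(part: int, whole: int) -> str:
    """Format part/whole as a percentage with one decimal place."""
    return f"{part / whole * 100:.1f}%"


n_vars, n_obs = 4, 34           # taken from the dataset statistics
missing_cells = 3

total_cells = n_vars * n_obs    # every observation has one cell per variable
print(pct(missing_cells, total_cells))  # missing cells (%)
print(pct(0, n_obs))                    # duplicate rows (%)
```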

Variable types

Categorical: 1
Text: 2
DateTime: 1

Dataset

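Description: Data on cultural heritage repair businesses registered in Jeju Special Self-Governing Province. It provides the sub-category (세부구분: landscaping, repair and dancheong painting, plant protection), the business name (상호), the contact number (연락처), and related information.
URL: https://www.data.go.kr/data/15056263/fileData.do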

Alerts

데이터기준일자 has constant value "2023-02-20" (Constant)
연락처 has 3 (8.8%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 03:55:30.708755
Analysis finished: 2023-12-12 03:55:31.152143
Duration: 0.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
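
A report like this one can be regenerated with ydata-profiling. The following is a minimal sketch, not the exact configuration used here (that is in config.json). The function name `build_report`, the file paths, and the report title are hypothetical, and it assumes the dataset has been downloaded as a CSV file. Files from data.go.kr may need an explicit `encoding` argument such as `"cp949"`.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write the result as an HTML report (sketch)."""
    # Imported lazily so the function can be defined even when the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the reference-date column parses cleanly as a date.
    df = pd.read_csv(csv_path, parse_dates=["데이터기준일자"])

    # The title is illustrative only.
    report = ProfileReport(df, title="Cultural heritage repair businesses")
    report.to_file(out_path)
```

Calling `build_report("businesses.csv")` with a hypothetical local file name would write `report.html` next to the script.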

Variables

세부구분
Categorical

Distinct: 5
Distinct (%): 14.7%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 8
Median length: 5
Mean length: 4.6470588
Min length: 3
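
The length statistics above can be checked from the category counts listed under Common Values. This sketch assumes that length means the number of Unicode code points, which is what Python's `len` counts for these precomposed Hangul strings.

```python
from statistics import mean, median

# Category counts as listed in the Common Values table of this report.
counts = {
    "식물보호업": 17,
    "보수단청업": 8,
    "조경업": 7,
    "문화재실측설계업": 1,
    "석공사업": 1,
}

# One length per observation, repeated according to each category's count.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print(max(lengths), median(lengths), round(mean(lengths), 7), min(lengths))
```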

Unique

Unique: 2
Unique (%): 5.9%

Sample

1st row: 조경업
2nd row: 조경업
3rd row: 조경업
4th row: 조경업
5th row: 조경업

Common Values

Value  Count  Frequency (%)
식물보호업  17  50.0%
보수단청업  8  23.5%
조경업  7  20.6%
문화재실측설계업  1  2.9%
석공사업  1  2.9%
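
The frequency column and the Distinct and Unique figures for 세부구분 can be derived from the counts with the standard library. In this sketch, "distinct" is taken to mean the number of different values and "unique" the number of values that occur exactly once. That reading matches the figures in this report, but it is an assumption about the tool's definitions.

```python
from collections import Counter

# Rebuild the 34 observations from the counts in the table above.
observations = (
    ["식물보호업"] * 17
    + ["보수단청업"] * 8
    + ["조경업"] * 7
    + ["문화재실측설계업"]
    + ["석공사업"]
)

freq = Counter(observations)
total = sum(freq.values())

# Print value, count, and relative frequency, most common first.
for value, count in freq.most_common():
    print(f"{value}\t{count}\t{count / total:.1%}")

distinct = len(freq)                                  # different values
unique = sum(1 for c in freq.values() if c == 1)      # values seen once
print(distinct, unique)
```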

Length

Figure: Histogram of lengths of the category


상호
Text

Distinct: 31
Distinct (%): 91.2%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 12
Median length: 10
Mean length: 7.7058824
Min length: 2

Characters and Unicode

Total characters: 262
Distinct characters: 63
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
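
As an illustration of these character properties, the sketch below counts Unicode general categories for two business names taken from the sample rows of this report, using Python's standard `unicodedata` module. It is not the profiling tool's own implementation.

```python
import unicodedata
from collections import Counter

# Two values taken from the sample rows of this report.
names = ["(주)동해건설", "경림종합건설㈜"]

# unicodedata.category returns a two-letter general category code, e.g.
# "Lo" (Other Letter), "So" (Other Symbol), "Ps" / "Pe" (open / close
# punctuation).
categories = Counter(
    unicodedata.category(ch) for name in names for ch in name
)
print(categories)
```

The standard library exposes the general category but not the script or block of a character, so the script and block breakdowns in this report would need another data source.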

Unique

Unique: 29
Unique (%): 85.3%

Sample

1st row: (주)동해건설
2nd row: 삼영건설주식회사
3rd row: 기덕종합건설주식회사
4th row: 정담원주식회사
5th row: 유상건설주식회사

Value  Count  Frequency (%)
주)동해건설  3  8.3%
유상건설주식회사  2  5.6%
주식회사  2  5.6%
한명나무병원  1  2.8%
삼영건설(주  1  2.8%
주)대원종합건설  1  2.8%
주)동인종합건설  1  2.8%
경림종합건설㈜  1  2.8%
기덕종합건설(주  1  2.8%
㈜탐라문화재개발  1  2.8%
Other values (22)  22  61.1%

Most occurring characters

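Value  Count  Frequency (%)
(not preserved)  18  6.9%
(not preserved)  17  6.5%
(not preserved)  14  5.3%
(not preserved)  14  5.3%
(not preserved)  13  5.0%
(not preserved)  13  5.0%
(not preserved)  13  5.0%
(not preserved)  10  3.8%
(not preserved)  10  3.8%
(not preserved)  9  3.4%
Other values (53)  131  50.0%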

Most occurring categories

Value  Count  Frequency (%)
Other Letter  233  88.9%
Other Symbol  9  3.4%
Open Punctuation  9  3.4%
Close Punctuation  9  3.4%
Space Separator  2  0.8%

Most frequent character per category

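Other Letter
Value  Count  Frequency (%)
(not preserved)  18  7.7%
(not preserved)  17  7.3%
(not preserved)  14  6.0%
(not preserved)  14  6.0%
(not preserved)  13  5.6%
(not preserved)  13  5.6%
(not preserved)  13  5.6%
(not preserved)  10  4.3%
(not preserved)  10  4.3%
(not preserved)  9  3.9%
Other values (49)  102  43.8%

Other Symbol
Value  Count  Frequency (%)
㈜  9  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  9  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  9  100.0%

Space Separator
Value  Count  Frequency (%)
(space)  2  100.0%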

Most occurring scripts

Value  Count  Frequency (%)
Hangul  242  92.4%
Common  20  7.6%

Most frequent character per script

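Hangul
Value  Count  Frequency (%)
(not preserved)  18  7.4%
(not preserved)  17  7.0%
(not preserved)  14  5.8%
(not preserved)  14  5.8%
(not preserved)  13  5.4%
(not preserved)  13  5.4%
(not preserved)  13  5.4%
(not preserved)  10  4.1%
(not preserved)  10  4.1%
(not preserved)  9  3.7%
Other values (50)  111  45.9%

Common
Value  Count  Frequency (%)
(  9  45.0%
)  9  45.0%
(space)  2  10.0%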

Most occurring blocks

Value  Count  Frequency (%)
Hangul  233  88.9%
ASCII  20  7.6%
None  9  3.4%

Most frequent character per block

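Hangul
Value  Count  Frequency (%)
(not preserved)  18  7.7%
(not preserved)  17  7.3%
(not preserved)  14  6.0%
(not preserved)  14  6.0%
(not preserved)  13  5.6%
(not preserved)  13  5.6%
(not preserved)  13  5.6%
(not preserved)  10  4.3%
(not preserved)  10  4.3%
(not preserved)  9  3.9%
Other values (49)  102  43.8%

None
Value  Count  Frequency (%)
㈜  9  100.0%

ASCII
Value  Count  Frequency (%)
(  9  45.0%
)  9  45.0%
(space)  2  10.0%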

연락처
Text

MISSING 

Distinct: 27
Distinct (%): 87.1%
Missing: 3
Missing (%): 8.8%
Memory size: 404.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.16129
Min length: 12

Characters and Unicode

Total characters: 377
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 24
Unique (%): 77.4%

Sample

1st row: 064-732-1607
2nd row: 064-764-0488
3rd row: 064-712-4780
4th row: 064-752-3201
5th row: 064-904-1997

Value  Count  Frequency (%)
064-732-1607  3  9.7%
064-764-0488  2  6.5%
064-733-1609  2  6.5%
064-744-3551  1  3.2%
064-702-8931  1  3.2%
064-743-0227  1  3.2%
064-712-0003  1  3.2%
064-749-6112  1  3.2%
064-744-2800  1  3.2%
064-733-0090  1  3.2%
Other values (17)  17  54.8%

Most occurring characters

Value  Count  Frequency (%)
0  62  16.4%
-  62  16.4%
4  48  12.7%
6  44  11.7%
7  44  11.7%
2  27  7.2%
1  26  6.9%
3  22  5.8%
9  18  4.8%
8  15  4.0%

Most occurring categories

Value  Count  Frequency (%)
Decimal Number  315  83.6%
Dash Punctuation  62  16.4%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0  62  19.7%
4  48  15.2%
6  44  14.0%
7  44  14.0%
2  27  8.6%
1  26  8.3%
3  22  7.0%
9  18  5.7%
8  15  4.8%
5  9  2.9%

Dash Punctuation
Value  Count  Frequency (%)
-  62  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Common  377  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
0  62  16.4%
-  62  16.4%
4  48  12.7%
6  44  11.7%
7  44  11.7%
2  27  7.2%
1  26  6.9%
3  22  5.8%
9  18  4.8%
8  15  4.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  377  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
0  62  16.4%
-  62  16.4%
4  48  12.7%
6  44  11.7%
7  44  11.7%
2  27  7.2%
1  26  6.9%
3  22  5.8%
9  18  4.8%
8  15  4.0%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B
Minimum: 2023-02-20 00:00:00
Maximum: 2023-02-20 00:00:00
Figure: Histogram with fixed size bins (bins=1)

Correlations

Variable  세부구분  상호  연락처
세부구분  1.000  0.778  0.770
상호  0.778  1.000  1.000
연락처  0.770  1.000  1.000
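
The report does not state which association measure fills this matrix. For pairs of categorical variables, Cramér's V is a common choice, and the sketch below assumes it. This is the plain, uncorrected form, so it is not guaranteed to reproduce the values above. The function name `cramers_v` is hypothetical.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed counts per (x, y) pair
    row = Counter(xs)              # marginal counts of x
    col = Counter(ys)              # marginal counts of y

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x in row:
        for y in col:
            expected = row[x] * col[y] / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Perfectly associated toy data gives 1, independent toy data gives 0.
print(cramers_v(list("aabb"), list("ppqq")))
print(cramers_v(list("aabb"), list("pqpq")))
```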

Missing values

Figure: A simple visualization of nullity by column.
Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
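
The per-column nullity that these figures visualize is a count of missing entries in each column. A minimal sketch with four rows copied from the Sample section of this report, where `None` stands in for the `<NA>` marker:

```python
columns = ["세부구분", "상호", "연락처"]

# Four rows copied from the sample rows of this report.
rows = [
    ("조경업", "(주)동해건설", "064-732-1607"),
    ("조경업", "유상건설주식회사", None),          # shown as <NA>
    ("식물보호업", "별뫼", None),                  # shown as <NA>
    ("식물보호업", "한라나무병원", "070-7767-6189"),
]

# Count the missing entries column by column.
missing = {
    col: sum(row[i] is None for row in rows)
    for i, col in enumerate(columns)
}
print(missing)
```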

Sample

Row  세부구분  상호  연락처  데이터기준일자
0  조경업  (주)동해건설  064-732-1607  2023-02-20
1  조경업  삼영건설주식회사  064-764-0488  2023-02-20
2  조경업  기덕종합건설주식회사  064-712-4780  2023-02-20
3  조경업  정담원주식회사  064-752-3201  2023-02-20
4  조경업  유상건설주식회사  <NA>  2023-02-20
5  조경업  ㈜이현이엔지  064-904-1997  2023-02-20
6  조경업  한라문화유산  070-8721-7799  2023-02-20
7  식물보호업  별뫼  <NA>  2023-02-20
8  식물보호업  한라나무병원  070-7767-6189  2023-02-20
9  식물보호업  한주나무종합병원(주)  064-739-6331  2023-02-20

Row  세부구분  상호  연락처  데이터기준일자
24  문화재실측설계업  지태승건축사무소  064-723-8100  2023-02-20
25  석공사업  ㈜탐라문화재개발  064-733-0090  2023-02-20
26  보수단청업  (주)동해건설  064-732-1607  2023-02-20
27  보수단청업  기덕종합건설(주)  064-744-2800  2023-02-20
28  보수단청업  경림종합건설㈜  064-749-6112  2023-02-20
29  보수단청업  (주)동인종합건설  064-733-1609  2023-02-20
30  보수단청업  (주)대원종합건설  064-712-0003  2023-02-20
31  보수단청업  삼영건설(주)  064-764-0488  2023-02-20
32  보수단청업  태이재㈜  064-743-0227  2023-02-20
33  보수단청업  유상건설주식회사  064-711-9046  2023-02-20