Overview

Dataset statistics

Number of variables8
Number of observations631
Missing cells14
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory39.6 KiB
Average record size in memory64.2 B

Variable types

Categorical2
Text4
DateTime2

Dataset

Description문화재수리업자 현황입니다. 문화재수리업으로 등록된 수리업체 현황을 시도, 업종, 등록번호, 상호, 주소 등 정보를 확인할 수 있습니다.
Author문화재청
URLhttps://www.data.go.kr/data/15088664/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연락처 has 14 (2.2%) missing valuesMissing

Reproduction

Analysis started2023-12-12 15:50:38.208147
Analysis finished2023-12-12 15:50:38.879895
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

Distinct15
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
경북
135 
충남
83 
경기
71 
전남
69 
서울
62 
Other values (10)
211 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울
2nd row서울
3rd row서울
4th row서울
5th row서울

Common Values

ValueCountFrequency (%)
경북 135
21.4%
충남 83
13.2%
경기 71
11.3%
전남 69
10.9%
서울 62
9.8%
경남 53
 
8.4%
전북 51
 
8.1%
충북 30
 
4.8%
강원 29
 
4.6%
제주 29
 
4.6%
Other values (5) 19
 
3.0%

Length

2023-12-13T00:50:38.963264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경북 135
21.4%
충남 83
13.2%
경기 71
11.3%
전남 69
10.9%
서울 62
9.8%
경남 53
 
8.4%
전북 51
 
8.1%
충북 30
 
4.8%
강원 29
 
4.6%
제주 29
 
4.6%
Other values (5) 19
 
3.0%

업종
Categorical

Distinct11
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
보수단청업
286 
조경업
81 
문화재실측설계업
71 
보존과학업
68 
식물보호업
60 
Other values (6)
65 

Length

Max length8
Median length5
Mean length5.1584786
Min length3

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row보수단청업
2nd row보수단청업
3rd row보수단청업
4th row보수단청업
5th row보수단청업

Common Values

ValueCountFrequency (%)
보수단청업 286
45.3%
조경업 81
 
12.8%
문화재실측설계업 71
 
11.3%
보존과학업 68
 
10.8%
식물보호업 60
 
9.5%
문화재감리업 52
 
8.2%
단청공사업 7
 
1.1%
번와공사업 2
 
0.3%
석공사업 2
 
0.3%
미장공사업 1
 
0.2%

Length

2023-12-13T00:50:39.097585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보수단청업 286
45.3%
조경업 81
 
12.8%
문화재실측설계업 71
 
11.3%
보존과학업 68
 
10.8%
식물보호업 60
 
9.5%
문화재감리업 52
 
8.2%
단청공사업 7
 
1.1%
번와공사업 2
 
0.3%
석공사업 2
 
0.3%
미장공사업 1
 
0.2%
Distinct616
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-12-13T00:50:39.451818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.058637
Min length3

Characters and Unicode

Total characters6978
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique602 ?
Unique (%)95.4%

Sample

1st row제7호
2nd row제01-01-0009호
3rd row제01-01-0012호
4th row제82호
5th row제01-01-0015호
ValueCountFrequency (%)
제4호 3
 
0.5%
제70호 2
 
0.3%
제01-04-0005호 2
 
0.3%
제38호 2
 
0.3%
제05-17-0015호 2
 
0.3%
제01-15-0021호 2
 
0.3%
제01-14-0015호 2
 
0.3%
제37호 2
 
0.3%
제16호 2
 
0.3%
제05-13-0010호 2
 
0.3%
Other values (606) 610
96.7%
2023-12-13T00:50:39.946812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2060
29.5%
- 1108
15.9%
1 1087
15.6%
631
 
9.0%
631
 
9.0%
3 287
 
4.1%
2 252
 
3.6%
6 234
 
3.4%
5 209
 
3.0%
4 204
 
2.9%
Other values (3) 275
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4608
66.0%
Other Letter 1262
 
18.1%
Dash Punctuation 1108
 
15.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2060
44.7%
1 1087
23.6%
3 287
 
6.2%
2 252
 
5.5%
6 234
 
5.1%
5 209
 
4.5%
4 204
 
4.4%
7 145
 
3.1%
8 78
 
1.7%
9 52
 
1.1%
Other Letter
ValueCountFrequency (%)
631
50.0%
631
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 1108
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5716
81.9%
Hangul 1262
 
18.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2060
36.0%
- 1108
19.4%
1 1087
19.0%
3 287
 
5.0%
2 252
 
4.4%
6 234
 
4.1%
5 209
 
3.7%
4 204
 
3.6%
7 145
 
2.5%
8 78
 
1.4%
Hangul
ValueCountFrequency (%)
631
50.0%
631
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5716
81.9%
Hangul 1262
 
18.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2060
36.0%
- 1108
19.4%
1 1087
19.0%
3 287
 
5.0%
2 252
 
4.4%
6 234
 
4.1%
5 209
 
3.7%
4 204
 
3.6%
7 145
 
2.5%
8 78
 
1.4%
Hangul
ValueCountFrequency (%)
631
50.0%
631
50.0%

상호
Text

Distinct565
Distinct (%)89.5%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-12-13T00:50:40.205969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length7.2646593
Min length2

Characters and Unicode

Total characters4584
Distinct characters251
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique502 ?
Unique (%)79.6%

Sample

1st row삼부토건㈜
2nd row희우건설 주식회사
3rd row세림산업 주식회사
4th row원택건설㈜
5th row(주)새한티엠씨
ValueCountFrequency (%)
주식회사 113
 
14.9%
건축사사무소 9
 
1.2%
아람문화재㈜ 3
 
0.4%
㈜동해건설 3
 
0.4%
㈜동인종합건설 3
 
0.4%
㈜건축사사무소 3
 
0.4%
㈜고택 2
 
0.3%
주)디딤건축사사무소 2
 
0.3%
㈜소야원 2
 
0.3%
㈜티지건축사사무소 2
 
0.3%
Other values (560) 618
81.3%
2023-12-13T00:50:40.675691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
358
 
7.8%
331
 
7.2%
305
 
6.7%
174
 
3.8%
170
 
3.7%
158
 
3.4%
129
 
2.8%
123
 
2.7%
123
 
2.7%
123
 
2.7%
Other values (241) 2590
56.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3995
87.2%
Other Symbol 358
 
7.8%
Space Separator 129
 
2.8%
Open Punctuation 51
 
1.1%
Close Punctuation 51
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
331
 
8.3%
305
 
7.6%
174
 
4.4%
170
 
4.3%
158
 
4.0%
123
 
3.1%
123
 
3.1%
123
 
3.1%
122
 
3.1%
116
 
2.9%
Other values (237) 2250
56.3%
Other Symbol
ValueCountFrequency (%)
358
100.0%
Space Separator
ValueCountFrequency (%)
129
100.0%
Open Punctuation
ValueCountFrequency (%)
( 51
100.0%
Close Punctuation
ValueCountFrequency (%)
) 51
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4353
95.0%
Common 231
 
5.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
358
 
8.2%
331
 
7.6%
305
 
7.0%
174
 
4.0%
170
 
3.9%
158
 
3.6%
123
 
2.8%
123
 
2.8%
123
 
2.8%
122
 
2.8%
Other values (238) 2366
54.4%
Common
ValueCountFrequency (%)
129
55.8%
( 51
 
22.1%
) 51
 
22.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3995
87.2%
None 358
 
7.8%
ASCII 231
 
5.0%

Most frequent character per block

None
ValueCountFrequency (%)
358
100.0%
Hangul
ValueCountFrequency (%)
331
 
8.3%
305
 
7.6%
174
 
4.4%
170
 
4.3%
158
 
4.0%
123
 
3.1%
123
 
3.1%
123
 
3.1%
122
 
3.1%
116
 
2.9%
Other values (237) 2250
56.3%
ASCII
ValueCountFrequency (%)
129
55.8%
( 51
 
22.1%
) 51
 
22.1%
Distinct138
Distinct (%)21.9%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-12-13T00:50:40.999062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length6.0079239
Min length5

Characters and Unicode

Total characters3791
Distinct characters110
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)6.7%

Sample

1st row서울 중구
2nd row서울 광진구
3rd row서울 양천구
4th row서울 광진구
5th row서울 종로구
ValueCountFrequency (%)
경북 135
 
10.7%
충남 83
 
6.6%
경기 71
 
5.6%
전남 69
 
5.5%
서울 62
 
4.9%
경남 53
 
4.2%
전북 51
 
4.0%
경주시 40
 
3.2%
충북 30
 
2.4%
제주 29
 
2.3%
Other values (141) 639
50.6%
2023-12-13T00:50:41.872508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
631
16.6%
410
 
10.8%
312
 
8.2%
234
 
6.2%
231
 
6.1%
217
 
5.7%
147
 
3.9%
145
 
3.8%
116
 
3.1%
89
 
2.3%
Other values (100) 1259
33.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3160
83.4%
Space Separator 631
 
16.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
410
 
13.0%
312
 
9.9%
234
 
7.4%
231
 
7.3%
217
 
6.9%
147
 
4.7%
145
 
4.6%
116
 
3.7%
89
 
2.8%
83
 
2.6%
Other values (99) 1176
37.2%
Space Separator
ValueCountFrequency (%)
631
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3160
83.4%
Common 631
 
16.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
410
 
13.0%
312
 
9.9%
234
 
7.4%
231
 
7.3%
217
 
6.9%
147
 
4.7%
145
 
4.6%
116
 
3.7%
89
 
2.8%
83
 
2.6%
Other values (99) 1176
37.2%
Common
ValueCountFrequency (%)
631
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3160
83.4%
ASCII 631
 
16.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
631
100.0%
Hangul
ValueCountFrequency (%)
410
 
13.0%
312
 
9.9%
234
 
7.4%
231
 
7.3%
217
 
6.9%
147
 
4.7%
145
 
4.6%
116
 
3.7%
89
 
2.8%
83
 
2.6%
Other values (99) 1176
37.2%

연락처
Text

MISSING 

Distinct543
Distinct (%)88.0%
Missing14
Missing (%)2.2%
Memory size5.1 KiB
2023-12-13T00:50:42.143004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.990276
Min length9

Characters and Unicode

Total characters7398
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique474 ?
Unique (%)76.8%

Sample

1st row02-3706-2310
2nd row02-3436-0722
3rd row02-6332-7784
4th row02-498-5471
5th row02-733-9404
ValueCountFrequency (%)
064-732-1607 4
 
0.6%
064-733-1609 3
 
0.5%
050-2324-6124 3
 
0.5%
054-745-3444 3
 
0.5%
031-818-1085 2
 
0.3%
054-932-4802 2
 
0.3%
070-4408-9070 2
 
0.3%
055-674-7197 2
 
0.3%
041-544-3442 2
 
0.3%
031-693-6602 2
 
0.3%
Other values (533) 592
95.9%
2023-12-13T00:50:42.648004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1232
16.7%
0 1062
14.4%
3 799
10.8%
5 721
9.7%
4 684
9.2%
1 588
7.9%
7 560
7.6%
6 535
7.2%
2 522
7.1%
8 396
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6166
83.3%
Dash Punctuation 1232
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1062
17.2%
3 799
13.0%
5 721
11.7%
4 684
11.1%
1 588
9.5%
7 560
9.1%
6 535
8.7%
2 522
8.5%
8 396
 
6.4%
9 299
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 1232
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7398
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1232
16.7%
0 1062
14.4%
3 799
10.8%
5 721
9.7%
4 684
9.2%
1 588
7.9%
7 560
7.6%
6 535
7.2%
2 522
7.1%
8 396
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7398
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1232
16.7%
0 1062
14.4%
3 799
10.8%
5 721
9.7%
4 684
9.2%
1 588
7.9%
7 560
7.6%
6 535
7.2%
2 522
7.1%
8 396
 
5.4%
Distinct472
Distinct (%)74.8%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
Minimum1993-02-15 00:00:00
Maximum2021-06-17 00:00:00
2023-12-13T00:50:42.857678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:50:43.036821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
Minimum2021-08-01 00:00:00
Maximum2021-08-01 00:00:00
2023-12-13T00:50:43.188196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:50:43.326457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-13T00:50:43.449856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명업종
시도명1.0000.357
업종0.3571.000
2023-12-13T00:50:43.574838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명업종
시도명1.0000.145
업종0.1451.000
2023-12-13T00:50:43.681910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명업종
시도명1.0000.145
업종0.1451.000

Missing values

2023-12-13T00:50:38.661916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:50:38.814272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명업종등록번호상호영업장 소재지연락처등록일데이터기준일자
0서울보수단청업제7호삼부토건㈜서울 중구02-3706-23101994-02-152021-08-01
1서울보수단청업제01-01-0009호희우건설 주식회사서울 광진구02-3436-07222009-03-312021-08-01
2서울보수단청업제01-01-0012호세림산업 주식회사서울 양천구02-6332-77842011-04-182021-08-01
3서울보수단청업제82호원택건설㈜서울 광진구02-498-54711997-03-052021-08-01
4서울보수단청업제01-01-0015호(주)새한티엠씨서울 종로구02-733-94042013-03-082021-08-01
5서울보수단청업제150호㈜에이치디토건서울 마포구02-707-18072001-02-282021-08-01
6서울보수단청업제84호㈜토형산업서울 마포구02-325-67331997-03-052021-08-01
7서울보수단청업제01-01-0017호㈜동방문화유산서울 종로구02-3411-18912015-03-102021-08-01
8서울보수단청업제01-01-0019호(주)재우씨엔씨서울 노원구02-971-31562015-07-242021-08-01
9서울보수단청업제01-01-0020호한옥협동조합서울 종로구02-742-92722015-12-142021-08-01
시도명업종등록번호상호영업장 소재지연락처등록일데이터기준일자
621제주식물보호업제05-18-0016호주식회사 동해건설제주 제주시064-732-16072020-03-112021-08-01
622제주식물보호업제05-13-0004호한라나무병원제주 제주시070-7767-61892009-02-022021-08-01
623제주식물보호업제2007-01호한강나무병원제주 제주시064-756-30822007-05-182021-08-01
624제주식물보호업제05-15-0035호신한나무종합병원제주 제주시064-721-84672019-03-132021-08-01
625제주식물보호업제05-17-0015호한국나무종합병원제주 제주시064-757-20282003-10-062021-08-01
626제주식물보호업제05-17-0005호한경나무종합병원㈜제주 제주시064-744-35512006-03-152021-08-01
627제주식물보호업제05-15-0032호송원나무종합병원제주 제주시064-743-54032015-03-262021-08-01
628제주식물보호업제05-18-0017호㈜동인종합건설제주 제주시064-733-16092021-04-292021-08-01
629제주식물보호업제05-15-0029호한서나무종합병원㈜제주 서귀포시064-732-44362012-04-122021-08-01
630제주문화재실측설계업제02-18-0001호지태승건축사사무소제주 제주시064-723-81002015-06-022021-08-01