Overview

Dataset statistics

Number of variables6
Number of observations138
Missing cells138
Missing cells (%)16.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory51.0 B

Variable types

Numeric1
Categorical1
Text3
Unsupported1

Dataset

Description경상북도 내 문화재수리의 품질향상과 도내 문화재수리업자의 건전한 발전을 도모하기 위한 도내 등록된 문화재수리업체 현황
Author경상북도
URLhttps://www.data.go.kr/data/15056396/fileData.do

Alerts

순번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 순번High correlation
비고 has 138 (100.0%) missing valuesMissing
순번 has unique valuesUnique
비고 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 23:33:27.557083
Analysis finished2023-12-12 23:33:28.212939
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct138
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.5
Minimum1
Maximum138
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T08:33:28.288019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.85
Q135.25
median69.5
Q3103.75
95-th percentile131.15
Maximum138
Range137
Interquartile range (IQR)68.5

Descriptive statistics

Standard deviation39.981246
Coefficient of variation (CV)0.57526972
Kurtosis-1.2
Mean69.5
Median Absolute Deviation (MAD)34.5
Skewness0
Sum9591
Variance1598.5
MonotonicityStrictly increasing
2023-12-13T08:33:28.454844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
96 1
 
0.7%
90 1
 
0.7%
91 1
 
0.7%
92 1
 
0.7%
93 1
 
0.7%
94 1
 
0.7%
95 1
 
0.7%
97 1
 
0.7%
105 1
 
0.7%
Other values (128) 128
92.8%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
138 1
0.7%
137 1
0.7%
136 1
0.7%
135 1
0.7%
134 1
0.7%
133 1
0.7%
132 1
0.7%
131 1
0.7%
130 1
0.7%
129 1
0.7%

구분
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
보수단청업
70 
문화재실측설계업
19 
조경업
15 
보존과학업
12 
식물보호업
11 

Length

Max length8
Median length5
Mean length5.2753623
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보수단청업
2nd row보수단청업
3rd row보수단청업
4th row보수단청업
5th row보수단청업

Common Values

ValueCountFrequency (%)
보수단청업 70
50.7%
문화재실측설계업 19
 
13.8%
조경업 15
 
10.9%
보존과학업 12
 
8.7%
식물보호업 11
 
8.0%
문화재감리업 11
 
8.0%

Length

2023-12-13T08:33:28.666451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:33:28.776390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보수단청업 70
50.7%
문화재실측설계업 19
 
13.8%
조경업 15
 
10.9%
보존과학업 12
 
8.7%
식물보호업 11
 
8.0%
문화재감리업 11
 
8.0%
Distinct124
Distinct (%)89.9%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T08:33:29.005346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length6.3333333
Min length3

Characters and Unicode

Total characters874
Distinct characters144
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)80.4%

Sample

1st row동신건설㈜
2nd row세방건설㈜
3rd row㈜강인
4th row창성건설㈜
5th row㈜일토종합건설
ValueCountFrequency (%)
건축사사무소 7
 
4.8%
아람문화재 3
 
2.1%
㈜미산 2
 
1.4%
터전건축사사무소 2
 
1.4%
㈜서정 2
 
1.4%
㈜시암문화 2
 
1.4%
우리건축사사무소 2
 
1.4%
주)일진건축사사무소 2
 
1.4%
㈜호연종합건설 2
 
1.4%
일일종합건설(주 2
 
1.4%
Other values (115) 119
82.1%
2023-12-13T08:33:29.409459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
84
 
9.6%
63
 
7.2%
53
 
6.1%
36
 
4.1%
34
 
3.9%
29
 
3.3%
29
 
3.3%
27
 
3.1%
25
 
2.9%
21
 
2.4%
Other values (134) 473
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 745
85.2%
Other Symbol 84
 
9.6%
Close Punctuation 19
 
2.2%
Open Punctuation 19
 
2.2%
Space Separator 7
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
 
8.5%
53
 
7.1%
36
 
4.8%
34
 
4.6%
29
 
3.9%
29
 
3.9%
27
 
3.6%
25
 
3.4%
21
 
2.8%
18
 
2.4%
Other values (130) 410
55.0%
Other Symbol
ValueCountFrequency (%)
84
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 829
94.9%
Common 45
 
5.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
84
 
10.1%
63
 
7.6%
53
 
6.4%
36
 
4.3%
34
 
4.1%
29
 
3.5%
29
 
3.5%
27
 
3.3%
25
 
3.0%
21
 
2.5%
Other values (131) 428
51.6%
Common
ValueCountFrequency (%)
) 19
42.2%
( 19
42.2%
7
 
15.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 745
85.2%
None 84
 
9.6%
ASCII 45
 
5.1%

Most frequent character per block

None
ValueCountFrequency (%)
84
100.0%
Hangul
ValueCountFrequency (%)
63
 
8.5%
53
 
7.1%
36
 
4.8%
34
 
4.6%
29
 
3.9%
29
 
3.9%
27
 
3.6%
25
 
3.4%
21
 
2.8%
18
 
2.4%
Other values (130) 410
55.0%
ASCII
ValueCountFrequency (%)
) 19
42.2%
( 19
42.2%
7
 
15.6%
Distinct120
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T08:33:29.800723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.0724638
Min length3

Characters and Unicode

Total characters424
Distinct characters109
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)74.6%

Sample

1st row김근한, 김동한
2nd row장해진, 장태진
3rd row정상철
4th row송병훈
5th row김주윤
ValueCountFrequency (%)
고민실 3
 
2.1%
김기호 2
 
1.4%
이성태 2
 
1.4%
박남규 2
 
1.4%
하용수 2
 
1.4%
윤영희 2
 
1.4%
김용성 2
 
1.4%
최영수 2
 
1.4%
이용명 2
 
1.4%
김윤기 2
 
1.4%
Other values (114) 121
85.2%
2023-12-13T08:33:30.313908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
8.0%
23
 
5.4%
17
 
4.0%
14
 
3.3%
13
 
3.1%
13
 
3.1%
13
 
3.1%
11
 
2.6%
11
 
2.6%
11
 
2.6%
Other values (99) 264
62.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 418
98.6%
Space Separator 4
 
0.9%
Other Punctuation 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
8.1%
23
 
5.5%
17
 
4.1%
14
 
3.3%
13
 
3.1%
13
 
3.1%
13
 
3.1%
11
 
2.6%
11
 
2.6%
11
 
2.6%
Other values (97) 258
61.7%
Space Separator
ValueCountFrequency (%)
4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 418
98.6%
Common 6
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
8.1%
23
 
5.5%
17
 
4.1%
14
 
3.3%
13
 
3.1%
13
 
3.1%
13
 
3.1%
11
 
2.6%
11
 
2.6%
11
 
2.6%
Other values (97) 258
61.7%
Common
ValueCountFrequency (%)
4
66.7%
, 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 418
98.6%
ASCII 6
 
1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
8.1%
23
 
5.5%
17
 
4.1%
14
 
3.3%
13
 
3.1%
13
 
3.1%
13
 
3.1%
11
 
2.6%
11
 
2.6%
11
 
2.6%
Other values (97) 258
61.7%
ASCII
ValueCountFrequency (%)
4
66.7%
, 2
33.3%

주소
Text

Distinct124
Distinct (%)89.9%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T08:33:30.645190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length33
Mean length19.688406
Min length10

Characters and Unicode

Total characters2717
Distinct characters206
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)80.4%

Sample

1st row안동시 서동문로 217(동문동 131)
2nd row안동시 거북골길 145(정상동 361-1)
3rd row경주시 양정로 227, 3층(동천동)
4th row안동시 말구리2길 27, 2층(태화동)
5th row문경시 신흥시장길 19, 나동 207호
ValueCountFrequency (%)
경주시 41
 
7.1%
안동시 27
 
4.7%
2층 17
 
3.0%
영주시 9
 
1.6%
예천군 9
 
1.6%
경산시 8
 
1.4%
봉화군 8
 
1.4%
봉화읍 8
 
1.4%
양정로 7
 
1.2%
성주군 6
 
1.0%
Other values (322) 435
75.7%
2023-12-13T08:33:31.081922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
437
 
16.1%
2 146
 
5.4%
1 134
 
4.9%
114
 
4.2%
113
 
4.2%
, 85
 
3.1%
80
 
2.9%
70
 
2.6%
70
 
2.6%
3 64
 
2.4%
Other values (196) 1404
51.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1410
51.9%
Decimal Number 631
23.2%
Space Separator 437
 
16.1%
Other Punctuation 85
 
3.1%
Close Punctuation 53
 
2.0%
Open Punctuation 53
 
2.0%
Dash Punctuation 43
 
1.6%
Uppercase Letter 4
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
114
 
8.1%
113
 
8.0%
80
 
5.7%
70
 
5.0%
70
 
5.0%
58
 
4.1%
54
 
3.8%
54
 
3.8%
37
 
2.6%
36
 
2.6%
Other values (176) 724
51.3%
Decimal Number
ValueCountFrequency (%)
2 146
23.1%
1 134
21.2%
3 64
10.1%
0 62
9.8%
4 58
 
9.2%
5 53
 
8.4%
6 41
 
6.5%
7 33
 
5.2%
9 21
 
3.3%
8 19
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
25.0%
M 1
25.0%
A 1
25.0%
F 1
25.0%
Space Separator
ValueCountFrequency (%)
437
100.0%
Other Punctuation
ValueCountFrequency (%)
, 85
100.0%
Close Punctuation
ValueCountFrequency (%)
) 53
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 43
100.0%
Lowercase Letter
ValueCountFrequency (%)
v 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1410
51.9%
Common 1302
47.9%
Latin 5
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
114
 
8.1%
113
 
8.0%
80
 
5.7%
70
 
5.0%
70
 
5.0%
58
 
4.1%
54
 
3.8%
54
 
3.8%
37
 
2.6%
36
 
2.6%
Other values (176) 724
51.3%
Common
ValueCountFrequency (%)
437
33.6%
2 146
 
11.2%
1 134
 
10.3%
, 85
 
6.5%
3 64
 
4.9%
0 62
 
4.8%
4 58
 
4.5%
) 53
 
4.1%
5 53
 
4.1%
( 53
 
4.1%
Other values (5) 157
 
12.1%
Latin
ValueCountFrequency (%)
S 1
20.0%
M 1
20.0%
v 1
20.0%
A 1
20.0%
F 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1410
51.9%
ASCII 1307
48.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
437
33.4%
2 146
 
11.2%
1 134
 
10.3%
, 85
 
6.5%
3 64
 
4.9%
0 62
 
4.7%
4 58
 
4.4%
) 53
 
4.1%
5 53
 
4.1%
( 53
 
4.1%
Other values (10) 162
 
12.4%
Hangul
ValueCountFrequency (%)
114
 
8.1%
113
 
8.0%
80
 
5.7%
70
 
5.0%
70
 
5.0%
58
 
4.1%
54
 
3.8%
54
 
3.8%
37
 
2.6%
36
 
2.6%
Other values (176) 724
51.3%

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing138
Missing (%)100.0%
Memory size1.3 KiB

Interactions

2023-12-13T08:33:27.901435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:33:31.190871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분
순번1.0000.955
구분0.9551.000
2023-12-13T08:33:31.535449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분
순번1.0000.867
구분0.8671.000

Missing values

2023-12-13T08:33:28.040818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:33:28.175899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번구분회사명대표자주소비고
01보수단청업동신건설㈜김근한, 김동한안동시 서동문로 217(동문동 131)<NA>
12보수단청업세방건설㈜장해진, 장태진안동시 거북골길 145(정상동 361-1)<NA>
23보수단청업㈜강인정상철경주시 양정로 227, 3층(동천동)<NA>
34보수단청업창성건설㈜송병훈안동시 말구리2길 27, 2층(태화동)<NA>
45보수단청업㈜일토종합건설김주윤문경시 신흥시장길 19, 나동 207호<NA>
56보수단청업㈜송원노지원안동시 서동문로 217, 4층 F-03호(동문동)<NA>
67보수단청업㈜미산김동혁성주군 선남면 성주로 4240<NA>
78보수단청업㈜유신이경락경주시 광산안길 6-2 이동 제2층201호(마동,에이스빌라)<NA>
89보수단청업세창건설㈜장은성안동시 거북골길 145<NA>
910보수단청업㈜우림산업박순범경주시 초당길5번길 41<NA>
순번구분회사명대표자주소비고
128129문화재감리업(주)일진건축사사무소박남규경주시 중마을길 20,1층(충효동)<NA>
129130문화재감리업창산감리단주식회사이영순안동시 풍일로 2262<NA>
130131문화재감리업터전건축사사무소이성태경산시 성암로12길 40-13, 3층<NA>
131132문화재감리업서헌문화재최규순경산시 솔숲길 102, 2층 201호<NA>
132133문화재감리업㈜태진최이태경주시 알천북로 335, 3층<NA>
133134문화재감리업삼안연구소박왕희청도군 매전면 숲실길 4<NA>
134135문화재감리업우리건축사사무소김태조경주시 양정로 285(동천동)<NA>
135136문화재감리업대경건축사사무소하용수경주시 백률로 17-12, 2층(동천동)<NA>
136137문화재감리업㈜동원건축사사무소이용명경주시 양정로 241-1, 기린빌딩 7층<NA>
137138문화재감리업건축사사무소 정훈박도호안동시 서동문로 217, 4층 5호<NA>