Overview

Dataset statistics

Number of variables12
Number of observations940
Missing cells704
Missing cells (%)6.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory89.2 KiB
Average record size in memory97.1 B

Variable types

Numeric1
Categorical4
Text6
DateTime1

Dataset

Description공예품 유통 활성화를 위한 다각적 지원체계 마련, 온·오프라인 유통망의 공예품 판로지원을 통한 공예시장 확대, 공예사업체 정보 데이터로 사업체명, 키워드 등의 항목을 제공합니다.
Author한국공예디자인문화진흥원
URLhttps://www.data.go.kr/data/15124034/fileData.do

Alerts

번호 is highly overall correlated with 사업자 구분High correlation
사업자 구분 is highly overall correlated with 번호High correlation
사용여부 is highly imbalanced (92.9%)Imbalance
영문업체명 has 118 (12.6%) missing valuesMissing
대표자 성명 has 17 (1.8%) missing valuesMissing
창립년도 has 153 (16.3%) missing valuesMissing
홈페이지 has 413 (43.9%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:52:43.315697
Analysis finished2023-12-12 05:52:45.056029
Duration1.74 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct940
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean470.5
Minimum1
Maximum940
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2023-12-12T14:52:45.137921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile47.95
Q1235.75
median470.5
Q3705.25
95-th percentile893.05
Maximum940
Range939
Interquartile range (IQR)469.5

Descriptive statistics

Standard deviation271.49893
Coefficient of variation (CV)0.57704341
Kurtosis-1.2
Mean470.5
Median Absolute Deviation (MAD)235
Skewness0
Sum442270
Variance73711.667
MonotonicityStrictly increasing
2023-12-12T14:52:45.288122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
633 1
 
0.1%
621 1
 
0.1%
622 1
 
0.1%
623 1
 
0.1%
624 1
 
0.1%
625 1
 
0.1%
626 1
 
0.1%
627 1
 
0.1%
628 1
 
0.1%
Other values (930) 930
98.9%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
940 1
0.1%
939 1
0.1%
938 1
0.1%
937 1
0.1%
936 1
0.1%
935 1
0.1%
934 1
0.1%
933 1
0.1%
932 1
0.1%
931 1
0.1%

공예사업체
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
공예사업체
835 
공예 외
105 

Length

Max length5
Median length5
Mean length4.8882979
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공예사업체
2nd row공예사업체
3rd row공예사업체
4th row공예사업체
5th row공예 외

Common Values

ValueCountFrequency (%)
공예사업체 835
88.8%
공예 외 105
 
11.2%

Length

2023-12-12T14:52:45.418865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:52:45.504323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공예사업체 835
79.9%
공예 105
 
10.0%
105
 
10.0%
Distinct151
Distinct (%)16.1%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
2023-12-12T14:52:45.607553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length2
Mean length6.3340426
Min length1

Characters and Unicode

Total characters5954
Distinct characters53
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)11.8%

Sample

1st row도자 , 특수/복합소재 , 기타
2nd row섬유
3rd row도자
4th row섬유
5th row금속 , 디자인
ValueCountFrequency (%)
510
25.0%
기타 246
12.1%
도자 223
10.9%
섬유 179
 
8.8%
특수/복합소재 164
 
8.1%
111
 
5.4%
금속 80
 
3.9%
종이(한지 77
 
3.8%
포함 77
 
3.8%
가죽 73
 
3.6%
Other values (15) 297
14.6%
2023-12-12T14:52:45.927616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1097
18.4%
, 510
 
8.6%
322
 
5.4%
246
 
4.1%
239
 
4.0%
236
 
4.0%
/ 224
 
3.8%
223
 
3.7%
198
 
3.3%
179
 
3.0%
Other values (43) 2480
41.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3969
66.7%
Space Separator 1097
 
18.4%
Other Punctuation 734
 
12.3%
Open Punctuation 77
 
1.3%
Close Punctuation 77
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
322
 
8.1%
246
 
6.2%
239
 
6.0%
236
 
5.9%
223
 
5.6%
198
 
5.0%
179
 
4.5%
164
 
4.1%
164
 
4.1%
164
 
4.1%
Other values (38) 1834
46.2%
Other Punctuation
ValueCountFrequency (%)
, 510
69.5%
/ 224
30.5%
Space Separator
ValueCountFrequency (%)
1097
100.0%
Open Punctuation
ValueCountFrequency (%)
( 77
100.0%
Close Punctuation
ValueCountFrequency (%)
) 77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3969
66.7%
Common 1985
33.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
322
 
8.1%
246
 
6.2%
239
 
6.0%
236
 
5.9%
223
 
5.6%
198
 
5.0%
179
 
4.5%
164
 
4.1%
164
 
4.1%
164
 
4.1%
Other values (38) 1834
46.2%
Common
ValueCountFrequency (%)
1097
55.3%
, 510
25.7%
/ 224
 
11.3%
( 77
 
3.9%
) 77
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3969
66.7%
ASCII 1985
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1097
55.3%
, 510
25.7%
/ 224
 
11.3%
( 77
 
3.9%
) 77
 
3.9%
Hangul
ValueCountFrequency (%)
322
 
8.1%
246
 
6.2%
239
 
6.0%
236
 
5.9%
223
 
5.6%
198
 
5.0%
179
 
4.5%
164
 
4.1%
164
 
4.1%
164
 
4.1%
Other values (38) 1834
46.2%
Distinct926
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
2023-12-12T14:52:46.227550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length17
Mean length7.4297872
Min length1

Characters and Unicode

Total characters6984
Distinct characters561
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique912 ?
Unique (%)97.0%

Sample

1st row세컨드찬스
2nd row올가스티치
3rd rowHyun Ceramic
4th row제이엠비스튜디오
5th row이조
ValueCountFrequency (%)
스튜디오 12
 
1.1%
공방 11
 
1.0%
주식회사 6
 
0.5%
도예공방 5
 
0.4%
협동조합 4
 
0.4%
세라믹 4
 
0.4%
전시관 4
 
0.4%
4
 
0.4%
우드스튜디오 3
 
0.3%
도예연구소 3
 
0.3%
Other values (1027) 1060
95.0%
2023-12-12T14:52:46.698066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
244
 
3.5%
197
 
2.8%
176
 
2.5%
168
 
2.4%
) 155
 
2.2%
( 153
 
2.2%
149
 
2.1%
148
 
2.1%
146
 
2.1%
139
 
2.0%
Other values (551) 5309
76.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6287
90.0%
Space Separator 176
 
2.5%
Close Punctuation 156
 
2.2%
Open Punctuation 154
 
2.2%
Uppercase Letter 93
 
1.3%
Lowercase Letter 78
 
1.1%
Decimal Number 25
 
0.4%
Other Punctuation 10
 
0.1%
Connector Punctuation 2
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
244
 
3.9%
197
 
3.1%
168
 
2.7%
149
 
2.4%
148
 
2.4%
146
 
2.3%
139
 
2.2%
129
 
2.1%
125
 
2.0%
115
 
1.8%
Other values (485) 4727
75.2%
Uppercase Letter
ValueCountFrequency (%)
O 10
10.8%
D 9
9.7%
C 8
 
8.6%
N 8
 
8.6%
A 8
 
8.6%
I 7
 
7.5%
T 7
 
7.5%
E 5
 
5.4%
L 5
 
5.4%
M 4
 
4.3%
Other values (12) 22
23.7%
Lowercase Letter
ValueCountFrequency (%)
o 12
15.4%
i 8
10.3%
e 6
 
7.7%
r 6
 
7.7%
a 6
 
7.7%
t 5
 
6.4%
s 5
 
6.4%
y 4
 
5.1%
n 4
 
5.1%
f 4
 
5.1%
Other values (9) 18
23.1%
Decimal Number
ValueCountFrequency (%)
2 6
24.0%
1 5
20.0%
6 3
12.0%
8 3
12.0%
0 2
 
8.0%
5 2
 
8.0%
4 1
 
4.0%
3 1
 
4.0%
7 1
 
4.0%
9 1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
& 3
30.0%
' 2
20.0%
. 2
20.0%
, 1
 
10.0%
/ 1
 
10.0%
· 1
 
10.0%
Close Punctuation
ValueCountFrequency (%)
) 155
99.4%
] 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 153
99.4%
[ 1
 
0.6%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
176
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6282
89.9%
Common 526
 
7.5%
Latin 171
 
2.4%
Han 5
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
244
 
3.9%
197
 
3.1%
168
 
2.7%
149
 
2.4%
148
 
2.4%
146
 
2.3%
139
 
2.2%
129
 
2.1%
125
 
2.0%
115
 
1.8%
Other values (481) 4722
75.2%
Latin
ValueCountFrequency (%)
o 12
 
7.0%
O 10
 
5.8%
D 9
 
5.3%
C 8
 
4.7%
i 8
 
4.7%
N 8
 
4.7%
A 8
 
4.7%
I 7
 
4.1%
T 7
 
4.1%
e 6
 
3.5%
Other values (31) 88
51.5%
Common
ValueCountFrequency (%)
176
33.5%
) 155
29.5%
( 153
29.1%
2 6
 
1.1%
1 5
 
1.0%
6 3
 
0.6%
& 3
 
0.6%
8 3
 
0.6%
' 2
 
0.4%
. 2
 
0.4%
Other values (15) 18
 
3.4%
Han
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6282
89.9%
ASCII 696
 
10.0%
CJK 5
 
0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
244
 
3.9%
197
 
3.1%
168
 
2.7%
149
 
2.4%
148
 
2.4%
146
 
2.3%
139
 
2.2%
129
 
2.1%
125
 
2.0%
115
 
1.8%
Other values (481) 4722
75.2%
ASCII
ValueCountFrequency (%)
176
25.3%
) 155
22.3%
( 153
22.0%
o 12
 
1.7%
O 10
 
1.4%
D 9
 
1.3%
C 8
 
1.1%
i 8
 
1.1%
N 8
 
1.1%
A 8
 
1.1%
Other values (55) 149
21.4%
CJK
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
None
ValueCountFrequency (%)
· 1
100.0%

영문업체명
Text

MISSING 

Distinct811
Distinct (%)98.7%
Missing118
Missing (%)12.6%
Memory size7.5 KiB
2023-12-12T14:52:47.004085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length42
Mean length16.843066
Min length2

Characters and Unicode

Total characters13845
Distinct characters110
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique801 ?
Unique (%)97.4%

Sample

1st rowSecond Chance
2nd rowallgastitch
3rd rowHyun Ceramic
4th rowJMBstudio
5th rowYZO
ValueCountFrequency (%)
korea 72
 
3.8%
foundation 49
 
2.6%
museum 49
 
2.6%
studio 47
 
2.5%
association 46
 
2.4%
art 45
 
2.4%
cultural 39
 
2.1%
of 35
 
1.9%
ceramic 32
 
1.7%
culture 31
 
1.6%
Other values (977) 1440
76.4%
2023-12-12T14:52:47.520178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 1094
 
7.9%
1063
 
7.7%
a 1040
 
7.5%
e 994
 
7.2%
n 811
 
5.9%
t 776
 
5.6%
r 773
 
5.6%
i 699
 
5.0%
u 609
 
4.4%
s 498
 
3.6%
Other values (100) 5488
39.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 10174
73.5%
Uppercase Letter 2392
 
17.3%
Space Separator 1063
 
7.7%
Other Punctuation 76
 
0.5%
Other Letter 37
 
0.3%
Decimal Number 36
 
0.3%
Connector Punctuation 23
 
0.2%
Dash Punctuation 19
 
0.1%
Close Punctuation 10
 
0.1%
Open Punctuation 10
 
0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
8.1%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (20) 20
54.1%
Lowercase Letter
ValueCountFrequency (%)
o 1094
10.8%
a 1040
10.2%
e 994
9.8%
n 811
 
8.0%
t 776
 
7.6%
r 773
 
7.6%
i 699
 
6.9%
u 609
 
6.0%
s 498
 
4.9%
l 450
 
4.4%
Other values (16) 2430
23.9%
Uppercase Letter
ValueCountFrequency (%)
A 254
 
10.6%
C 210
 
8.8%
O 184
 
7.7%
N 174
 
7.3%
I 138
 
5.8%
S 130
 
5.4%
E 123
 
5.1%
M 112
 
4.7%
K 110
 
4.6%
D 99
 
4.1%
Other values (16) 858
35.9%
Decimal Number
ValueCountFrequency (%)
2 8
22.2%
1 8
22.2%
0 4
11.1%
8 3
 
8.3%
9 3
 
8.3%
6 3
 
8.3%
5 2
 
5.6%
7 2
 
5.6%
4 2
 
5.6%
3 1
 
2.8%
Other Punctuation
ValueCountFrequency (%)
& 31
40.8%
. 30
39.5%
' 8
 
10.5%
, 4
 
5.3%
: 2
 
2.6%
/ 1
 
1.3%
Math Symbol
ValueCountFrequency (%)
= 1
33.3%
< 1
33.3%
> 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 8
80.0%
] 2
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 8
80.0%
[ 2
 
20.0%
Space Separator
ValueCountFrequency (%)
1063
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 12566
90.8%
Common 1242
 
9.0%
Hangul 37
 
0.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 1094
 
8.7%
a 1040
 
8.3%
e 994
 
7.9%
n 811
 
6.5%
t 776
 
6.2%
r 773
 
6.2%
i 699
 
5.6%
u 609
 
4.8%
s 498
 
4.0%
l 450
 
3.6%
Other values (42) 4822
38.4%
Hangul
ValueCountFrequency (%)
3
 
8.1%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (20) 20
54.1%
Common
ValueCountFrequency (%)
1063
85.6%
& 31
 
2.5%
. 30
 
2.4%
_ 23
 
1.9%
- 19
 
1.5%
' 8
 
0.6%
2 8
 
0.6%
) 8
 
0.6%
( 8
 
0.6%
1 8
 
0.6%
Other values (18) 36
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13807
99.7%
Hangul 37
 
0.3%
Geometric Shapes 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 1094
 
7.9%
1063
 
7.7%
a 1040
 
7.5%
e 994
 
7.2%
n 811
 
5.9%
t 776
 
5.6%
r 773
 
5.6%
i 699
 
5.1%
u 609
 
4.4%
s 498
 
3.6%
Other values (69) 5450
39.5%
Hangul
ValueCountFrequency (%)
3
 
8.1%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (20) 20
54.1%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%

사업자 구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
개인
556 
법인
384 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인
3rd row개인
4th row법인
5th row개인

Common Values

ValueCountFrequency (%)
개인 556
59.1%
법인 384
40.9%

Length

2023-12-12T14:52:47.725702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:52:47.860566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 556
59.1%
법인 384
40.9%

공공성
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
민간
743 
공공
133 
공공/민간
 
64

Length

Max length5
Median length2
Mean length2.2042553
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row민간
2nd row민간
3rd row민간
4th row민간
5th row민간

Common Values

ValueCountFrequency (%)
민간 743
79.0%
공공 133
 
14.1%
공공/민간 64
 
6.8%

Length

2023-12-12T14:52:48.008754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:52:48.133797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간 743
79.0%
공공 133
 
14.1%
공공/민간 64
 
6.8%

대표자 성명
Text

MISSING 

Distinct844
Distinct (%)91.4%
Missing17
Missing (%)1.8%
Memory size7.5 KiB
2023-12-12T14:52:48.492887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length3
Mean length3.0942579
Min length2

Characters and Unicode

Total characters2856
Distinct characters239
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique777 ?
Unique (%)84.2%

Sample

1st row차지현
2nd row이지영
3rd row이주현
4th row최지영
5th row윤준오
ValueCountFrequency (%)
김진희 6
 
0.6%
이준일 3
 
0.3%
3
 
0.3%
이광진 3
 
0.3%
김흥수 3
 
0.3%
김지희 3
 
0.3%
조정환 3
 
0.3%
김지연 3
 
0.3%
김승희 3
 
0.3%
마리 3
 
0.3%
Other values (848) 908
96.5%
2023-12-12T14:52:48.995838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
173
 
6.1%
137
 
4.8%
125
 
4.4%
88
 
3.1%
79
 
2.8%
74
 
2.6%
73
 
2.6%
69
 
2.4%
56
 
2.0%
50
 
1.8%
Other values (229) 1932
67.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2806
98.2%
Space Separator 18
 
0.6%
Lowercase Letter 18
 
0.6%
Other Punctuation 5
 
0.2%
Open Punctuation 3
 
0.1%
Close Punctuation 3
 
0.1%
Decimal Number 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
173
 
6.2%
137
 
4.9%
125
 
4.5%
88
 
3.1%
79
 
2.8%
74
 
2.6%
73
 
2.6%
69
 
2.5%
56
 
2.0%
50
 
1.8%
Other values (213) 1882
67.1%
Lowercase Letter
ValueCountFrequency (%)
e 3
16.7%
a 3
16.7%
l 2
11.1%
r 2
11.1%
h 2
11.1%
d 1
 
5.6%
k 1
 
5.6%
s 1
 
5.6%
b 1
 
5.6%
t 1
 
5.6%
Space Separator
ValueCountFrequency (%)
18
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Decimal Number
ValueCountFrequency (%)
1 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2806
98.2%
Common 32
 
1.1%
Latin 18
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
173
 
6.2%
137
 
4.9%
125
 
4.5%
88
 
3.1%
79
 
2.8%
74
 
2.6%
73
 
2.6%
69
 
2.5%
56
 
2.0%
50
 
1.8%
Other values (213) 1882
67.1%
Latin
ValueCountFrequency (%)
e 3
16.7%
a 3
16.7%
l 2
11.1%
r 2
11.1%
h 2
11.1%
d 1
 
5.6%
k 1
 
5.6%
s 1
 
5.6%
b 1
 
5.6%
t 1
 
5.6%
Common
ValueCountFrequency (%)
18
56.2%
, 5
 
15.6%
( 3
 
9.4%
) 3
 
9.4%
1 3
 
9.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2806
98.2%
ASCII 50
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
173
 
6.2%
137
 
4.9%
125
 
4.5%
88
 
3.1%
79
 
2.8%
74
 
2.6%
73
 
2.6%
69
 
2.5%
56
 
2.0%
50
 
1.8%
Other values (213) 1882
67.1%
ASCII
ValueCountFrequency (%)
18
36.0%
, 5
 
10.0%
e 3
 
6.0%
a 3
 
6.0%
( 3
 
6.0%
) 3
 
6.0%
1 3
 
6.0%
l 2
 
4.0%
r 2
 
4.0%
h 2
 
4.0%
Other values (6) 6
 
12.0%

창립년도
Date

MISSING 

Distinct69
Distinct (%)8.8%
Missing153
Missing (%)16.3%
Memory size7.5 KiB
Minimum1888-01-01 00:00:00
Maximum2023-01-01 00:00:00
2023-12-12T14:52:49.126289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:52:49.263048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

홈페이지
Text

MISSING 

Distinct511
Distinct (%)97.0%
Missing413
Missing (%)43.9%
Memory size7.5 KiB
2023-12-12T14:52:49.491782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length122
Median length41
Mean length24.711575
Min length7

Characters and Unicode

Total characters13023
Distinct characters86
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique496 ?
Unique (%)94.1%

Sample

1st rowhttps://www.instagram.com/second_chance_jeju
2nd rowhttp://allgastitch.modoo.at
3rd rowyzo.co.kr
4th roweoeoart.com
5th rowwww.kostick.kr
ValueCountFrequency (%)
www.k-craft.co.kr 3
 
0.6%
http://www.handicraft.co.kr 3
 
0.6%
https://blog.naver.com/sypark50213 2
 
0.4%
http://www.gnhand.co.kr 2
 
0.4%
http://www.mmca.go.kr/main.do 2
 
0.4%
http://www.buan.go.kr/buancela/index.buan 2
 
0.4%
http://www.hanjiworld.org 2
 
0.4%
http://www.chf.or.kr 2
 
0.4%
http://www.naturaldyeing.or.kr 2
 
0.4%
www.nongbang.co.kr 2
 
0.4%
Other values (501) 508
95.8%
2023-12-12T14:52:49.873766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 1158
 
8.9%
t 995
 
7.6%
/ 992
 
7.6%
o 984
 
7.6%
w 935
 
7.2%
r 755
 
5.8%
a 690
 
5.3%
m 561
 
4.3%
c 558
 
4.3%
e 493
 
3.8%
Other values (76) 4902
37.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 10083
77.4%
Other Punctuation 2491
 
19.1%
Decimal Number 280
 
2.2%
Connector Punctuation 63
 
0.5%
Uppercase Letter 42
 
0.3%
Dash Punctuation 29
 
0.2%
Other Letter 20
 
0.2%
Math Symbol 10
 
0.1%
Space Separator 3
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 995
 
9.9%
o 984
 
9.8%
w 935
 
9.3%
r 755
 
7.5%
a 690
 
6.8%
m 561
 
5.6%
c 558
 
5.5%
e 493
 
4.9%
n 479
 
4.8%
h 469
 
4.7%
Other values (16) 3164
31.4%
Uppercase Letter
ValueCountFrequency (%)
O 5
11.9%
I 4
 
9.5%
W 4
 
9.5%
T 3
 
7.1%
B 3
 
7.1%
N 3
 
7.1%
D 3
 
7.1%
Z 2
 
4.8%
U 2
 
4.8%
F 2
 
4.8%
Other values (9) 11
26.2%
Other Letter
ValueCountFrequency (%)
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%
Decimal Number
ValueCountFrequency (%)
0 65
23.2%
2 45
16.1%
1 41
14.6%
5 26
 
9.3%
9 20
 
7.1%
8 18
 
6.4%
7 18
 
6.4%
4 17
 
6.1%
6 16
 
5.7%
3 14
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 1158
46.5%
/ 992
39.8%
: 321
 
12.9%
@ 5
 
0.2%
? 5
 
0.2%
, 5
 
0.2%
& 3
 
0.1%
% 2
 
0.1%
Connector Punctuation
ValueCountFrequency (%)
_ 63
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%
Math Symbol
ValueCountFrequency (%)
= 10
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 10125
77.7%
Common 2878
 
22.1%
Hangul 20
 
0.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 995
 
9.8%
o 984
 
9.7%
w 935
 
9.2%
r 755
 
7.5%
a 690
 
6.8%
m 561
 
5.5%
c 558
 
5.5%
e 493
 
4.9%
n 479
 
4.7%
h 469
 
4.6%
Other values (35) 3206
31.7%
Common
ValueCountFrequency (%)
. 1158
40.2%
/ 992
34.5%
: 321
 
11.2%
0 65
 
2.3%
_ 63
 
2.2%
2 45
 
1.6%
1 41
 
1.4%
- 29
 
1.0%
5 26
 
0.9%
9 20
 
0.7%
Other values (14) 118
 
4.1%
Hangul
ValueCountFrequency (%)
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13003
99.8%
Hangul 20
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 1158
 
8.9%
t 995
 
7.7%
/ 992
 
7.6%
o 984
 
7.6%
w 935
 
7.2%
r 755
 
5.8%
a 690
 
5.3%
m 561
 
4.3%
c 558
 
4.3%
e 493
 
3.8%
Other values (59) 4882
37.5%
Hangul
ValueCountFrequency (%)
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%
Distinct870
Distinct (%)92.8%
Missing3
Missing (%)0.3%
Memory size7.5 KiB
2023-12-12T14:52:50.264471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length37
Mean length18.639274
Min length2

Characters and Unicode

Total characters17465
Distinct characters461
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique835 ?
Unique (%)89.1%

Sample

1st row제주
2nd row청주시 상당구 단재로 118-2
3rd row경기도 고양시 일산동구 공릉천로 187번길
4th row서울시 송파구 올림픽로35가길 11, 잠실 한신코아 오피스텔 1409호
5th row경북 상주시 내서면 능암1길 26-3
ValueCountFrequency (%)
서울특별시 134
 
3.4%
경기도 131
 
3.3%
서울시 89
 
2.2%
서울 83
 
2.1%
1층 60
 
1.5%
종로구 58
 
1.5%
2층 47
 
1.2%
중구 35
 
0.9%
강원도 35
 
0.9%
3층 27
 
0.7%
Other values (1871) 3269
82.4%
2023-12-12T14:52:50.977409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3031
 
17.4%
1 724
 
4.1%
694
 
4.0%
596
 
3.4%
583
 
3.3%
2 483
 
2.8%
424
 
2.4%
3 394
 
2.3%
336
 
1.9%
318
 
1.8%
Other values (451) 9882
56.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10702
61.3%
Decimal Number 3159
 
18.1%
Space Separator 3031
 
17.4%
Other Punctuation 211
 
1.2%
Dash Punctuation 189
 
1.1%
Uppercase Letter 69
 
0.4%
Close Punctuation 36
 
0.2%
Open Punctuation 36
 
0.2%
Lowercase Letter 27
 
0.2%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
694
 
6.5%
596
 
5.6%
583
 
5.4%
424
 
4.0%
336
 
3.1%
318
 
3.0%
288
 
2.7%
270
 
2.5%
237
 
2.2%
207
 
1.9%
Other values (397) 6749
63.1%
Uppercase Letter
ValueCountFrequency (%)
B 20
29.0%
A 6
 
8.7%
T 5
 
7.2%
S 5
 
7.2%
C 4
 
5.8%
E 4
 
5.8%
F 4
 
5.8%
K 3
 
4.3%
H 2
 
2.9%
U 2
 
2.9%
Other values (10) 14
20.3%
Lowercase Letter
ValueCountFrequency (%)
b 8
29.6%
o 3
 
11.1%
r 2
 
7.4%
n 2
 
7.4%
i 2
 
7.4%
f 2
 
7.4%
c 1
 
3.7%
g 1
 
3.7%
w 1
 
3.7%
e 1
 
3.7%
Other values (4) 4
14.8%
Decimal Number
ValueCountFrequency (%)
1 724
22.9%
2 483
15.3%
3 394
12.5%
0 302
9.6%
4 275
 
8.7%
5 246
 
7.8%
6 234
 
7.4%
7 207
 
6.6%
8 156
 
4.9%
9 138
 
4.4%
Other Punctuation
ValueCountFrequency (%)
, 194
91.9%
. 14
 
6.6%
& 2
 
0.9%
/ 1
 
0.5%
Space Separator
ValueCountFrequency (%)
3031
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 189
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10701
61.3%
Common 6667
38.2%
Latin 96
 
0.5%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
694
 
6.5%
596
 
5.6%
583
 
5.4%
424
 
4.0%
336
 
3.1%
318
 
3.0%
288
 
2.7%
270
 
2.5%
237
 
2.2%
207
 
1.9%
Other values (396) 6748
63.1%
Latin
ValueCountFrequency (%)
B 20
20.8%
b 8
 
8.3%
A 6
 
6.2%
T 5
 
5.2%
S 5
 
5.2%
C 4
 
4.2%
E 4
 
4.2%
F 4
 
4.2%
o 3
 
3.1%
K 3
 
3.1%
Other values (24) 34
35.4%
Common
ValueCountFrequency (%)
3031
45.5%
1 724
 
10.9%
2 483
 
7.2%
3 394
 
5.9%
0 302
 
4.5%
4 275
 
4.1%
5 246
 
3.7%
6 234
 
3.5%
7 207
 
3.1%
, 194
 
2.9%
Other values (10) 577
 
8.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10701
61.3%
ASCII 6763
38.7%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3031
44.8%
1 724
 
10.7%
2 483
 
7.1%
3 394
 
5.8%
0 302
 
4.5%
4 275
 
4.1%
5 246
 
3.6%
6 234
 
3.5%
7 207
 
3.1%
, 194
 
2.9%
Other values (44) 673
 
10.0%
Hangul
ValueCountFrequency (%)
694
 
6.5%
596
 
5.6%
583
 
5.4%
424
 
4.0%
336
 
3.1%
318
 
3.0%
288
 
2.7%
270
 
2.5%
237
 
2.2%
207
 
1.9%
Other values (396) 6748
63.1%
CJK
ValueCountFrequency (%)
1
100.0%

사용여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
사용
932 
미사용
 
8

Length

Max length3
Median length2
Mean length2.0085106
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사용
2nd row사용
3rd row사용
4th row사용
5th row사용

Common Values

ValueCountFrequency (%)
사용 932
99.1%
미사용 8
 
0.9%

Length

2023-12-12T14:52:51.126541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:52:51.209538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사용 932
99.1%
미사용 8
 
0.9%

Interactions

2023-12-12T14:52:44.456856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:52:51.265924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호공예사업체사업자 구분공공성창립년도사용여부
번호1.0000.4290.8830.5890.6690.169
공예사업체0.4291.0000.3000.2840.1870.000
사업자 구분0.8830.3001.0000.2730.6290.108
공공성0.5890.2840.2731.0000.4480.000
창립년도0.6690.1870.6290.4481.0000.000
사용여부0.1690.0000.1080.0000.0001.000
2023-12-12T14:52:51.360083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용여부공공성공예사업체사업자 구분
사용여부1.0000.0000.0000.069
공공성0.0001.0000.4590.443
공예사업체0.0000.4591.0000.194
사업자 구분0.0690.4430.1941.000
2023-12-12T14:52:51.446870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호공예사업체사업자 구분공공성사용여부
번호1.0000.3280.7150.4310.129
공예사업체0.3281.0000.1940.4590.000
사업자 구분0.7150.1941.0000.4430.069
공공성0.4310.4590.4431.0000.000
사용여부0.1290.0000.0690.0001.000

Missing values

2023-12-12T14:52:44.623725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:52:44.835218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T14:52:44.973505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호공예사업체공예분류업체명영문업체명사업자 구분공공성대표자 성명창립년도홈페이지본사 소재지사용여부
01공예사업체도자 , 특수/복합소재 , 기타세컨드찬스Second Chance개인민간차지현2020-01-01https://www.instagram.com/second_chance_jeju제주사용
12공예사업체섬유올가스티치allgastitch개인민간이지영<NA>http://allgastitch.modoo.at청주시 상당구 단재로 118-2사용
23공예사업체도자Hyun CeramicHyun Ceramic개인민간이주현2023-01-01<NA>경기도 고양시 일산동구 공릉천로 187번길사용
34공예사업체섬유제이엠비스튜디오JMBstudio법인민간최지영2020-01-01<NA>서울시 송파구 올림픽로35가길 11, 잠실 한신코아 오피스텔 1409호사용
45공예 외금속 , 디자인이조YZO개인민간윤준오2023-01-01yzo.co.kr경북 상주시 내서면 능암1길 26-3사용
56공예사업체기타이오이오(eoeo)eoeoart개인민간이주학2020-01-01eoeoart.com인천 부평구 장제로329번길46(b동2층)사용
67공예사업체금속주식회사 코스틱KOSTIC CO., LTD.법인민간이병식2015-01-01www.kostick.kr서울시 성동구 성수동 656-593 1층사용
78공예사업체도자김민섭도자기KIMMINSEOP CERAMICS개인민간김민섭2019-01-01https://www.wrkshpcrypt.com서울 서대문구 북아현로 100사용
89공예사업체유리미쟈르미쟈mizarmiza개인민간서미경2019-01-01mizarmiza.com서울특별시 서대문구 연희동사용
910공예사업체목 , 특수/복합소재 , 기타글꽃공방geulkkot개인민간김세란2010-01-01<NA>충남사용
번호공예사업체공예분류업체명영문업체명사업자 구분공공성대표자 성명창립년도홈페이지본사 소재지사용여부
930931공예사업체기타(사)의석공예문화협회<NA>법인민간유희자2019-07-16<NA><NA>사용
931932공예사업체섬유(사)서울퀼트협회<NA>법인민간윤혜경2019-07-16<NA><NA>사용
932933공예사업체특수/복합소재(사)서울무형문화재기능보존회Seoul Intangible Cultural Heritage Center법인민간정명채2019-07-16<NA>서울특별시 종로구 북촌로 20-13, 북촌 교육전시장사용
933934공예 외기타(사)민족미술인협회The Korea People's Artists Association법인민간이종헌1969-01-01<NA>서울특별시 마포구 망원로8길 74사용
934935공예사업체(사)대한장애인공예협회<NA>법인민간고민숙2019-07-16<NA>서울특별시 노원구 한글비석로 396, 벽산상가 108동 지하30호사용
935936공예사업체기타(사)대한산업미술가협회<NA>법인민간이상태1945-01-01<NA>서울특별시 도봉구 해동로 17길 115사용
936937공예사업체특수/복합소재(사)대한민국명인회Korea Masters Association법인공공윤상호2004-11-01<NA>서울특별시 강남구 개포로 232, 희영빌딩 3층사용
937938공예사업체특수/복합소재(사)근대황실공예문화협회Modern Imperial Craft and Culture Association법인공공이칠용2019-01-01<NA>서울특별시 중구 삼일대로 326번지사용
938939공예사업체특수/복합소재(사)국가무형문화재기능협회National Intangible Cultural Heritage Association법인민간박종군1973-01-01<NA>서울특별시 강남구 봉은사로 406사용
939940공예사업체도자(사)고려닥종이공예협회Korean Paper art association법인공공전흥자2005-01-01<NA>서울특별시 광진구 아차산로65길 6사용