Overview

Dataset statistics

Number of variables9
Number of observations4988
Missing cells1681
Missing cells (%)3.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory350.8 KiB
Average record size in memory72.0 B

Variable types

Text6
Categorical2
DateTime1

Dataset

Description파주시 제조업 등록현황 데이터로서 회사명, 도로명주소, 지번주소, 업종명(오프셋 인쇄업, 목재가구 제조업, 콘크리트 제품제조업, 위생용 종이제품 제조업), 최초등록일 등의 정보를 제공합니다.
Author경기도 파주시
URLhttps://www.data.go.kr/data/15020818/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
관리기관명 is highly overall correlated with 관리기관전화번호High correlation
관리기관전화번호 is highly overall correlated with 관리기관명High correlation
관리기관명 is highly imbalanced (86.6%)Imbalance
소재지도로명주소 has 609 (12.2%) missing valuesMissing
전화번호 has 1072 (21.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 14:57:30.504904
Analysis finished2023-12-12 14:57:32.407099
Duration1.9 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct4699
Distinct (%)94.2%
Missing0
Missing (%)0.0%
Memory size39.1 KiB
2023-12-12T23:57:32.612500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length22
Mean length6.9151965
Min length1

Characters and Unicode

Total characters34493
Distinct characters711
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4441 ?
Unique (%)89.0%

Sample

1st row(사)내일을여는멋진여성 중증장애인사업단
2nd row(사)대한문화체육교육협회 가구사업부
3rd row(사)부산지체장애인단체협의회 제1공장
4th row(사)장애인고용진흥회 파주사업소
5th row(사)한국검정교과서
ValueCountFrequency (%)
주식회사 112
 
2.1%
농업회사법인 21
 
0.4%
제2공장 12
 
0.2%
도서출판 12
 
0.2%
주)코아스 8
 
0.2%
파주지점 7
 
0.1%
영진산업 4
 
0.1%
주)스미스테크 4
 
0.1%
korea 4
 
0.1%
파주공장 4
 
0.1%
Other values (4741) 5071
96.4%
2023-12-12T23:57:33.125023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3075
 
8.9%
( 2928
 
8.5%
) 2928
 
8.5%
1036
 
3.0%
941
 
2.7%
584
 
1.7%
504
 
1.5%
499
 
1.4%
495
 
1.4%
457
 
1.3%
Other values (701) 21046
61.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27377
79.4%
Open Punctuation 2930
 
8.5%
Close Punctuation 2930
 
8.5%
Space Separator 499
 
1.4%
Uppercase Letter 476
 
1.4%
Decimal Number 86
 
0.2%
Lowercase Letter 65
 
0.2%
Other Punctuation 63
 
0.2%
Other Symbol 61
 
0.2%
Dash Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3075
 
11.2%
1036
 
3.8%
941
 
3.4%
584
 
2.1%
504
 
1.8%
495
 
1.8%
457
 
1.7%
441
 
1.6%
410
 
1.5%
396
 
1.4%
Other values (636) 19038
69.5%
Uppercase Letter
ValueCountFrequency (%)
S 61
 
12.8%
C 46
 
9.7%
E 39
 
8.2%
N 32
 
6.7%
P 32
 
6.7%
T 26
 
5.5%
G 26
 
5.5%
H 20
 
4.2%
D 20
 
4.2%
K 19
 
4.0%
Other values (14) 155
32.6%
Lowercase Letter
ValueCountFrequency (%)
e 9
13.8%
o 8
12.3%
a 8
12.3%
r 7
10.8%
i 5
 
7.7%
t 3
 
4.6%
n 3
 
4.6%
u 3
 
4.6%
l 3
 
4.6%
c 2
 
3.1%
Other values (10) 14
21.5%
Decimal Number
ValueCountFrequency (%)
2 41
47.7%
1 21
24.4%
3 9
 
10.5%
0 5
 
5.8%
5 4
 
4.7%
4 3
 
3.5%
7 2
 
2.3%
6 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
. 31
49.2%
& 27
42.9%
/ 3
 
4.8%
, 1
 
1.6%
· 1
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 2928
99.9%
[ 2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 2928
99.9%
] 2
 
0.1%
Space Separator
ValueCountFrequency (%)
499
100.0%
Other Symbol
ValueCountFrequency (%)
61
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27438
79.5%
Common 6514
 
18.9%
Latin 541
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3075
 
11.2%
1036
 
3.8%
941
 
3.4%
584
 
2.1%
504
 
1.8%
495
 
1.8%
457
 
1.7%
441
 
1.6%
410
 
1.5%
396
 
1.4%
Other values (637) 19099
69.6%
Latin
ValueCountFrequency (%)
S 61
 
11.3%
C 46
 
8.5%
E 39
 
7.2%
N 32
 
5.9%
P 32
 
5.9%
T 26
 
4.8%
G 26
 
4.8%
H 20
 
3.7%
D 20
 
3.7%
K 19
 
3.5%
Other values (34) 220
40.7%
Common
ValueCountFrequency (%)
( 2928
44.9%
) 2928
44.9%
499
 
7.7%
2 41
 
0.6%
. 31
 
0.5%
& 27
 
0.4%
1 21
 
0.3%
3 9
 
0.1%
- 5
 
0.1%
0 5
 
0.1%
Other values (10) 20
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27377
79.4%
ASCII 7054
 
20.5%
None 62
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3075
 
11.2%
1036
 
3.8%
941
 
3.4%
584
 
2.1%
504
 
1.8%
495
 
1.8%
457
 
1.7%
441
 
1.6%
410
 
1.5%
396
 
1.4%
Other values (636) 19038
69.5%
ASCII
ValueCountFrequency (%)
( 2928
41.5%
) 2928
41.5%
499
 
7.1%
S 61
 
0.9%
C 46
 
0.7%
2 41
 
0.6%
E 39
 
0.6%
N 32
 
0.5%
P 32
 
0.5%
. 31
 
0.4%
Other values (53) 417
 
5.9%
None
ValueCountFrequency (%)
61
98.4%
· 1
 
1.6%
Distinct4140
Distinct (%)94.5%
Missing609
Missing (%)12.2%
Memory size39.1 KiB
2023-12-12T23:57:33.448305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length48
Mean length26.216716
Min length14

Characters and Unicode

Total characters114803
Distinct characters511
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3932 ?
Unique (%)89.8%

Sample

1st row경기도 파주시 지목로75번길 14
2nd row경기도 파주시 광탄면 부흥로359번길 190
3rd row경기도 파주시 교하로505번길 16-27
4th row경기도 파주시 산남로157번길 30 (산남동)
5th row경기도 파주시 조리읍 당재봉로 29-28 (한국검정교과서)
ValueCountFrequency (%)
경기도 4380
 
17.1%
파주시 4380
 
17.1%
797
 
3.1%
광탄면 747
 
2.9%
조리읍 597
 
2.3%
탄현면 568
 
2.2%
1필지 429
 
1.7%
월롱면 423
 
1.7%
파주읍 338
 
1.3%
법원읍 209
 
0.8%
Other values (3705) 12766
49.8%
2023-12-12T23:57:33.858644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21402
 
18.6%
5067
 
4.4%
4875
 
4.2%
4506
 
3.9%
4456
 
3.9%
4422
 
3.9%
4403
 
3.8%
1 4246
 
3.7%
2876
 
2.5%
2 2803
 
2.4%
Other values (501) 55747
48.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 65690
57.2%
Space Separator 21402
 
18.6%
Decimal Number 19716
 
17.2%
Open Punctuation 2339
 
2.0%
Close Punctuation 2338
 
2.0%
Dash Punctuation 1943
 
1.7%
Other Punctuation 1053
 
0.9%
Uppercase Letter 272
 
0.2%
Lowercase Letter 42
 
< 0.1%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5067
 
7.7%
4875
 
7.4%
4506
 
6.9%
4456
 
6.8%
4422
 
6.7%
4403
 
6.7%
2876
 
4.4%
2584
 
3.9%
2068
 
3.1%
1959
 
3.0%
Other values (443) 28474
43.3%
Uppercase Letter
ValueCountFrequency (%)
A 43
15.8%
B 29
10.7%
C 28
10.3%
E 22
 
8.1%
T 19
 
7.0%
S 17
 
6.2%
P 15
 
5.5%
M 15
 
5.5%
L 12
 
4.4%
G 10
 
3.7%
Other values (12) 62
22.8%
Lowercase Letter
ValueCountFrequency (%)
c 8
19.0%
s 6
14.3%
e 6
14.3%
h 6
14.3%
t 2
 
4.8%
i 2
 
4.8%
u 2
 
4.8%
k 2
 
4.8%
r 2
 
4.8%
a 2
 
4.8%
Other values (2) 4
9.5%
Decimal Number
ValueCountFrequency (%)
1 4246
21.5%
2 2803
14.2%
3 2405
12.2%
4 1915
9.7%
5 1642
 
8.3%
6 1551
 
7.9%
0 1398
 
7.1%
7 1357
 
6.9%
8 1255
 
6.4%
9 1144
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 1012
96.1%
. 26
 
2.5%
& 12
 
1.1%
· 1
 
0.1%
* 1
 
0.1%
/ 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 2338
> 99.9%
[ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 2337
> 99.9%
] 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
21402
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1943
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 65690
57.2%
Common 48799
42.5%
Latin 314
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5067
 
7.7%
4875
 
7.4%
4506
 
6.9%
4456
 
6.8%
4422
 
6.7%
4403
 
6.7%
2876
 
4.4%
2584
 
3.9%
2068
 
3.1%
1959
 
3.0%
Other values (443) 28474
43.3%
Latin
ValueCountFrequency (%)
A 43
13.7%
B 29
 
9.2%
C 28
 
8.9%
E 22
 
7.0%
T 19
 
6.1%
S 17
 
5.4%
P 15
 
4.8%
M 15
 
4.8%
L 12
 
3.8%
G 10
 
3.2%
Other values (24) 104
33.1%
Common
ValueCountFrequency (%)
21402
43.9%
1 4246
 
8.7%
2 2803
 
5.7%
3 2405
 
4.9%
( 2338
 
4.8%
) 2337
 
4.8%
- 1943
 
4.0%
4 1915
 
3.9%
5 1642
 
3.4%
6 1551
 
3.2%
Other values (14) 6217
 
12.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 65690
57.2%
ASCII 49112
42.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21402
43.6%
1 4246
 
8.6%
2 2803
 
5.7%
3 2405
 
4.9%
( 2338
 
4.8%
) 2337
 
4.8%
- 1943
 
4.0%
4 1915
 
3.9%
5 1642
 
3.3%
6 1551
 
3.2%
Other values (47) 6530
 
13.3%
Hangul
ValueCountFrequency (%)
5067
 
7.7%
4875
 
7.4%
4506
 
6.9%
4456
 
6.8%
4422
 
6.7%
4403
 
6.7%
2876
 
4.4%
2584
 
3.9%
2068
 
3.1%
1959
 
3.0%
Other values (443) 28474
43.3%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct4707
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size39.1 KiB
2023-12-12T23:57:34.163328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length42
Mean length22.145549
Min length11

Characters and Unicode

Total characters110462
Distinct characters331
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4463 ?
Unique (%)89.5%

Sample

1st row경기도 파주시 신촌동 63-32
2nd row경기도 파주시 광탄면 발랑리 404-1
3rd row경기도 파주시 동패동 591-6 번지 (가동)
4th row경기도 파주시 산남동 56-1
5th row경기도 파주시 조리읍 오산리 398-5번지
ValueCountFrequency (%)
파주시 4989
20.2%
경기도 4988
20.2%
광탄면 803
 
3.3%
탄현면 658
 
2.7%
조리읍 640
 
2.6%
월롱면 459
 
1.9%
파주읍 362
 
1.5%
분수리 248
 
1.0%
신촌동 248
 
1.0%
법원읍 239
 
1.0%
Other values (4498) 11040
44.7%
2023-12-12T23:57:34.625624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19921
18.0%
5487
 
5.0%
5440
 
4.9%
5131
 
4.6%
5001
 
4.5%
5000
 
4.5%
4990
 
4.5%
4221
 
3.8%
- 4170
 
3.8%
4064
 
3.7%
Other values (321) 47037
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 65562
59.4%
Decimal Number 20207
 
18.3%
Space Separator 19921
 
18.0%
Dash Punctuation 4170
 
3.8%
Open Punctuation 154
 
0.1%
Close Punctuation 153
 
0.1%
Other Punctuation 149
 
0.1%
Uppercase Letter 118
 
0.1%
Lowercase Letter 20
 
< 0.1%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5487
 
8.4%
5440
 
8.3%
5131
 
7.8%
5001
 
7.6%
5000
 
7.6%
4990
 
7.6%
4221
 
6.4%
4064
 
6.2%
3724
 
5.7%
2367
 
3.6%
Other values (272) 20137
30.7%
Uppercase Letter
ValueCountFrequency (%)
A 29
24.6%
B 24
20.3%
C 16
13.6%
E 6
 
5.1%
S 6
 
5.1%
P 6
 
5.1%
D 5
 
4.2%
L 5
 
4.2%
F 4
 
3.4%
T 3
 
2.5%
Other values (9) 14
11.9%
Decimal Number
ValueCountFrequency (%)
1 3835
19.0%
2 2844
14.1%
3 2453
12.1%
4 2156
10.7%
5 1897
9.4%
7 1573
7.8%
8 1454
 
7.2%
6 1440
 
7.1%
9 1293
 
6.4%
0 1262
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
s 6
30.0%
c 5
25.0%
e 2
 
10.0%
u 2
 
10.0%
k 1
 
5.0%
r 1
 
5.0%
a 1
 
5.0%
h 1
 
5.0%
n 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 132
88.6%
. 12
 
8.1%
& 3
 
2.0%
· 1
 
0.7%
/ 1
 
0.7%
Space Separator
ValueCountFrequency (%)
19921
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4170
100.0%
Open Punctuation
ValueCountFrequency (%)
( 154
100.0%
Close Punctuation
ValueCountFrequency (%)
) 153
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 65562
59.4%
Common 44762
40.5%
Latin 138
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5487
 
8.4%
5440
 
8.3%
5131
 
7.8%
5001
 
7.6%
5000
 
7.6%
4990
 
7.6%
4221
 
6.4%
4064
 
6.2%
3724
 
5.7%
2367
 
3.6%
Other values (272) 20137
30.7%
Latin
ValueCountFrequency (%)
A 29
21.0%
B 24
17.4%
C 16
11.6%
s 6
 
4.3%
E 6
 
4.3%
S 6
 
4.3%
P 6
 
4.3%
c 5
 
3.6%
D 5
 
3.6%
L 5
 
3.6%
Other values (18) 30
21.7%
Common
ValueCountFrequency (%)
19921
44.5%
- 4170
 
9.3%
1 3835
 
8.6%
2 2844
 
6.4%
3 2453
 
5.5%
4 2156
 
4.8%
5 1897
 
4.2%
7 1573
 
3.5%
8 1454
 
3.2%
6 1440
 
3.2%
Other values (11) 3019
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 65562
59.4%
ASCII 44899
40.6%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
19921
44.4%
- 4170
 
9.3%
1 3835
 
8.5%
2 2844
 
6.3%
3 2453
 
5.5%
4 2156
 
4.8%
5 1897
 
4.2%
7 1573
 
3.5%
8 1454
 
3.2%
6 1440
 
3.2%
Other values (38) 3156
 
7.0%
Hangul
ValueCountFrequency (%)
5487
 
8.4%
5440
 
8.3%
5131
 
7.8%
5001
 
7.6%
5000
 
7.6%
4990
 
7.6%
4221
 
6.4%
4064
 
6.2%
3724
 
5.7%
2367
 
3.6%
Other values (272) 20137
30.7%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct1047
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size39.1 KiB
2023-12-12T23:57:34.891728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length27
Mean length16.27085
Min length3

Characters and Unicode

Total characters81159
Distinct characters349
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique500 ?
Unique (%)10.0%

Sample

1st row오프셋 인쇄업
2nd row기타 목재가구 제조업
3rd row콘크리트 관 및 기타 구조용 콘크리트 제품 제조업 외 2 종
4th row위생용 종이제품 제조업
5th row그 외 기타 종이 및 판지 제품 제조업
ValueCountFrequency (%)
제조업 4112
 
15.7%
2441
 
9.3%
1974
 
7.6%
1897
 
7.3%
기타 1796
 
6.9%
1 1057
 
4.0%
544
 
2.1%
금속 517
 
2.0%
플라스틱 376
 
1.4%
2 366
 
1.4%
Other values (679) 11048
42.3%
2023-12-12T23:57:35.296998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21146
26.1%
5846
 
7.2%
5107
 
6.3%
4835
 
6.0%
3122
 
3.8%
2466
 
3.0%
2109
 
2.6%
1977
 
2.4%
1821
 
2.2%
1576
 
1.9%
Other values (339) 31154
38.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 57336
70.6%
Space Separator 21146
 
26.1%
Decimal Number 1970
 
2.4%
Math Symbol 665
 
0.8%
Close Punctuation 21
 
< 0.1%
Open Punctuation 21
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5846
 
10.2%
5107
 
8.9%
4835
 
8.4%
3122
 
5.4%
2466
 
4.3%
2109
 
3.7%
1977
 
3.4%
1821
 
3.2%
1576
 
2.7%
1359
 
2.4%
Other values (325) 27118
47.3%
Decimal Number
ValueCountFrequency (%)
1 1128
57.3%
2 382
 
19.4%
3 179
 
9.1%
4 96
 
4.9%
5 62
 
3.1%
6 51
 
2.6%
7 33
 
1.7%
8 18
 
0.9%
9 13
 
0.7%
0 8
 
0.4%
Space Separator
ValueCountFrequency (%)
21146
100.0%
Math Symbol
ValueCountFrequency (%)
+ 665
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 57336
70.6%
Common 23823
29.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5846
 
10.2%
5107
 
8.9%
4835
 
8.4%
3122
 
5.4%
2466
 
4.3%
2109
 
3.7%
1977
 
3.4%
1821
 
3.2%
1576
 
2.7%
1359
 
2.4%
Other values (325) 27118
47.3%
Common
ValueCountFrequency (%)
21146
88.8%
1 1128
 
4.7%
+ 665
 
2.8%
2 382
 
1.6%
3 179
 
0.8%
4 96
 
0.4%
5 62
 
0.3%
6 51
 
0.2%
7 33
 
0.1%
) 21
 
0.1%
Other values (4) 60
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 57302
70.6%
ASCII 23823
29.4%
Compat Jamo 34
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21146
88.8%
1 1128
 
4.7%
+ 665
 
2.8%
2 382
 
1.6%
3 179
 
0.8%
4 96
 
0.4%
5 62
 
0.3%
6 51
 
0.2%
7 33
 
0.1%
) 21
 
0.1%
Other values (4) 60
 
0.3%
Hangul
ValueCountFrequency (%)
5846
 
10.2%
5107
 
8.9%
4835
 
8.4%
3122
 
5.4%
2466
 
4.3%
2109
 
3.7%
1977
 
3.5%
1821
 
3.2%
1576
 
2.8%
1359
 
2.4%
Other values (324) 27084
47.3%
Compat Jamo
ValueCountFrequency (%)
34
100.0%
Distinct3213
Distinct (%)64.4%
Missing0
Missing (%)0.0%
Memory size39.1 KiB
2023-12-12T23:57:35.676781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9951885
Min length2

Characters and Unicode

Total characters49856
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2024 ?
Unique (%)40.6%

Sample

1st row2019-01-28
2nd row2020-09-14
3rd row2011-01-07
4th row2023-10-05
5th row2011-08-01
ValueCountFrequency (%)
2018-07-19 6
 
0.1%
2017-04-18 6
 
0.1%
2023-07-17 6
 
0.1%
2023-06-21 6
 
0.1%
2023-08-22 6
 
0.1%
2020-07-21 6
 
0.1%
2023-03-09 6
 
0.1%
2018-09-17 5
 
0.1%
2021-07-12 5
 
0.1%
2020-10-22 5
 
0.1%
Other values (3203) 4931
98.9%
2023-12-12T23:57:36.204289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 12363
24.8%
- 9976
20.0%
2 9346
18.7%
1 7449
14.9%
9 2200
 
4.4%
3 1750
 
3.5%
8 1490
 
3.0%
7 1461
 
2.9%
4 1326
 
2.7%
6 1290
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39880
80.0%
Dash Punctuation 9976
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 12363
31.0%
2 9346
23.4%
1 7449
18.7%
9 2200
 
5.5%
3 1750
 
4.4%
8 1490
 
3.7%
7 1461
 
3.7%
4 1326
 
3.3%
6 1290
 
3.2%
5 1205
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 9976
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 49856
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 12363
24.8%
- 9976
20.0%
2 9346
18.7%
1 7449
14.9%
9 2200
 
4.4%
3 1750
 
3.5%
8 1490
 
3.0%
7 1461
 
2.9%
4 1326
 
2.7%
6 1290
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 49856
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 12363
24.8%
- 9976
20.0%
2 9346
18.7%
1 7449
14.9%
9 2200
 
4.4%
3 1750
 
3.5%
8 1490
 
3.0%
7 1461
 
2.9%
4 1326
 
2.7%
6 1290
 
2.6%

전화번호
Text

MISSING 

Distinct3586
Distinct (%)91.6%
Missing1072
Missing (%)21.5%
Memory size39.1 KiB
2023-12-12T23:57:36.489202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.006129
Min length9

Characters and Unicode

Total characters47016
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3304 ?
Unique (%)84.4%

Sample

1st row031-949-0976
2nd row031-947-6122
3rd row031-947-6674
4th row031-945-7678
5th row02-2657-3514
ValueCountFrequency (%)
031-945-4055 6
 
0.2%
031-944-2360 5
 
0.1%
031-946-9700 4
 
0.1%
031-953-0736 4
 
0.1%
031-944-5250 4
 
0.1%
031-8071-0414 4
 
0.1%
031-922-3367 4
 
0.1%
031-948-0077 3
 
0.1%
031-934-0266 3
 
0.1%
031-946-6221 3
 
0.1%
Other values (3576) 3876
99.0%
2023-12-12T23:57:36.943399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 7810
16.6%
0 6627
14.1%
1 5747
12.2%
3 5646
12.0%
9 4795
10.2%
4 4209
9.0%
5 2903
 
6.2%
2 2653
 
5.6%
7 2475
 
5.3%
8 2154
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39206
83.4%
Dash Punctuation 7810
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6627
16.9%
1 5747
14.7%
3 5646
14.4%
9 4795
12.2%
4 4209
10.7%
5 2903
7.4%
2 2653
6.8%
7 2475
 
6.3%
8 2154
 
5.5%
6 1997
 
5.1%
Dash Punctuation
ValueCountFrequency (%)
- 7810
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 47016
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 7810
16.6%
0 6627
14.1%
1 5747
12.2%
3 5646
12.0%
9 4795
10.2%
4 4209
9.0%
5 2903
 
6.2%
2 2653
 
5.6%
7 2475
 
5.3%
8 2154
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 47016
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 7810
16.6%
0 6627
14.1%
1 5747
12.2%
3 5646
12.0%
9 4795
10.2%
4 4209
9.0%
5 2903
 
6.2%
2 2653
 
5.6%
7 2475
 
5.3%
8 2154
 
4.6%

관리기관명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size39.1 KiB
경기도 파주시청
4747 
(사)파주출판문화정보국가산업단지 입주기업체협의회
 
159
한국산업단지공단 서울지역본부 파주양주지사
 
43
(사)교하문발지방산업단지협의회
 
31
경기주택도시공사
 
7

Length

Max length26
Median length8
Mean length8.7431836
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row경기도 파주시청
2nd row경기도 파주시청
3rd row경기도 파주시청
4th row경기도 파주시청
5th row경기도 파주시청

Common Values

ValueCountFrequency (%)
경기도 파주시청 4747
95.2%
(사)파주출판문화정보국가산업단지 입주기업체협의회 159
 
3.2%
한국산업단지공단 서울지역본부 파주양주지사 43
 
0.9%
(사)교하문발지방산업단지협의회 31
 
0.6%
경기주택도시공사 7
 
0.1%
경기도 1
 
< 0.1%

Length

2023-12-12T23:57:37.409537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:57:37.546985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 4748
47.6%
파주시청 4747
47.6%
사)파주출판문화정보국가산업단지 159
 
1.6%
입주기업체협의회 159
 
1.6%
한국산업단지공단 43
 
0.4%
서울지역본부 43
 
0.4%
파주양주지사 43
 
0.4%
사)교하문발지방산업단지협의회 31
 
0.3%
경기주택도시공사 7
 
0.1%

관리기관전화번호
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size39.1 KiB
031-940-4541
1639 
031-940-4543
1058 
031-940-4544
915 
031-940-4542
819 
031-940-5336
173 
Other values (18)
384 

Length

Max length13
Median length12
Mean length12.003408
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row031-940-4541
2nd row031-940-4543
3rd row031-940-4542
4th row031-940-4542
5th row031-940-4541

Common Values

ValueCountFrequency (%)
031-940-4541 1639
32.9%
031-940-4543 1058
21.2%
031-940-4544 915
18.3%
031-940-4542 819
16.4%
031-940-5336 173
 
3.5%
031-940-4532 68
 
1.4%
070-8895-7967 53
 
1.1%
031-955-0039 53
 
1.1%
031-940-5335 41
 
0.8%
031-940-4545 29
 
0.6%
Other values (13) 140
 
2.8%

Length

2023-12-12T23:57:37.706433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
031-940-4541 1639
32.9%
031-940-4543 1058
21.2%
031-940-4544 915
18.3%
031-940-4542 819
16.4%
031-940-5336 173
 
3.5%
031-940-4532 68
 
1.4%
070-8895-7967 53
 
1.1%
031-955-0039 53
 
1.1%
031-940-5335 41
 
0.8%
031-940-4545 29
 
0.6%
Other values (13) 140
 
2.8%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.1 KiB
Minimum2023-11-20 00:00:00
Maximum2023-11-20 00:00:00
2023-12-12T23:57:37.821090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:57:37.935286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-12T23:57:38.016948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리기관명관리기관전화번호
관리기관명1.0000.969
관리기관전화번호0.9691.000
2023-12-12T23:57:38.114486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리기관명관리기관전화번호
관리기관명1.0000.873
관리기관전화번호0.8731.000
2023-12-12T23:57:38.210912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리기관명관리기관전화번호
관리기관명1.0000.873
관리기관전화번호0.8731.000

Missing values

2023-12-12T23:57:32.048949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:57:32.207228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T23:57:32.340456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

상호명소재지도로명주소소재지지번주소업종명최초등록일전화번호관리기관명관리기관전화번호데이터기준일자
0(사)내일을여는멋진여성 중증장애인사업단경기도 파주시 지목로75번길 14경기도 파주시 신촌동 63-32오프셋 인쇄업2019-01-28031-949-0976경기도 파주시청031-940-45412023-11-20
1(사)대한문화체육교육협회 가구사업부경기도 파주시 광탄면 부흥로359번길 190경기도 파주시 광탄면 발랑리 404-1기타 목재가구 제조업2020-09-14031-947-6122경기도 파주시청031-940-45432023-11-20
2(사)부산지체장애인단체협의회 제1공장경기도 파주시 교하로505번길 16-27경기도 파주시 동패동 591-6 번지 (가동)콘크리트 관 및 기타 구조용 콘크리트 제품 제조업 외 2 종2011-01-07031-947-6674경기도 파주시청031-940-45422023-11-20
3(사)장애인고용진흥회 파주사업소경기도 파주시 산남로157번길 30 (산남동)경기도 파주시 산남동 56-1위생용 종이제품 제조업2023-10-05031-945-7678경기도 파주시청031-940-45422023-11-20
4(사)한국검정교과서경기도 파주시 조리읍 당재봉로 29-28 (한국검정교과서)경기도 파주시 조리읍 오산리 398-5번지그 외 기타 종이 및 판지 제품 제조업2011-08-0102-2657-3514경기도 파주시청031-940-45412023-11-20
5(사)한국기능장애인협회 유니크퍼니쳐경기도 파주시 법원읍 화합로 514-1경기도 파주시 법원읍 갈곡리 67-1기타 목재가구 제조업 외 5 종2021-05-20<NA>경기도 파주시청031-940-45442023-11-20
6(사)한국장애인기업협회 울터미경기도 파주시 파주읍 여울길 310-8경기도 파주시 파주읍 부곡리 54-3 다동그 외 기타 분류 안된 비금속 광물제품 제조업2022-08-12<NA>경기도 파주시청031-940-45422023-11-20
7(사)한국장애인미래정책포럼 산전사업단경기도 파주시 광탄면 보광로 1541경기도 파주시 광탄면 방축리 190번지공기 조화장치 제조업 외 6 종2020-02-04031-945-7602경기도 파주시청031-940-45432023-11-20
8(사)한국장애인협회 기프트센터경기도 파주시 지목로 89-28 (신촌동)경기도 파주시 신촌동 61번지문구용 종이제품 제조업 외 2 종2019-02-14031-949-3733경기도 파주시청031-940-45422023-11-20
9(유)가원경기도 파주시 광탄면 명봉산로352번길 34-19 외 1필지경기도 파주시 광탄면 용미리 585-4비내화 모르타르 제조업 외 1 종2023-01-20070-8845-3701경기도 파주시청031-940-45432023-11-20
상호명소재지도로명주소소재지지번주소업종명최초등록일전화번호관리기관명관리기관전화번호데이터기준일자
4978효창경기도 파주시 광탄면 혜음로 666-16 (효창화학) (총 2 필지)경기도 파주시 광탄면 용미리 632-2번지치약+비누 및 기타 세제 제조업2006-02-13031-948-6013경기도 파주시청031-940-45432023-11-20
4979효형출판경기도 파주시 회동길 125-11 (문발동, 효형출판)경기도 파주시 문발동 532-2번지일반 서적 출판업2003-12-23031-955-7600(사)파주출판문화정보국가산업단지 입주기업체협의회031-955-00392023-11-20
4980휴그린(주)경기도 파주시 광탄면 혜음로 640-19 외 3필지경기도 파주시 광탄면 용미리 452-1사무용 기계 및 장비 제조업 외 2 종2016-02-02031-948-3784경기도 파주시청031-940-45432023-11-20
4981휴그린보드경기도 파주시 광탄면 보광로 1533경기도 파주시 광탄면 방축리 193-6번지사무용 기계 및 장비 제조업 외 1 종2012-12-27<NA>경기도 파주시청031-940-45412023-11-20
4982휴먼사이언스경기도 파주시 조리읍 명봉산로114번길 21경기도 파주시 조리읍 장곡리 598-25번지전시용 모형 제조업 외 2 종2013-12-1802-845-6363경기도 파주시청031-940-45432023-11-20
4983흥성원(주)경기도 파주시 광탄면 장지산로200번길 59 (총 3 필지) 외 2필지경기도 파주시 광탄면 분수리 2-8번지플라스틱 창호 제조업2014-02-07031-943-0353경기도 파주시청031-940-45432023-11-20
4984흥성윈(주)경기도 파주시 광탄면 장지산로200번길 59 (총 3 필지) 외 2필지경기도 파주시 광탄면 분수리 2-8번지플라스틱 창호 제조업2014-02-07031-943-0353경기도 파주시청031-940-45432023-11-20
4985희망테크놀로지경기도 파주시 하우4길 26-1 (상지석동)경기도 파주시 상지석동 554-46번지배전반 및 전기 자동제어반 제조업2020-12-101877-5604경기도 파주시청031-940-45412023-11-20
4986희스토리경기도 파주시 월롱면 누현1길 35경기도 파주시 월롱면 위전리 531-9번지포장용 플라스틱 성형용기 제조업2014-06-12<NA>경기도 파주시청031-940-45322023-11-20
4987히트텍경기도 파주시 법원읍 사임당로556번길 119-20경기도 파주시 법원읍 동문리 537번지절연 코드세트 및 기타 도체 제조업2013-06-24031-958-2282경기도 파주시청031-940-45322023-11-20