Overview

Dataset statistics

Number of variables6
Number of observations631
Missing cells25
Missing cells (%)0.7%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory29.7 KiB
Average record size in memory48.2 B

Variable types

Categorical2
Text3
DateTime1

Dataset

Description경기도 김포시 출판사 및 인쇄소(업체구분, 업체명, 소재지도로명주소, 소재지지번주소, 인허가년월일, 데이터기준일자)의 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15038222/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (0.2%) duplicate rowsDuplicates
소재지도로명주소 has 19 (3.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 13:07:16.154390
Analysis finished2023-12-12 13:07:16.946636
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체구분
Categorical

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
출판사
536 
인쇄사
95 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 536
84.9%
인쇄사 95
 
15.1%

Length

2023-12-12T22:07:17.005025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:07:17.094158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 536
84.9%
인쇄사 95
 
15.1%
Distinct607
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-12-12T22:07:17.314262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length23
Mean length6.7179081
Min length2

Characters and Unicode

Total characters4239
Distinct characters511
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique584 ?
Unique (%)92.6%

Sample

1st row로직큐브
2nd row김포출판사
3rd row모드니에
4th row김포대학출판부
5th row해령출판사
ValueCountFrequency (%)
도서출판 47
 
5.6%
주식회사 34
 
4.1%
출판사 7
 
0.8%
스튜디오 6
 
0.7%
북스 4
 
0.5%
출판부 3
 
0.4%
창공디자인 3
 
0.4%
연구소 2
 
0.2%
global 2
 
0.2%
이오클래식 2
 
0.2%
Other values (698) 726
86.8%
2023-12-12T22:07:17.677906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
206
 
4.9%
119
 
2.8%
118
 
2.8%
105
 
2.5%
) 98
 
2.3%
( 97
 
2.3%
95
 
2.2%
90
 
2.1%
89
 
2.1%
66
 
1.6%
Other values (501) 3156
74.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3299
77.8%
Lowercase Letter 330
 
7.8%
Space Separator 206
 
4.9%
Uppercase Letter 181
 
4.3%
Close Punctuation 98
 
2.3%
Open Punctuation 97
 
2.3%
Decimal Number 16
 
0.4%
Other Punctuation 12
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
119
 
3.6%
118
 
3.6%
105
 
3.2%
95
 
2.9%
90
 
2.7%
89
 
2.7%
66
 
2.0%
66
 
2.0%
62
 
1.9%
58
 
1.8%
Other values (443) 2431
73.7%
Uppercase Letter
ValueCountFrequency (%)
C 16
 
8.8%
N 14
 
7.7%
M 13
 
7.2%
D 12
 
6.6%
O 12
 
6.6%
H 11
 
6.1%
R 11
 
6.1%
I 10
 
5.5%
A 10
 
5.5%
S 9
 
5.0%
Other values (14) 63
34.8%
Lowercase Letter
ValueCountFrequency (%)
o 40
12.1%
e 37
11.2%
n 29
 
8.8%
a 28
 
8.5%
s 26
 
7.9%
i 24
 
7.3%
t 19
 
5.8%
r 19
 
5.8%
l 15
 
4.5%
m 14
 
4.2%
Other values (12) 79
23.9%
Decimal Number
ValueCountFrequency (%)
1 7
43.8%
2 4
25.0%
0 2
 
12.5%
4 1
 
6.2%
8 1
 
6.2%
3 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
& 7
58.3%
. 4
33.3%
/ 1
 
8.3%
Space Separator
ValueCountFrequency (%)
206
100.0%
Close Punctuation
ValueCountFrequency (%)
) 98
100.0%
Open Punctuation
ValueCountFrequency (%)
( 97
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3299
77.8%
Latin 511
 
12.1%
Common 429
 
10.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
119
 
3.6%
118
 
3.6%
105
 
3.2%
95
 
2.9%
90
 
2.7%
89
 
2.7%
66
 
2.0%
66
 
2.0%
62
 
1.9%
58
 
1.8%
Other values (443) 2431
73.7%
Latin
ValueCountFrequency (%)
o 40
 
7.8%
e 37
 
7.2%
n 29
 
5.7%
a 28
 
5.5%
s 26
 
5.1%
i 24
 
4.7%
t 19
 
3.7%
r 19
 
3.7%
C 16
 
3.1%
l 15
 
2.9%
Other values (36) 258
50.5%
Common
ValueCountFrequency (%)
206
48.0%
) 98
22.8%
( 97
22.6%
1 7
 
1.6%
& 7
 
1.6%
. 4
 
0.9%
2 4
 
0.9%
0 2
 
0.5%
4 1
 
0.2%
/ 1
 
0.2%
Other values (2) 2
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3299
77.8%
ASCII 940
 
22.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
206
21.9%
) 98
 
10.4%
( 97
 
10.3%
o 40
 
4.3%
e 37
 
3.9%
n 29
 
3.1%
a 28
 
3.0%
s 26
 
2.8%
i 24
 
2.6%
t 19
 
2.0%
Other values (48) 336
35.7%
Hangul
ValueCountFrequency (%)
119
 
3.6%
118
 
3.6%
105
 
3.2%
95
 
2.9%
90
 
2.7%
89
 
2.7%
66
 
2.0%
66
 
2.0%
62
 
1.9%
58
 
1.8%
Other values (443) 2431
73.7%
Distinct577
Distinct (%)94.3%
Missing19
Missing (%)3.0%
Memory size5.1 KiB
2023-12-12T22:07:17.946603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length45
Mean length37.482026
Min length17

Characters and Unicode

Total characters22939
Distinct characters294
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique546 ?
Unique (%)89.2%

Sample

1st row경기도 김포시 양촌읍 양곡1로24번길 34-10
2nd row경기도 김포시 통진읍 조강로 58
3rd row경기도 김포시 월곶면 김포대학로 97
4th row경기도 김포시 통진읍 월하로548번길 80
5th row경기도 김포시 고촌읍 인향로 92-43
ValueCountFrequency (%)
경기도 611
 
13.7%
김포시 611
 
13.7%
장기동 94
 
2.1%
풍무동 75
 
1.7%
구래동 58
 
1.3%
양촌읍 57
 
1.3%
고촌읍 57
 
1.3%
운양동 55
 
1.2%
김포한강11로 36
 
0.8%
사우동 34
 
0.8%
Other values (1066) 2767
62.1%
2023-12-12T22:07:18.431101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3874
 
16.9%
1 1126
 
4.9%
888
 
3.9%
855
 
3.7%
727
 
3.2%
0 721
 
3.1%
716
 
3.1%
2 698
 
3.0%
, 676
 
2.9%
676
 
2.9%
Other values (284) 11982
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12266
53.5%
Decimal Number 4961
21.6%
Space Separator 3874
 
16.9%
Other Punctuation 676
 
2.9%
Open Punctuation 476
 
2.1%
Close Punctuation 476
 
2.1%
Dash Punctuation 141
 
0.6%
Lowercase Letter 35
 
0.2%
Uppercase Letter 33
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
888
 
7.2%
855
 
7.0%
727
 
5.9%
716
 
5.8%
676
 
5.5%
667
 
5.4%
626
 
5.1%
625
 
5.1%
440
 
3.6%
306
 
2.5%
Other values (249) 5740
46.8%
Uppercase Letter
ValueCountFrequency (%)
G 5
15.2%
T 5
15.2%
C 4
12.1%
K 3
9.1%
H 3
9.1%
L 2
 
6.1%
B 2
 
6.1%
A 2
 
6.1%
J 1
 
3.0%
F 1
 
3.0%
Other values (5) 5
15.2%
Decimal Number
ValueCountFrequency (%)
1 1126
22.7%
0 721
14.5%
2 698
14.1%
3 521
10.5%
4 389
 
7.8%
5 374
 
7.5%
7 325
 
6.6%
8 295
 
5.9%
6 293
 
5.9%
9 219
 
4.4%
Lowercase Letter
ValueCountFrequency (%)
e 20
57.1%
o 5
 
14.3%
w 5
 
14.3%
r 5
 
14.3%
Space Separator
ValueCountFrequency (%)
3874
100.0%
Other Punctuation
ValueCountFrequency (%)
, 676
100.0%
Open Punctuation
ValueCountFrequency (%)
( 476
100.0%
Close Punctuation
ValueCountFrequency (%)
) 476
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 141
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12266
53.5%
Common 10605
46.2%
Latin 68
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
888
 
7.2%
855
 
7.0%
727
 
5.9%
716
 
5.8%
676
 
5.5%
667
 
5.4%
626
 
5.1%
625
 
5.1%
440
 
3.6%
306
 
2.5%
Other values (249) 5740
46.8%
Latin
ValueCountFrequency (%)
e 20
29.4%
G 5
 
7.4%
T 5
 
7.4%
o 5
 
7.4%
w 5
 
7.4%
r 5
 
7.4%
C 4
 
5.9%
K 3
 
4.4%
H 3
 
4.4%
L 2
 
2.9%
Other values (9) 11
16.2%
Common
ValueCountFrequency (%)
3874
36.5%
1 1126
 
10.6%
0 721
 
6.8%
2 698
 
6.6%
, 676
 
6.4%
3 521
 
4.9%
( 476
 
4.5%
) 476
 
4.5%
4 389
 
3.7%
5 374
 
3.5%
Other values (6) 1274
 
12.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12266
53.5%
ASCII 10673
46.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3874
36.3%
1 1126
 
10.5%
0 721
 
6.8%
2 698
 
6.5%
, 676
 
6.3%
3 521
 
4.9%
( 476
 
4.5%
) 476
 
4.5%
4 389
 
3.6%
5 374
 
3.5%
Other values (25) 1342
 
12.6%
Hangul
ValueCountFrequency (%)
888
 
7.2%
855
 
7.0%
727
 
5.9%
716
 
5.8%
676
 
5.5%
667
 
5.4%
626
 
5.1%
625
 
5.1%
440
 
3.6%
306
 
2.5%
Other values (249) 5740
46.8%
Distinct521
Distinct (%)83.4%
Missing6
Missing (%)1.0%
Memory size5.1 KiB
2023-12-12T22:07:18.679020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length41
Mean length27.1984
Min length12

Characters and Unicode

Total characters16999
Distinct characters279
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique451 ?
Unique (%)72.2%

Sample

1st row경기도 김포시 양촌읍 구래리 432-4 제일골드빌라 B동 201호
2nd row경기도 김포시 통진읍 서암리 720-5
3rd row경기도 김포시 사우동 134-1
4th row경기도 김포시 월곶면 포내리 111-9
5th row경기도 김포시 통진읍 귀전리 675-2
ValueCountFrequency (%)
경기도 624
 
17.4%
김포시 624
 
17.4%
장기동 97
 
2.7%
풍무동 77
 
2.1%
운양동 58
 
1.6%
고촌읍 57
 
1.6%
구래동 57
 
1.6%
양촌읍 57
 
1.6%
사우동 43
 
1.2%
신곡리 38
 
1.1%
Other values (885) 1850
51.6%
2023-12-12T22:07:19.066138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3414
20.1%
725
 
4.3%
1 680
 
4.0%
677
 
4.0%
677
 
4.0%
670
 
3.9%
658
 
3.9%
638
 
3.8%
589
 
3.5%
2 523
 
3.1%
Other values (269) 7748
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9358
55.1%
Decimal Number 3739
 
22.0%
Space Separator 3414
 
20.1%
Dash Punctuation 401
 
2.4%
Uppercase Letter 36
 
0.2%
Lowercase Letter 34
 
0.2%
Close Punctuation 6
 
< 0.1%
Open Punctuation 6
 
< 0.1%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
725
 
7.7%
677
 
7.2%
677
 
7.2%
670
 
7.2%
658
 
7.0%
638
 
6.8%
589
 
6.3%
235
 
2.5%
215
 
2.3%
163
 
1.7%
Other values (236) 4111
43.9%
Uppercase Letter
ValueCountFrequency (%)
A 5
13.9%
T 5
13.9%
G 5
13.9%
K 4
11.1%
C 4
11.1%
B 3
8.3%
L 2
 
5.6%
H 2
 
5.6%
F 1
 
2.8%
J 1
 
2.8%
Other values (4) 4
11.1%
Decimal Number
ValueCountFrequency (%)
1 680
18.2%
2 523
14.0%
0 497
13.3%
3 364
9.7%
8 334
8.9%
6 310
8.3%
4 285
7.6%
5 280
7.5%
7 264
 
7.1%
9 202
 
5.4%
Lowercase Letter
ValueCountFrequency (%)
e 19
55.9%
r 5
 
14.7%
w 5
 
14.7%
o 5
 
14.7%
Space Separator
ValueCountFrequency (%)
3414
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 401
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9358
55.1%
Common 7571
44.5%
Latin 70
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
725
 
7.7%
677
 
7.2%
677
 
7.2%
670
 
7.2%
658
 
7.0%
638
 
6.8%
589
 
6.3%
235
 
2.5%
215
 
2.3%
163
 
1.7%
Other values (236) 4111
43.9%
Latin
ValueCountFrequency (%)
e 19
27.1%
A 5
 
7.1%
T 5
 
7.1%
r 5
 
7.1%
w 5
 
7.1%
G 5
 
7.1%
o 5
 
7.1%
K 4
 
5.7%
C 4
 
5.7%
B 3
 
4.3%
Other values (8) 10
14.3%
Common
ValueCountFrequency (%)
3414
45.1%
1 680
 
9.0%
2 523
 
6.9%
0 497
 
6.6%
- 401
 
5.3%
3 364
 
4.8%
8 334
 
4.4%
6 310
 
4.1%
4 285
 
3.8%
5 280
 
3.7%
Other values (5) 483
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9358
55.1%
ASCII 7641
44.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3414
44.7%
1 680
 
8.9%
2 523
 
6.8%
0 497
 
6.5%
- 401
 
5.2%
3 364
 
4.8%
8 334
 
4.4%
6 310
 
4.1%
4 285
 
3.7%
5 280
 
3.7%
Other values (23) 553
 
7.2%
Hangul
ValueCountFrequency (%)
725
 
7.7%
677
 
7.2%
677
 
7.2%
670
 
7.2%
658
 
7.0%
638
 
6.8%
589
 
6.3%
235
 
2.5%
215
 
2.3%
163
 
1.7%
Other values (236) 4111
43.9%
Distinct544
Distinct (%)86.2%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
Minimum1961-12-28 00:00:00
Maximum2023-07-26 00:00:00
2023-12-12T22:07:19.217750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:07:19.353774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-08-16
631 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-16
2nd row2023-08-16
3rd row2023-08-16
4th row2023-08-16
5th row2023-08-16

Common Values

ValueCountFrequency (%)
2023-08-16 631
100.0%

Length

2023-12-12T22:07:19.491539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:07:19.577409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-16 631
100.0%

Missing values

2023-12-12T22:07:16.730706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:07:16.823281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T22:07:16.902172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체구분업체명소재지도로명주소소재지지번주소인허가년월일데이터기준일자
0출판사로직큐브경기도 김포시 양촌읍 양곡1로24번길 34-10경기도 김포시 양촌읍 구래리 432-4 제일골드빌라 B동 201호2002-06-262023-08-16
1출판사김포출판사경기도 김포시 통진읍 조강로 58경기도 김포시 통진읍 서암리 720-52003-02-062023-08-16
2출판사모드니에<NA>경기도 김포시 사우동 134-12003-02-062023-08-16
3출판사김포대학출판부경기도 김포시 월곶면 김포대학로 97경기도 김포시 월곶면 포내리 111-92003-02-172023-08-16
4출판사해령출판사경기도 김포시 통진읍 월하로548번길 80경기도 김포시 통진읍 귀전리 675-22003-02-172023-08-16
5출판사김포의소리<NA>경기도 김포시 풍무동 71-22003-02-172023-08-16
6출판사샘물출판사경기도 김포시 고촌읍 인향로 92-43경기도 김포시 고촌읍 신곡리 949-12003-02-172023-08-16
7출판사도서출판쁄라경기도 김포시 풍무로96번길 11 (풍무동)경기도 김포시 풍무동 405-22003-02-172023-08-16
8출판사무의도경기도 김포시 통진읍 고척로228번길 48경기도 김포시 통진읍 고정리 295-42003-02-172023-08-16
9출판사도서출판드림<NA>경기도 김포시 운양동 398-12003-02-172023-08-16
업체구분업체명소재지도로명주소소재지지번주소인허가년월일데이터기준일자
621인쇄사sc미디어경기도 김포시 고촌읍 아라육로58번길 186, 402호경기도 김포시 고촌읍 전호리 737 402호2021-02-092023-08-16
622인쇄사킹콩프린팅파크경기도 김포시 통진읍 가현로 41경기도 김포시 통진읍 가현리 764-472022-01-122023-08-16
623인쇄사(주)직지피앤디경기도 김포시 통진읍 가현로 41경기도 김포시 통진읍 가현리 764-472022-04-192023-08-16
624인쇄사에이치디프린팅경기도 김포시 고촌읍 아라육로57번길 70, 합동경동물류김포고촌물류센터 4층경기도 김포시 고촌읍 전호리 745 합동경동물류김포고촌물류센터2022-04-202023-08-16
625인쇄사주식회사 아이콘커뮤니케이션경기도 김포시 태장로 765, 금강테크노밸리지식산업센터 437호 (장기동)경기도 김포시 장기동 2008-1 금강테크노밸리지식산업센터 437호2022-05-312023-08-16
626인쇄사지엘팩토리경기도 김포시 김포한강10로133번길 127, 디원시티 지식산업센터 941호 (구래동)경기도 김포시 구래동 6871-72022-06-222023-08-16
627인쇄사(주)세림피엔지경기도 김포시 통진읍 귀전로 13-5경기도 김포시 통진읍 귀전리 172-52022-06-242023-08-16
628인쇄사(주)나무이야기경기도 김포시 태장로 755, 김포 한강신도시 GTower지식산업센터 425호 (장기동)경기도 김포시 장기동 2008-2 김포 한강신도시 GTower지식산업센터2022-07-082023-08-16
629인쇄사(주)모아경기도 김포시 고촌읍 상미로10번길 104-40경기도 김포시 고촌읍 향산리 155-42022-09-142023-08-16
630인쇄사해드림산업주식회사 김포지점경기도 김포시 대곶면 대곶서로 292-20경기도 김포시 대곶면 율생리 648-142023-01-022023-08-16

Duplicate rows

Most frequently occurring

업체구분업체명소재지도로명주소소재지지번주소인허가년월일데이터기준일자# duplicates
0출판사창공디자인경기도 김포시 사우중로3번길 48, 1층 103호 (사우동)경기도 김포시 사우동 238-72018-03-192023-08-162