Overview

Dataset statistics

Number of variables6
Number of observations68
Missing cells46
Missing cells (%)11.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.3 KiB
Average record size in memory49.9 B

Variable types

DateTime2
Text4

Dataset

Description경기도 김포시 안경업소의 대한 데이터입니다.(업소명, 도로명주소, 지번주소, 인허가년월일, 전화번호, 데이터기준일자)
URLhttps://www.data.go.kr/data/15038218/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
지번주소 has 10 (14.7%) missing valuesMissing
사업장전화번호 has 36 (52.9%) missing valuesMissing
인허가년월일 has unique valuesUnique
도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 00:34:18.993365
Analysis finished2023-12-12 00:34:19.575203
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인허가년월일
Date

UNIQUE 

Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size676.0 B
Minimum1991-04-24 00:00:00
Maximum2022-03-16 00:00:00
2023-12-12T09:34:19.650786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:34:19.768310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct67
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-12T09:34:20.014885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length11
Mean length7.1911765
Min length4

Characters and Unicode

Total characters489
Distinct characters131
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)97.1%

Sample

1st row아이비젼
2nd row안경발전소
3rd row으뜸플러스사우점
4th row안경진정성 김포운양점
5th row안경, 진정성 걸포북변점
ValueCountFrequency (%)
안경박사 3
 
3.8%
안경 3
 
3.8%
김포장기점 3
 
3.8%
아이피아 2
 
2.5%
진정성 2
 
2.5%
아이파파 1
 
1.2%
밝은세상안경고촌점 1
 
1.2%
김포점 1
 
1.2%
다비치안경원 1
 
1.2%
안경창고콘택트 1
 
1.2%
Other values (62) 62
77.5%
2023-12-12T09:34:20.405597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51
 
10.4%
51
 
10.4%
23
 
4.7%
18
 
3.7%
17
 
3.5%
17
 
3.5%
16
 
3.3%
12
 
2.5%
10
 
2.0%
7
 
1.4%
Other values (121) 267
54.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 447
91.4%
Space Separator 12
 
2.5%
Uppercase Letter 11
 
2.2%
Decimal Number 10
 
2.0%
Lowercase Letter 3
 
0.6%
Other Punctuation 2
 
0.4%
Open Punctuation 2
 
0.4%
Close Punctuation 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
11.4%
51
 
11.4%
23
 
5.1%
18
 
4.0%
17
 
3.8%
17
 
3.8%
16
 
3.6%
10
 
2.2%
7
 
1.6%
7
 
1.6%
Other values (103) 230
51.5%
Uppercase Letter
ValueCountFrequency (%)
L 2
18.2%
A 2
18.2%
S 2
18.2%
K 1
9.1%
O 1
9.1%
P 1
9.1%
G 1
9.1%
Y 1
9.1%
Decimal Number
ValueCountFrequency (%)
0 4
40.0%
1 3
30.0%
5 2
20.0%
2 1
 
10.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
66.7%
y 1
33.3%
Space Separator
ValueCountFrequency (%)
12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 447
91.4%
Common 28
 
5.7%
Latin 14
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
11.4%
51
 
11.4%
23
 
5.1%
18
 
4.0%
17
 
3.8%
17
 
3.8%
16
 
3.6%
10
 
2.2%
7
 
1.6%
7
 
1.6%
Other values (103) 230
51.5%
Latin
ValueCountFrequency (%)
L 2
14.3%
A 2
14.3%
S 2
14.3%
e 2
14.3%
K 1
7.1%
O 1
7.1%
y 1
7.1%
P 1
7.1%
G 1
7.1%
Y 1
7.1%
Common
ValueCountFrequency (%)
12
42.9%
0 4
 
14.3%
1 3
 
10.7%
5 2
 
7.1%
, 2
 
7.1%
( 2
 
7.1%
) 2
 
7.1%
2 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 447
91.4%
ASCII 42
 
8.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
51
 
11.4%
51
 
11.4%
23
 
5.1%
18
 
4.0%
17
 
3.8%
17
 
3.8%
16
 
3.6%
10
 
2.2%
7
 
1.6%
7
 
1.6%
Other values (103) 230
51.5%
ASCII
ValueCountFrequency (%)
12
28.6%
0 4
 
9.5%
1 3
 
7.1%
L 2
 
4.8%
A 2
 
4.8%
S 2
 
4.8%
5 2
 
4.8%
e 2
 
4.8%
, 2
 
4.8%
( 2
 
4.8%
Other values (8) 9
21.4%

지번주소
Text

MISSING 

Distinct58
Distinct (%)100.0%
Missing10
Missing (%)14.7%
Memory size676.0 B
2023-12-12T09:34:20.923717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length33
Mean length24.517241
Min length16

Characters and Unicode

Total characters1422
Distinct characters105
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)100.0%

Sample

1st row경기도 김포시 장기동 2077-2
2nd row경기도 김포시 고촌읍 신곡리 880-1
3rd row경기도 김포시 사우동 928 김포아트프라자
4th row경기도 김포시 운양동 1296-9 에이스프라자
5th row경기도 김포시 걸포동 268-2
ValueCountFrequency (%)
경기도 58
 
17.8%
김포시 58
 
17.8%
풍무동 8
 
2.5%
구래동 8
 
2.5%
사우동 7
 
2.2%
장기동 7
 
2.2%
고촌읍 6
 
1.8%
5호 5
 
1.5%
신곡리 5
 
1.5%
3호 5
 
1.5%
Other values (110) 158
48.6%
2023-12-12T09:34:21.349202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
268
18.8%
1 67
 
4.7%
65
 
4.6%
61
 
4.3%
60
 
4.2%
59
 
4.1%
58
 
4.1%
58
 
4.1%
49
 
3.4%
48
 
3.4%
Other values (95) 629
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 833
58.6%
Decimal Number 302
 
21.2%
Space Separator 268
 
18.8%
Dash Punctuation 11
 
0.8%
Other Punctuation 4
 
0.3%
Uppercase Letter 2
 
0.1%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
7.8%
61
 
7.3%
60
 
7.2%
59
 
7.1%
58
 
7.0%
58
 
7.0%
49
 
5.9%
48
 
5.8%
45
 
5.4%
43
 
5.2%
Other values (77) 287
34.5%
Decimal Number
ValueCountFrequency (%)
1 67
22.2%
0 36
11.9%
3 32
10.6%
8 32
10.6%
5 29
9.6%
9 26
 
8.6%
6 25
 
8.3%
2 25
 
8.3%
7 18
 
6.0%
4 12
 
4.0%
Other Punctuation
ValueCountFrequency (%)
, 3
75.0%
. 1
 
25.0%
Uppercase Letter
ValueCountFrequency (%)
F 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
268
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 833
58.6%
Common 587
41.3%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
7.8%
61
 
7.3%
60
 
7.2%
59
 
7.1%
58
 
7.0%
58
 
7.0%
49
 
5.9%
48
 
5.8%
45
 
5.4%
43
 
5.2%
Other values (77) 287
34.5%
Common
ValueCountFrequency (%)
268
45.7%
1 67
 
11.4%
0 36
 
6.1%
3 32
 
5.5%
8 32
 
5.5%
5 29
 
4.9%
9 26
 
4.4%
6 25
 
4.3%
2 25
 
4.3%
7 18
 
3.1%
Other values (6) 29
 
4.9%
Latin
ValueCountFrequency (%)
F 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 833
58.6%
ASCII 589
41.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
268
45.5%
1 67
 
11.4%
0 36
 
6.1%
3 32
 
5.4%
8 32
 
5.4%
5 29
 
4.9%
9 26
 
4.4%
6 25
 
4.2%
2 25
 
4.2%
7 18
 
3.1%
Other values (8) 31
 
5.3%
Hangul
ValueCountFrequency (%)
65
 
7.8%
61
 
7.3%
60
 
7.2%
59
 
7.1%
58
 
7.0%
58
 
7.0%
49
 
5.9%
48
 
5.8%
45
 
5.4%
43
 
5.2%
Other values (77) 287
34.5%

도로명주소
Text

UNIQUE 

Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-12T09:34:21.662517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length38
Mean length30.308824
Min length18

Characters and Unicode

Total characters2061
Distinct characters132
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)100.0%

Sample

1st row경기도 김포시 태장로 800, 201호 (장기동)
2nd row경기도 김포시 고촌읍 수기로 109, 107호
3rd row경기도 김포시 사우중로 52, 김포아트프라자 201호 (사우동)
4th row경기도 김포시 김포한강11로 288-31, 에이스프라자 102,103호 (운양동)
5th row경기도 김포시 걸포2로 33, 104호 (걸포동)
ValueCountFrequency (%)
경기도 68
 
16.0%
김포시 68
 
16.0%
장기동 11
 
2.6%
김포대로 9
 
2.1%
구래동 9
 
2.1%
사우동 8
 
1.9%
1층 8
 
1.9%
풍무동 8
 
1.9%
김포한강4로 7
 
1.6%
고촌읍 6
 
1.4%
Other values (161) 223
52.5%
2023-12-12T09:34:22.119364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
357
 
17.3%
1 120
 
5.8%
109
 
5.3%
104
 
5.0%
81
 
3.9%
69
 
3.3%
69
 
3.3%
68
 
3.3%
68
 
3.3%
, 68
 
3.3%
Other values (122) 948
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1129
54.8%
Decimal Number 384
 
18.6%
Space Separator 357
 
17.3%
Other Punctuation 69
 
3.3%
Close Punctuation 53
 
2.6%
Open Punctuation 53
 
2.6%
Uppercase Letter 9
 
0.4%
Dash Punctuation 6
 
0.3%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
109
 
9.7%
104
 
9.2%
81
 
7.2%
69
 
6.1%
69
 
6.1%
68
 
6.0%
68
 
6.0%
52
 
4.6%
35
 
3.1%
28
 
2.5%
Other values (99) 446
39.5%
Decimal Number
ValueCountFrequency (%)
1 120
31.2%
0 53
13.8%
2 45
 
11.7%
3 34
 
8.9%
8 26
 
6.8%
4 26
 
6.8%
7 24
 
6.2%
5 22
 
5.7%
9 20
 
5.2%
6 14
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
B 4
44.4%
W 1
 
11.1%
E 1
 
11.1%
S 1
 
11.1%
T 1
 
11.1%
F 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 68
98.6%
. 1
 
1.4%
Space Separator
ValueCountFrequency (%)
357
100.0%
Close Punctuation
ValueCountFrequency (%)
) 53
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1129
54.8%
Common 922
44.7%
Latin 10
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
109
 
9.7%
104
 
9.2%
81
 
7.2%
69
 
6.1%
69
 
6.1%
68
 
6.0%
68
 
6.0%
52
 
4.6%
35
 
3.1%
28
 
2.5%
Other values (99) 446
39.5%
Common
ValueCountFrequency (%)
357
38.7%
1 120
 
13.0%
, 68
 
7.4%
) 53
 
5.7%
( 53
 
5.7%
0 53
 
5.7%
2 45
 
4.9%
3 34
 
3.7%
8 26
 
2.8%
4 26
 
2.8%
Other values (6) 87
 
9.4%
Latin
ValueCountFrequency (%)
B 4
40.0%
1
 
10.0%
W 1
 
10.0%
E 1
 
10.0%
S 1
 
10.0%
T 1
 
10.0%
F 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1129
54.8%
ASCII 931
45.2%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
357
38.3%
1 120
 
12.9%
, 68
 
7.3%
) 53
 
5.7%
( 53
 
5.7%
0 53
 
5.7%
2 45
 
4.8%
3 34
 
3.7%
8 26
 
2.8%
4 26
 
2.8%
Other values (12) 96
 
10.3%
Hangul
ValueCountFrequency (%)
109
 
9.7%
104
 
9.2%
81
 
7.2%
69
 
6.1%
69
 
6.1%
68
 
6.0%
68
 
6.0%
52
 
4.6%
35
 
3.1%
28
 
2.5%
Other values (99) 446
39.5%
Number Forms
ValueCountFrequency (%)
1
100.0%

사업장전화번호
Text

MISSING 

Distinct32
Distinct (%)100.0%
Missing36
Missing (%)52.9%
Memory size676.0 B
2023-12-12T09:34:22.368813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.0625
Min length12

Characters and Unicode

Total characters386
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row031-8049-8830
2nd row031-985-6111
3rd row031-983-0036
4th row031-997-3120
5th row031-986-1997
ValueCountFrequency (%)
031-998-5713 1
 
3.1%
031-985-6111 1
 
3.1%
031-989-1039 1
 
3.1%
031-984-5201 1
 
3.1%
031-987-9623 1
 
3.1%
031-998-5546 1
 
3.1%
031-996-6565 1
 
3.1%
031-988-6488 1
 
3.1%
031-8049-8830 1
 
3.1%
031-981-4238 1
 
3.1%
Other values (22) 22
68.8%
2023-12-12T09:34:22.711153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 64
16.6%
0 52
13.5%
1 52
13.5%
9 50
13.0%
3 48
12.4%
8 42
10.9%
5 21
 
5.4%
6 21
 
5.4%
2 15
 
3.9%
4 11
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 322
83.4%
Dash Punctuation 64
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 52
16.1%
1 52
16.1%
9 50
15.5%
3 48
14.9%
8 42
13.0%
5 21
6.5%
6 21
6.5%
2 15
 
4.7%
4 11
 
3.4%
7 10
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 386
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 64
16.6%
0 52
13.5%
1 52
13.5%
9 50
13.0%
3 48
12.4%
8 42
10.9%
5 21
 
5.4%
6 21
 
5.4%
2 15
 
3.9%
4 11
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 386
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 64
16.6%
0 52
13.5%
1 52
13.5%
9 50
13.0%
3 48
12.4%
8 42
10.9%
5 21
 
5.4%
6 21
 
5.4%
2 15
 
3.9%
4 11
 
2.8%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
Minimum2023-08-16 00:00:00
Maximum2023-08-16 00:00:00
2023-12-12T09:34:22.849038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:34:22.965238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-12T09:34:23.068764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가년월일업소명지번주소도로명주소사업장전화번호
인허가년월일1.0001.0001.0001.0001.000
업소명1.0001.0001.0001.0001.000
지번주소1.0001.0001.0001.0001.000
도로명주소1.0001.0001.0001.0001.000
사업장전화번호1.0001.0001.0001.0001.000

Missing values

2023-12-12T09:34:19.352477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:34:19.446751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T09:34:19.529352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

인허가년월일업소명지번주소도로명주소사업장전화번호데이터기준일자
02022-03-16아이비젼경기도 김포시 장기동 2077-2경기도 김포시 태장로 800, 201호 (장기동)031-8049-88302023-08-16
12022-02-10안경발전소경기도 김포시 고촌읍 신곡리 880-1경기도 김포시 고촌읍 수기로 109, 107호<NA>2023-08-16
22022-02-07으뜸플러스사우점경기도 김포시 사우동 928 김포아트프라자경기도 김포시 사우중로 52, 김포아트프라자 201호 (사우동)<NA>2023-08-16
32021-12-22안경진정성 김포운양점경기도 김포시 운양동 1296-9 에이스프라자경기도 김포시 김포한강11로 288-31, 에이스프라자 102,103호 (운양동)<NA>2023-08-16
42021-06-04안경, 진정성 걸포북변점경기도 김포시 걸포동 268-2경기도 김포시 걸포2로 33, 104호 (걸포동)<NA>2023-08-16
52021-05-18안경, 진정성<NA>경기도 김포시 김포한강8로148번길 5, 103,104호 (마산동)<NA>2023-08-16
62021-03-16오렌즈 김포장기점경기도 김포시 장기동 1610경기도 김포시 김포한강4로 113, 1층 108호 (장기동)031-985-61112023-08-16
72020-10-08별별안경<NA>경기도 김포시 걸포2로 83, B123,B124,B125호 (걸포동, 한강메트로자이1단지)031-983-00362023-08-16
82020-03-12한빛안경랜드경기도 김포시 양촌읍 양곡리 1305번지 4호 아름터프라자경기도 김포시 양촌읍 양곡2로 50, 아름터프라자 105-1호031-997-31202023-08-16
92020-03-06으뜸플러스김포구래점경기도 김포시 구래동 6885번지 3호 웅신프라자경기도 김포시 김포한강8로 378, 웅신프라자 301호 (구래동)<NA>2023-08-16
인허가년월일업소명지번주소도로명주소사업장전화번호데이터기준일자
582000-12-23피카소안경경기도 김포시 사우동 932번지 메가라인빌딩 101,102호경기도 김포시 사우중로 51, 101,102호 (사우동, 메가라인)031-985-59912023-08-16
592000-10-14밝은안경경기도 김포시 풍무동 109번지 5호 109-5경기도 김포시 풍무로 109 (풍무동)<NA>2023-08-16
602000-09-23오뜨안경경기도 김포시 감정동 515번지 11호경기도 김포시 중봉로 6 (감정동)<NA>2023-08-16
612000-04-07양곡서전안경경기도 김포시 양촌읍 양곡리 388번지 4호경기도 김포시 양촌읍 양곡1로37번길 6<NA>2023-08-16
621999-12-03안경과사람들경기도 김포시 풍무동 47번지경기도 김포시 풍무로 145 (풍무동)<NA>2023-08-16
631998-04-02안경테나라경기도 김포시 북변동 383번지 1호경기도 김포시 북변중로 52 (북변동)<NA>2023-08-16
641993-12-02김포안경백화점경기도 김포시 사우동 922번지경기도 김포시 김포대로 835 (사우동)<NA>2023-08-16
651992-07-02덕신안경원경기도 김포시 양촌읍 양곡리 391번지 1호경기도 김포시 양촌읍 양곡1로 29<NA>2023-08-16
661991-06-14이태리안경경기도 김포시 통진읍 마송리 63번지 11호경기도 김포시 통진읍 조강로 37<NA>2023-08-16
671991-04-24연세안경경기도 김포시 감정동 532-7경기도 김포시 중봉1로 21, 1층 101호 (감정동)<NA>2023-08-16