Overview

Dataset statistics

Number of variables4
Number of observations175
Missing cells47
Missing cells (%)6.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.8 KiB
Average record size in memory33.8 B

Variable types

Text3
Numeric1

Dataset

Description경기도 안양시 인쇄현황 데이터 현황 상호명 우편번호 소재지 전화번호 등 경영하고자 하는 자의 신청을 받아 인쇄사 신고된 현황 정보입니다.
URLhttps://www.data.go.kr/data/3079531/fileData.do

Alerts

우편번호 has 3 (1.7%) missing valuesMissing
전화번호 has 44 (25.1%) missing valuesMissing

Reproduction

Analysis started2023-12-12 14:02:29.459959
Analysis finished2023-12-12 14:02:30.144091
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct172
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-12T23:02:30.336719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length6.1828571
Min length2

Characters and Unicode

Total characters1082
Distinct characters218
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique169 ?
Unique (%)96.6%

Sample

1st row부흥인쇄사
2nd row국제문화사
3rd row문화인쇄소
4th row유신인쇄소
5th row태양당인쇄
ValueCountFrequency (%)
주식회사 16
 
7.5%
디자인 3
 
1.4%
샛별기획인쇄 2
 
0.9%
인쇄사 2
 
0.9%
사단법인 2
 
0.9%
기획 2
 
0.9%
대양 2
 
0.9%
하늘기획 2
 
0.9%
가람인쇄 1
 
0.5%
삼일사 1
 
0.5%
Other values (180) 180
84.5%
2023-12-12T23:02:30.771186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
61
 
5.6%
58
 
5.4%
48
 
4.4%
47
 
4.3%
43
 
4.0%
38
 
3.5%
37
 
3.4%
) 28
 
2.6%
( 27
 
2.5%
20
 
1.8%
Other values (208) 675
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 979
90.5%
Space Separator 38
 
3.5%
Close Punctuation 28
 
2.6%
Open Punctuation 27
 
2.5%
Uppercase Letter 3
 
0.3%
Lowercase Letter 3
 
0.3%
Other Punctuation 2
 
0.2%
Decimal Number 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
6.2%
58
 
5.9%
48
 
4.9%
47
 
4.8%
43
 
4.4%
37
 
3.8%
20
 
2.0%
17
 
1.7%
16
 
1.6%
15
 
1.5%
Other values (196) 617
63.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
33.3%
P 1
33.3%
C 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
o 1
33.3%
n 1
33.3%
e 1
33.3%
Space Separator
ValueCountFrequency (%)
38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 979
90.5%
Common 97
 
9.0%
Latin 6
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
61
 
6.2%
58
 
5.9%
48
 
4.9%
47
 
4.8%
43
 
4.4%
37
 
3.8%
20
 
2.0%
17
 
1.7%
16
 
1.6%
15
 
1.5%
Other values (196) 617
63.0%
Common
ValueCountFrequency (%)
38
39.2%
) 28
28.9%
( 27
27.8%
& 2
 
2.1%
1 1
 
1.0%
- 1
 
1.0%
Latin
ValueCountFrequency (%)
S 1
16.7%
P 1
16.7%
C 1
16.7%
o 1
16.7%
n 1
16.7%
e 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 979
90.5%
ASCII 103
 
9.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
61
 
6.2%
58
 
5.9%
48
 
4.9%
47
 
4.8%
43
 
4.4%
37
 
3.8%
20
 
2.0%
17
 
1.7%
16
 
1.6%
15
 
1.5%
Other values (196) 617
63.0%
ASCII
ValueCountFrequency (%)
38
36.9%
) 28
27.2%
( 27
26.2%
& 2
 
1.9%
S 1
 
1.0%
P 1
 
1.0%
C 1
 
1.0%
1 1
 
1.0%
- 1
 
1.0%
o 1
 
1.0%
Other values (2) 2
 
1.9%

우편번호
Real number (ℝ)

MISSING 

Distinct58
Distinct (%)33.7%
Missing3
Missing (%)1.7%
Infinite0
Infinite (%)0.0%
Mean14024.424
Minimum11947
Maximum14120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-12T23:02:30.962890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11947
5-th percentile13937
Q113992
median14057
Q314086
95-th percentile14118
Maximum14120
Range2173
Interquartile range (IQR)94

Descriptive statistics

Standard deviation169.93384
Coefficient of variation (CV)0.012116992
Kurtosis132.25178
Mean14024.424
Median Absolute Deviation (MAD)31.5
Skewness-10.809655
Sum2412201
Variance28877.509
MonotonicityNot monotonic
2023-12-12T23:02:31.120815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14086 16
 
9.1%
14057 15
 
8.6%
14059 14
 
8.0%
14056 8
 
4.6%
14092 8
 
4.6%
13992 6
 
3.4%
13947 6
 
3.4%
13948 6
 
3.4%
14033 5
 
2.9%
13946 5
 
2.9%
Other values (48) 83
47.4%
ValueCountFrequency (%)
11947 1
 
0.6%
13902 2
 
1.1%
13911 1
 
0.6%
13931 3
1.7%
13936 1
 
0.6%
13937 4
2.3%
13938 1
 
0.6%
13939 2
 
1.1%
13944 1
 
0.6%
13946 5
2.9%
ValueCountFrequency (%)
14120 1
 
0.6%
14119 5
2.9%
14118 4
2.3%
14117 1
 
0.6%
14114 1
 
0.6%
14109 1
 
0.6%
14098 2
 
1.1%
14094 1
 
0.6%
14093 2
 
1.1%
14092 8
4.6%
Distinct170
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-12T23:02:31.508615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length49
Mean length35.937143
Min length24

Characters and Unicode

Total characters6289
Distinct characters173
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique165 ?
Unique (%)94.3%

Sample

1st row경기도 안양시 만안구 양화로 56, 1층 (안양동)
2nd row경기도 안양시 만안구 안양로314번길 27 (안양동)
3rd row경기도 안양시 만안구 만안로 199 (안양동)
4th row경기도 안양시 만안구 만안로 189 (안양동)
5th row경기도 안양시 만안구 석수로 492 (석수동)
ValueCountFrequency (%)
경기도 175
 
13.8%
안양시 175
 
13.8%
동안구 97
 
7.7%
만안구 78
 
6.2%
안양동 53
 
4.2%
관양동 42
 
3.3%
호계동 17
 
1.3%
관악대로 15
 
1.2%
안양로 12
 
0.9%
전파로 11
 
0.9%
Other values (333) 591
46.7%
2023-12-12T23:02:32.001497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1172
18.6%
480
 
7.6%
347
 
5.5%
289
 
4.6%
1 222
 
3.5%
186
 
3.0%
, 180
 
2.9%
178
 
2.8%
177
 
2.8%
177
 
2.8%
Other values (163) 2881
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3533
56.2%
Space Separator 1172
 
18.6%
Decimal Number 956
 
15.2%
Other Punctuation 180
 
2.9%
Close Punctuation 176
 
2.8%
Open Punctuation 176
 
2.8%
Uppercase Letter 46
 
0.7%
Dash Punctuation 24
 
0.4%
Lowercase Letter 24
 
0.4%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
480
 
13.6%
347
 
9.8%
289
 
8.2%
186
 
5.3%
178
 
5.0%
177
 
5.0%
177
 
5.0%
175
 
5.0%
175
 
5.0%
109
 
3.1%
Other values (133) 1240
35.1%
Decimal Number
ValueCountFrequency (%)
1 222
23.2%
2 164
17.2%
3 118
12.3%
0 111
11.6%
4 97
10.1%
5 55
 
5.8%
6 53
 
5.5%
7 50
 
5.2%
8 46
 
4.8%
9 40
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
B 15
32.6%
I 7
15.2%
S 5
 
10.9%
V 5
 
10.9%
K 4
 
8.7%
T 3
 
6.5%
Z 3
 
6.5%
C 2
 
4.3%
A 2
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
e 8
33.3%
c 4
16.7%
n 4
16.7%
t 4
16.7%
r 4
16.7%
Space Separator
ValueCountFrequency (%)
1172
100.0%
Other Punctuation
ValueCountFrequency (%)
, 180
100.0%
Close Punctuation
ValueCountFrequency (%)
) 176
100.0%
Open Punctuation
ValueCountFrequency (%)
( 176
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3533
56.2%
Common 2686
42.7%
Latin 70
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
480
 
13.6%
347
 
9.8%
289
 
8.2%
186
 
5.3%
178
 
5.0%
177
 
5.0%
177
 
5.0%
175
 
5.0%
175
 
5.0%
109
 
3.1%
Other values (133) 1240
35.1%
Common
ValueCountFrequency (%)
1172
43.6%
1 222
 
8.3%
, 180
 
6.7%
) 176
 
6.6%
( 176
 
6.6%
2 164
 
6.1%
3 118
 
4.4%
0 111
 
4.1%
4 97
 
3.6%
5 55
 
2.0%
Other values (6) 215
 
8.0%
Latin
ValueCountFrequency (%)
B 15
21.4%
e 8
11.4%
I 7
10.0%
S 5
 
7.1%
V 5
 
7.1%
K 4
 
5.7%
c 4
 
5.7%
n 4
 
5.7%
t 4
 
5.7%
r 4
 
5.7%
Other values (4) 10
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3533
56.2%
ASCII 2756
43.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1172
42.5%
1 222
 
8.1%
, 180
 
6.5%
) 176
 
6.4%
( 176
 
6.4%
2 164
 
6.0%
3 118
 
4.3%
0 111
 
4.0%
4 97
 
3.5%
5 55
 
2.0%
Other values (20) 285
 
10.3%
Hangul
ValueCountFrequency (%)
480
 
13.6%
347
 
9.8%
289
 
8.2%
186
 
5.3%
178
 
5.0%
177
 
5.0%
177
 
5.0%
175
 
5.0%
175
 
5.0%
109
 
3.1%
Other values (133) 1240
35.1%

전화번호
Text

MISSING 

Distinct125
Distinct (%)95.4%
Missing44
Missing (%)25.1%
Memory size1.5 KiB
2023-12-12T23:02:32.261601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.992366
Min length11

Characters and Unicode

Total characters1571
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique119 ?
Unique (%)90.8%

Sample

1st row031-449-2330
2nd row031-449-3841
3rd row031-449-5023
4th row031-449-3849
5th row031-471-1251
ValueCountFrequency (%)
031-463-3780 2
 
1.5%
031-423-9004 2
 
1.5%
031-445-8824 2
 
1.5%
031-442-0470 2
 
1.5%
031-478-4646 2
 
1.5%
031-455-0001 2
 
1.5%
031-420-4780 1
 
0.8%
031-426-7887 1
 
0.8%
031-449-2330 1
 
0.8%
031-421-8418 1
 
0.8%
Other values (115) 115
87.8%
2023-12-12T23:02:32.634620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 262
16.7%
0 218
13.9%
4 209
13.3%
3 203
12.9%
1 194
12.3%
2 103
 
6.6%
6 92
 
5.9%
7 79
 
5.0%
5 79
 
5.0%
8 74
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1309
83.3%
Dash Punctuation 262
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 218
16.7%
4 209
16.0%
3 203
15.5%
1 194
14.8%
2 103
7.9%
6 92
7.0%
7 79
 
6.0%
5 79
 
6.0%
8 74
 
5.7%
9 58
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 262
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1571
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 262
16.7%
0 218
13.9%
4 209
13.3%
3 203
12.9%
1 194
12.3%
2 103
 
6.6%
6 92
 
5.9%
7 79
 
5.0%
5 79
 
5.0%
8 74
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1571
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 262
16.7%
0 218
13.9%
4 209
13.3%
3 203
12.9%
1 194
12.3%
2 103
 
6.6%
6 92
 
5.9%
7 79
 
5.0%
5 79
 
5.0%
8 74
 
4.7%

Interactions

2023-12-12T23:02:29.725652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T23:02:29.863059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:02:29.961468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T23:02:30.085068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

상호명우편번호소재지(도로명)전화번호
0부흥인쇄사14007경기도 안양시 만안구 양화로 56, 1층 (안양동)031-449-2330
1국제문화사13992경기도 안양시 만안구 안양로314번길 27 (안양동)031-449-3841
2문화인쇄소13992경기도 안양시 만안구 만안로 199 (안양동)031-449-5023
3유신인쇄소13902경기도 안양시 만안구 만안로 189 (안양동)031-449-3849
4태양당인쇄13902경기도 안양시 만안구 석수로 492 (석수동)031-471-1251
5대신사13998경기도 안양시 만안구 삼덕로 60 (안양동,1층)031-443-0199
6영기획인쇄13992경기도 안양시 만안구 만안로 203 (안양동)031-449-0623
7진원인쇄14034경기도 안양시 만안구 안양로 150 (안양동)031-443-9426
8민기획인쇄소<NA>경기도 안양시 만안구 태평로8번길 8 (안양동)031-448-2136
9아침기획13968경기도 안양시 만안구 석천로159번길 44 (석수동,201호)031-472-6366
상호명우편번호소재지(도로명)전화번호
165주식회사 디자인다솜14057경기도 안양시 동안구 벌말로 126, 평촌 오비즈타워 제지1층 제비114호 (관양동)<NA>
166주식회사 엘큐브14117경기도 안양시 동안구 엘에스로 92, 국제유통단지 30동 303호 (호계동)<NA>
167(주)가현기획14057경기도 안양시 동안구 벌말로 126, 평촌 오비즈타워 1001호 (관양동)<NA>
168나모세종14057경기도 안양시 동안구 벌말로 140, 동일테크노타운7차 7207호 (관양동)031-360-7799
169(주)파스텔북14059경기도 안양시 동안구 흥안대로427번길 38, 인덕원성지스타위드 511호 (관양동)070-8181-2891
170제이투프린팅14059경기도 안양시 동안구 흥안대로427번길 16, 평촌디지털엠파이어 B114호 (관양동)<NA>
171청년기획14058경기도 안양시 동안구 흥안대로 457-27, 에이스하이테크시티평촌 405호 (관양동)<NA>
172주식회사 오뚜기프렌즈14060경기도 안양시 동안구 흥안대로 405 (주)오뚜기 안양공장 기획생산동 1층 (평촌동)031-421-8201
173(주)이문기업14059경기도 안양시 동안구 흥안대로 415, 두산벤처다임 서관 205호 (평촌동)<NA>
174디프린팅14118경기도 안양시 동안구 엘에스로 142, 호계 금정역 SK V1 center 1037-210C호 (호계동)031-477-3511