Overview

Dataset statistics

Number of variables4
Number of observations215
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.1 KiB
Average record size in memory33.6 B

Variable types

Numeric1
Text1
Categorical2

Dataset

Description인천광역시 계양구 관내 출판 인쇄업 현황에 대한 데이터로, 연번, 사업체 명칭, 사업체 소재지, 업종 등을 제공합니다.
Author인천광역시 계양구
URLhttps://www.data.go.kr/data/15038925/fileData.do

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
업종 is highly imbalanced (58.5%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-16 04:21:10.442094
Analysis finished2024-03-16 04:21:11.026923
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct215
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean108
Minimum1
Maximum215
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2024-03-16T13:21:11.137576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11.7
Q154.5
median108
Q3161.5
95-th percentile204.3
Maximum215
Range214
Interquartile range (IQR)107

Descriptive statistics

Standard deviation62.209324
Coefficient of variation (CV)0.57601226
Kurtosis-1.2
Mean108
Median Absolute Deviation (MAD)54
Skewness0
Sum23220
Variance3870
MonotonicityStrictly increasing
2024-03-16T13:21:11.302847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
149 1
 
0.5%
138 1
 
0.5%
139 1
 
0.5%
140 1
 
0.5%
141 1
 
0.5%
142 1
 
0.5%
143 1
 
0.5%
144 1
 
0.5%
145 1
 
0.5%
Other values (205) 205
95.3%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
215 1
0.5%
214 1
0.5%
213 1
0.5%
212 1
0.5%
211 1
0.5%
210 1
0.5%
209 1
0.5%
208 1
0.5%
207 1
0.5%
206 1
0.5%
Distinct209
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2024-03-16T13:21:11.609679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length16
Mean length7.0139535
Min length2

Characters and Unicode

Total characters1508
Distinct characters359
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique203 ?
Unique (%)94.4%

Sample

1st row도서출판 혜인
2nd row경인여자대학교 출판부
3rd row땅위에세운집
4th row형문출판사
5th row해경고시사
ValueCountFrequency (%)
도서출판 24
 
7.6%
출판사 8
 
2.5%
주식회사 8
 
2.5%
디자인 4
 
1.3%
북스 3
 
0.9%
주)디자인메이커스 2
 
0.6%
오시드 2
 
0.6%
연구소 2
 
0.6%
세상 2
 
0.6%
도래커뮤니케이션 2
 
0.6%
Other values (254) 260
82.0%
2024-03-16T13:21:12.129761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
102
 
6.8%
45
 
3.0%
44
 
2.9%
36
 
2.4%
32
 
2.1%
31
 
2.1%
30
 
2.0%
24
 
1.6%
( 23
 
1.5%
) 23
 
1.5%
Other values (349) 1118
74.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1163
77.1%
Space Separator 102
 
6.8%
Uppercase Letter 97
 
6.4%
Lowercase Letter 79
 
5.2%
Open Punctuation 23
 
1.5%
Close Punctuation 23
 
1.5%
Decimal Number 13
 
0.9%
Other Punctuation 5
 
0.3%
Dash Punctuation 2
 
0.1%
Connector Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
3.9%
44
 
3.8%
36
 
3.1%
32
 
2.8%
31
 
2.7%
30
 
2.6%
24
 
2.1%
22
 
1.9%
21
 
1.8%
19
 
1.6%
Other values (295) 859
73.9%
Uppercase Letter
ValueCountFrequency (%)
S 9
 
9.3%
I 9
 
9.3%
N 7
 
7.2%
R 7
 
7.2%
L 7
 
7.2%
O 6
 
6.2%
U 6
 
6.2%
E 5
 
5.2%
T 5
 
5.2%
A 4
 
4.1%
Other values (13) 32
33.0%
Lowercase Letter
ValueCountFrequency (%)
t 12
15.2%
a 10
12.7%
i 9
11.4%
o 8
10.1%
e 8
10.1%
l 7
8.9%
n 7
8.9%
d 3
 
3.8%
r 3
 
3.8%
s 2
 
2.5%
Other values (9) 10
12.7%
Decimal Number
ValueCountFrequency (%)
0 5
38.5%
1 4
30.8%
3 2
 
15.4%
2 2
 
15.4%
Other Punctuation
ValueCountFrequency (%)
: 2
40.0%
. 2
40.0%
& 1
20.0%
Space Separator
ValueCountFrequency (%)
102
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1154
76.5%
Latin 176
 
11.7%
Common 169
 
11.2%
Han 9
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
3.9%
44
 
3.8%
36
 
3.1%
32
 
2.8%
31
 
2.7%
30
 
2.6%
24
 
2.1%
22
 
1.9%
21
 
1.8%
19
 
1.6%
Other values (286) 850
73.7%
Latin
ValueCountFrequency (%)
t 12
 
6.8%
a 10
 
5.7%
S 9
 
5.1%
i 9
 
5.1%
I 9
 
5.1%
o 8
 
4.5%
e 8
 
4.5%
l 7
 
4.0%
N 7
 
4.0%
n 7
 
4.0%
Other values (32) 90
51.1%
Common
ValueCountFrequency (%)
102
60.4%
( 23
 
13.6%
) 23
 
13.6%
0 5
 
3.0%
1 4
 
2.4%
3 2
 
1.2%
2 2
 
1.2%
: 2
 
1.2%
- 2
 
1.2%
. 2
 
1.2%
Other values (2) 2
 
1.2%
Han
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1154
76.5%
ASCII 345
 
22.9%
CJK 9
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
102
29.6%
( 23
 
6.7%
) 23
 
6.7%
t 12
 
3.5%
a 10
 
2.9%
S 9
 
2.6%
i 9
 
2.6%
I 9
 
2.6%
o 8
 
2.3%
e 8
 
2.3%
Other values (44) 132
38.3%
Hangul
ValueCountFrequency (%)
45
 
3.9%
44
 
3.8%
36
 
3.1%
32
 
2.8%
31
 
2.7%
30
 
2.6%
24
 
2.1%
22
 
1.9%
21
 
1.8%
19
 
1.6%
Other values (286) 850
73.7%
CJK
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Distinct17
Distinct (%)7.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
인천광역시 계양구 계산동
69 
인천광역시 계양구 작전동
42 
인천광역시 계양구 효성동
23 
인천광역시 계양구 임학동
13 
인천광역시 계양구 병방동
12 
Other values (12)
56 

Length

Max length14
Median length13
Mean length13.004651
Min length13

Unique

Unique4 ?
Unique (%)1.9%

Sample

1st row인천광역시 계양구 병방동
2nd row인천광역시 계양구 계산동
3rd row인천광역시 계양구 계산동
4th row인천광역시 계양구 박촌동
5th row인천광역시 계양구 작전동

Common Values

ValueCountFrequency (%)
인천광역시 계양구 계산동 69
32.1%
인천광역시 계양구 작전동 42
19.5%
인천광역시 계양구 효성동 23
 
10.7%
인천광역시 계양구 임학동 13
 
6.0%
인천광역시 계양구 병방동 12
 
5.6%
인천광역시 계양구 동양동 10
 
4.7%
인천광역시 계양구 귤현동 10
 
4.7%
인천광역시 계양구 용종동 9
 
4.2%
인천광역시 계양구 서운동 7
 
3.3%
인천광역시 계양구 박촌동 7
 
3.3%
Other values (7) 13
 
6.0%

Length

2024-03-16T13:21:12.270166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인천광역시 215
33.3%
계양구 215
33.3%
계산동 69
 
10.7%
작전동 42
 
6.5%
효성동 23
 
3.6%
임학동 13
 
2.0%
병방동 12
 
1.9%
동양동 10
 
1.6%
귤현동 10
 
1.6%
용종동 9
 
1.4%
Other values (9) 27
 
4.2%

업종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
출판사
197 
인쇄사
 
18

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 197
91.6%
인쇄사 18
 
8.4%

Length

2024-03-16T13:21:12.404812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T13:21:12.533926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 197
91.6%
인쇄사 18
 
8.4%

Interactions

2024-03-16T13:21:10.734159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-16T13:21:12.621246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업체소재지업종
연번1.0000.2650.984
사업체소재지0.2651.0000.201
업종0.9840.2011.000
2024-03-16T13:21:12.745975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업체소재지업종
사업체소재지1.0000.173
업종0.1731.000
2024-03-16T13:21:12.852042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업체소재지업종
연번1.0000.1070.874
사업체소재지0.1071.0000.173
업종0.8740.1731.000

Missing values

2024-03-16T13:21:10.874114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T13:21:10.979684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업체명칭사업체소재지업종
01도서출판 혜인인천광역시 계양구 병방동출판사
12경인여자대학교 출판부인천광역시 계양구 계산동출판사
23땅위에세운집인천광역시 계양구 계산동출판사
34형문출판사인천광역시 계양구 박촌동출판사
45해경고시사인천광역시 계양구 작전동출판사
56예수생명운동인천광역시 계양구 계산동출판사
67e-월드출판사인천광역시 계양구 임학동출판사
78엘리트인천광역시 계양구 계산동출판사
89정보마당인천광역시 계양구 효성동출판사
910번다코리아인천광역시 계양구 임학동출판사
연번사업체명칭사업체소재지업종
205206그린기획인천광역시 계양구 계산동인쇄사
206207드림기획인천광역시 계양구 계산동인쇄사
207208백암실업인천광역시 계양구 상야동인쇄사
208209주식회사 디자인촉 인천인천광역시 계양구 오류동인쇄사
209210디자인아트고광균인천광역시 계양구 병방동인쇄사
210211주식회사 에이치티씨인천광역시 계양구 서운동인쇄사
211212오시드 디자인인천광역시 계양구 작전동인쇄사
212213(주) 디자인 임광건설인천광역시 계양구 병방동인쇄사
213214도서출판 다원인천광역시 계양구 효성동인쇄사
214215(주)디자인메이커스인천광역시 계양구 서운동인쇄사