Overview

Dataset statistics

Number of variables: 5
Number of observations: 106
Missing cells: 41
Missing cells (%): 7.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.4 KiB
Average record size in memory: 42.2 B
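
The missing-cell percentage can be checked from the other figures in this table. The sketch below assumes the percentage is taken over all cells (variables × observations); the report does not state the denominator, but this assumption reproduces the 7.7% shown. The variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 5
n_observations = 106
missing_cells = 41

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```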

Variable types

Numeric: 1
Categorical: 2
Text: 2

Dataset

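Description: Data on publishing and printing businesses within Dong-gu, Incheon Metropolitan City, providing fields such as business type (업종), business name (업체명), business location (업체소재지), and phone number (전화번호).
Author: Dong-gu, Incheon Metropolitan City (인천광역시 동구)
URL: https://www.data.go.kr/data/15115981/fileData.do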

Alerts

연번 is highly overall correlated with 업종 (High correlation)
업종 is highly overall correlated with 연번 (High correlation)
전화번호 has 41 (38.7%) missing values (Missing)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-03-15 01:11:49.644329
Analysis finished: 2024-03-15 01:11:50.970164
Duration: 1.33 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 106
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 53.5
Minimum: 1
Maximum: 106
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.25
Q1: 27.25
Median: 53.5
Q3: 79.75
95-th percentile: 100.75
Maximum: 106
Range: 105
Interquartile range (IQR): 52.5
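
The quantiles above are consistent with linear interpolation between neighbouring order statistics. The sketch below is a minimal illustration under two assumptions that the report does not state explicitly: that 연번 holds the integers 1 to 106 (consistent with the reported minimum, maximum, distinct count, and sum), and that linear interpolation is the method used. The `percentile` helper is hypothetical, not part of ydata-profiling.

```python
def percentile(sorted_values, q):
    """Linear-interpolation percentile (q in [0, 100]) of pre-sorted data."""
    pos = (len(sorted_values) - 1) * q / 100   # fractional index into the data
    lo = int(pos)                              # index of the lower neighbour
    hi = min(lo + 1, len(sorted_values) - 1)   # index of the upper neighbour
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])

# Assumption: 연번 is the running number 1..106.
values = list(range(1, 107))

p05, q1, med, q3, p95 = (percentile(values, q) for q in (5, 25, 50, 75, 95))
iqr = q3 - q1
value_range = values[-1] - values[0]

print(p05, q1, med, q3, p95, iqr, value_range)
```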

Descriptive statistics

Standard deviation: 30.743563
Coefficient of variation (CV): 0.57464604
Kurtosis: -1.2
Mean: 53.5
Median Absolute Deviation (MAD): 26.5
Skewness: 0
Sum: 5671
Variance: 945.16667
Monotonicity: Strictly increasing
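
Most of these figures can be reproduced with the Python standard library. The sketch below again assumes that 연번 is the sequence 1 to 106, and that the variance and standard deviation use the sample (n - 1) denominator; the report does not say which denominator it uses, but the sample form matches the values shown.

```python
import statistics

# Assumption: 연번 is the running number 1..106.
values = list(range(1, 107))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation

# Median absolute deviation: median distance from the median.
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])

print(total, mean, round(variance, 5), round(std, 6), round(cv, 8), mad)
```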
[Figure: Histogram with fixed size bins (bins=50)]
Value | Count | Frequency (%)
1 | 1 | 0.9%
81 | 1 | 0.9%
79 | 1 | 0.9%
78 | 1 | 0.9%
77 | 1 | 0.9%
76 | 1 | 0.9%
75 | 1 | 0.9%
74 | 1 | 0.9%
73 | 1 | 0.9%
72 | 1 | 0.9%
Other values (96) | 96 | 90.6%

Value | Count | Frequency (%)
1 | 1 | 0.9%
2 | 1 | 0.9%
3 | 1 | 0.9%
4 | 1 | 0.9%
5 | 1 | 0.9%
6 | 1 | 0.9%
7 | 1 | 0.9%
8 | 1 | 0.9%
9 | 1 | 0.9%
10 | 1 | 0.9%

Value | Count | Frequency (%)
106 | 1 | 0.9%
105 | 1 | 0.9%
104 | 1 | 0.9%
103 | 1 | 0.9%
102 | 1 | 0.9%
101 | 1 | 0.9%
100 | 1 | 0.9%
99 | 1 | 0.9%
98 | 1 | 0.9%
97 | 1 | 0.9%

업종
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 976.0 B
출판사: 70
인쇄사: 36

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 출판사
2nd row: 출판사
3rd row: 출판사
4th row: 출판사
5th row: 출판사

Common Values

Value | Count | Frequency (%)
출판사 | 70 | 66.0%
인쇄사 | 36 | 34.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
출판사 | 70 | 66.0%
인쇄사 | 36 | 34.0%

업체명
Text

Distinct: 92
Distinct (%): 86.8%
Missing: 0
Missing (%): 0.0%
Memory size: 976.0 B

Length

Max length: 21
Median length: 16.5
Mean length: 6.1132075
Min length: 1

Characters and Unicode

Total characters: 648
Distinct characters: 229
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
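
As a small illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses only the five sample names listed for this variable, so the counts cover that sample rather than the full column. The two-letter codes map onto the category names used in this report: `Lo` is "Other Letter", `Lu` is "Uppercase Letter", and `Zs` is "Space Separator".

```python
import unicodedata
from collections import Counter

# The five sample values shown for this variable (not the full column).
names = ["환웅문화사", "은정문화사", "도서출판 KIM", "삼정", "해반"]

# Count the general category of every character across the sample.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

print(category_counts)
```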

Unique

Unique: 78
Unique (%): 73.6%

Sample

1st row: 환웅문화사
2nd row: 은정문화사
3rd row: 도서출판 KIM
4th row: 삼정
5th row: 해반

Value | Count | Frequency (%)
도서출판 | 8 | 5.8%
신성사 | 2 | 1.5%
(empty in source) | 2 | 1.5%
단비기획 | 2 | 1.5%
햇빛과 | 2 | 1.5%
주)글소리 | 2 | 1.5%
베리즈 | 2 | 1.5%
야베스 | 2 | 1.5%
유림 | 2 | 1.5%
코퍼레이션 | 2 | 1.5%
Other values (103) | 111 | 81.0%

Most occurring characters

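Value | Count | Frequency (%)
(space) | 32 | 4.9%
(character not preserved) | 25 | 3.9%
(character not preserved) | 16 | 2.5%
( | 13 | 2.0%
) | 13 | 2.0%
(character not preserved) | 12 | 1.9%
(character not preserved) | 11 | 1.7%
(character not preserved) | 11 | 1.7%
(character not preserved) | 11 | 1.7%
(character not preserved) | 11 | 1.7%
Other values (219) | 493 | 76.1%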

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 526 | 81.2%
Lowercase Letter | 44 | 6.8%
Space Separator | 32 | 4.9%
Uppercase Letter | 19 | 2.9%
Open Punctuation | 13 | 2.0%
Close Punctuation | 13 | 2.0%
Other Punctuation | 1 | 0.2%

Most frequent character per category

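Other Letter
Value | Count | Frequency (%)
(character not preserved) | 25 | 4.8%
(character not preserved) | 16 | 3.0%
(character not preserved) | 12 | 2.3%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 10 | 1.9%
(character not preserved) | 10 | 1.9%
Other values (186) | 398 | 75.7%

Lowercase Letter
Value | Count | Frequency (%)
a | 7 | 15.9%
e | 6 | 13.6%
o | 5 | 11.4%
r | 4 | 9.1%
h | 4 | 9.1%
n | 3 | 6.8%
d | 3 | 6.8%
m | 2 | 4.5%
t | 2 | 4.5%
g | 1 | 2.3%
Other values (7) | 7 | 15.9%

Uppercase Letter
Value | Count | Frequency (%)
T | 3 | 15.8%
H | 2 | 10.5%
S | 2 | 10.5%
I | 2 | 10.5%
K | 2 | 10.5%
W | 2 | 10.5%
B | 1 | 5.3%
G | 1 | 5.3%
M | 1 | 5.3%
R | 1 | 5.3%
Other values (2) | 2 | 10.5%

Space Separator
Value | Count | Frequency (%)
(space) | 32 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 13 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 13 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%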

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 526 | 81.2%
Latin | 63 | 9.7%
Common | 59 | 9.1%

Most frequent character per script

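Hangul
Value | Count | Frequency (%)
(character not preserved) | 25 | 4.8%
(character not preserved) | 16 | 3.0%
(character not preserved) | 12 | 2.3%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 10 | 1.9%
(character not preserved) | 10 | 1.9%
Other values (186) | 398 | 75.7%

Latin
Value | Count | Frequency (%)
a | 7 | 11.1%
e | 6 | 9.5%
o | 5 | 7.9%
r | 4 | 6.3%
h | 4 | 6.3%
T | 3 | 4.8%
n | 3 | 4.8%
d | 3 | 4.8%
m | 2 | 3.2%
H | 2 | 3.2%
Other values (19) | 24 | 38.1%

Common
Value | Count | Frequency (%)
(space) | 32 | 54.2%
( | 13 | 22.0%
) | 13 | 22.0%
& | 1 | 1.7%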

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 526 | 81.2%
ASCII | 122 | 18.8%

Most frequent character per block

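ASCII
Value | Count | Frequency (%)
(space) | 32 | 26.2%
( | 13 | 10.7%
) | 13 | 10.7%
a | 7 | 5.7%
e | 6 | 4.9%
o | 5 | 4.1%
r | 4 | 3.3%
h | 4 | 3.3%
T | 3 | 2.5%
n | 3 | 2.5%
Other values (23) | 32 | 26.2%

Hangul
Value | Count | Frequency (%)
(character not preserved) | 25 | 4.8%
(character not preserved) | 16 | 3.0%
(character not preserved) | 12 | 2.3%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 11 | 2.1%
(character not preserved) | 10 | 1.9%
(character not preserved) | 10 | 1.9%
Other values (186) | 398 | 75.7%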

업체소재지(동)
Categorical

Distinct: 7
Distinct (%): 6.6%
Missing: 0
Missing (%): 0.0%
Memory size: 976.0 B
인천광역시 동구 송림동: 42
인천광역시 동구 송현동: 27
인천광역시 동구 금곡동: 18
인천광역시 동구 화수동: 7
인천광역시 동구 창영동: 6
Other values (2): 6

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인천광역시 동구 화수동
2nd row: 인천광역시 동구 송현동
3rd row: 인천광역시 동구 송림동
4th row: 인천광역시 동구 금곡동
5th row: 인천광역시 동구 송림동

Common Values

Value | Count | Frequency (%)
인천광역시 동구 송림동 | 42 | 39.6%
인천광역시 동구 송현동 | 27 | 25.5%
인천광역시 동구 금곡동 | 18 | 17.0%
인천광역시 동구 화수동 | 7 | 6.6%
인천광역시 동구 창영동 | 6 | 5.7%
인천광역시 동구 화평동 | 4 | 3.8%
인천광역시 동구 만석동 | 2 | 1.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
인천광역시 | 106 | 33.3%
동구 | 106 | 33.3%
송림동 | 42 | 13.2%
송현동 | 27 | 8.5%
금곡동 | 18 | 5.7%
화수동 | 7 | 2.2%
창영동 | 6 | 1.9%
화평동 | 4 | 1.3%
만석동 | 2 | 0.6%

전화번호
Text

MISSING 

Distinct: 54
Distinct (%): 83.1%
Missing: 41
Missing (%): 38.7%
Memory size: 976.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.015385
Min length: 9

Characters and Unicode

Total characters: 781
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 43
Unique (%): 66.2%
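
The percentages for 전화번호 use different denominators, which is easy to miss. The sketch below is a check under an inferred assumption: the missing percentage is taken over all 106 rows, while the distinct and unique percentages and the mean length are taken over the non-missing rows only. The report does not state this, but it reproduces every figure shown for this variable. Variable names are illustrative.

```python
# Figures reported for 전화번호 in this section.
n_observations = 106
n_missing = 41
n_distinct = 54
n_unique = 43
total_characters = 781

# Assumption: non-missing rows are the base for distinct/unique/mean length.
n_present = n_observations - n_missing

missing_pct = 100 * n_missing / n_observations   # over all rows
distinct_pct = 100 * n_distinct / n_present      # over non-missing rows
unique_pct = 100 * n_unique / n_present          # over non-missing rows
mean_length = total_characters / n_present

print(n_present, round(missing_pct, 1), round(distinct_pct, 1),
      round(unique_pct, 1), round(mean_length, 6))
```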

Sample

1st row: 032-761-3496
2nd row: 032-762-9318
3rd row: 032-588-0170
4th row: 032-773-3402
5th row: 032-765-7984

Value | Count | Frequency (%)
032-764-1028 | 2 | 3.1%
032-883-5858 | 2 | 3.1%
032-773-5823 | 2 | 3.1%
032-207-9310 | 2 | 3.1%
032-773-1335 | 2 | 3.1%
032-773-3711 | 2 | 3.1%
032-508-2912 | 2 | 3.1%
032-881-4380 | 2 | 3.1%
032-885-4252 | 2 | 3.1%
032-773-3402 | 2 | 3.1%
Other values (44) | 45 | 69.2%

Most occurring characters

Value | Count | Frequency (%)
- | 129 | 16.5%
3 | 106 | 13.6%
0 | 98 | 12.5%
2 | 96 | 12.3%
7 | 79 | 10.1%
8 | 77 | 9.9%
5 | 47 | 6.0%
1 | 45 | 5.8%
6 | 36 | 4.6%
9 | 36 | 4.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 652 | 83.5%
Dash Punctuation | 129 | 16.5%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 106 | 16.3%
0 | 98 | 15.0%
2 | 96 | 14.7%
7 | 79 | 12.1%
8 | 77 | 11.8%
5 | 47 | 7.2%
1 | 45 | 6.9%
6 | 36 | 5.5%
9 | 36 | 5.5%
4 | 32 | 4.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 129 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 781 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 129 | 16.5%
3 | 106 | 13.6%
0 | 98 | 12.5%
2 | 96 | 12.3%
7 | 79 | 10.1%
8 | 77 | 9.9%
5 | 47 | 6.0%
1 | 45 | 5.8%
6 | 36 | 4.6%
9 | 36 | 4.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 781 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 129 | 16.5%
3 | 106 | 13.6%
0 | 98 | 12.5%
2 | 96 | 12.3%
7 | 79 | 10.1%
8 | 77 | 9.9%
5 | 47 | 6.0%
1 | 45 | 5.8%
6 | 36 | 4.6%
9 | 36 | 4.6%

Interactions

[Figure: interactions plot; image not preserved]

Correlations

 | 연번 | 업종 | 업체명 | 업체소재지(동) | 전화번호
연번 | 1.000 | 0.996 | 0.000 | 0.171 | 0.000
업종 | 0.996 | 1.000 | 0.000 | 0.254 | 0.000
업체명 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000
업체소재지(동) | 0.171 | 0.254 | 1.000 | 1.000 | 1.000
전화번호 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000

 | 업종 | 업체소재지(동)
업종 | 1.000 | 0.264
업체소재지(동) | 0.264 | 1.000

 | 연번 | 업종 | 업체소재지(동)
연번 | 1.000 | 0.906 | 0.091
업종 | 0.906 | 1.000 | 0.264
업체소재지(동) | 0.091 | 0.264 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion.]

Sample

(index) | 연번 | 업종 | 업체명 | 업체소재지(동) | 전화번호
0 | 1 | 출판사 | 환웅문화사 | 인천광역시 동구 화수동 | 032-761-3496
1 | 2 | 출판사 | 은정문화사 | 인천광역시 동구 송현동 | 032-762-9318
2 | 3 | 출판사 | 도서출판 KIM | 인천광역시 동구 송림동 | 032-588-0170
3 | 4 | 출판사 | 삼정 | 인천광역시 동구 금곡동 | 032-773-3402
4 | 5 | 출판사 | 해반 | 인천광역시 동구 송림동 | 032-765-7984
5 | 6 | 출판사 | 씨엠하우스 | 인천광역시 동구 금곡동 | 032-772-9594
6 | 7 | 출판사 | WIT컨설팅 | 인천광역시 동구 송림동 | 032-589-0569
7 | 8 | 출판사 | 음악사랑 | 인천광역시 동구 송현동 | <NA>
8 | 9 | 출판사 | Good Way | 인천광역시 동구 송현동 | 032-588-1811
9 | 10 | 출판사 | 이레디자인(주) | 인천광역시 동구 화평동 | 032-764-1028

(index) | 연번 | 업종 | 업체명 | 업체소재지(동) | 전화번호
96 | 97 | 인쇄사 |  | 인천광역시 동구 송현동 | <NA>
97 | 98 | 인쇄사 | (주)혜성디자인 동구지점 | 인천광역시 동구 송림동 | 032-508-2912
98 | 99 | 인쇄사 | (주)글소리 | 인천광역시 동구 송현동 | 032-883-5858
99 | 100 | 인쇄사 | 서해디자인 | 인천광역시 동구 화수동 | 032-773-5823
100 | 101 | 인쇄사 | 베리즈 코퍼레이션 | 인천광역시 동구 송현동 | <NA>
101 | 102 | 인쇄사 | 한결 | 인천광역시 동구 송림동 | <NA>
102 | 103 | 인쇄사 | 경인문화교육콘텐츠 사회적협동조합 | 인천광역시 동구 송현동 | <NA>
103 | 104 | 인쇄사 | 라우드디자인 | 인천광역시 동구 송현동 | <NA>
104 | 105 | 인쇄사 | 주식회사 스쿨디 | 인천광역시 동구 송현동 | <NA>
105 | 106 | 인쇄사 | 대한DH | 인천광역시 동구 송현동 | 032-764-0107