Overview

Dataset statistics

Number of variables: 7
Number of observations: 201
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.3 KiB
Average record size in memory: 57.6 B

Variable types

Numeric: 1
Text: 2
Categorical: 3
DateTime: 1

Dataset

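Description: Data on the publishers and printing shops in Yangju-si, including the business name (사업체명칭), business address by road name (사업체소재지(도로명)), and business type (업종).
Author: Yangju-si, Gyeonggi-do (경기도 양주시)
URL: https://www.data.go.kr/data/3079983/fileData.do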

Alerts

관리기관명 has constant value "양주시 문화관광과" (Constant)
관리기관 전화번호 has constant value "031-8082-5654" (Constant)
데이터기준일자 has constant value "2023-12-29" (Constant)
연번 is highly overall correlated with 업종 (High correlation)
업종 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
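The Constant and Unique alerts above come down to counting distinct values per column. The following is a minimal standard-library sketch of that idea, not ydata-profiling's own implementation: the function name `column_alerts` is hypothetical, and the three records are copied from the Sample section of this report with some columns omitted.

```python
from typing import Dict, List

# Three records copied from the Sample section of this report (subset of columns).
rows = [
    {"연번": 1, "사업체명칭": "예일", "업종": "출판사",
     "관리기관명": "양주시 문화관광과", "데이터기준일자": "2023-12-29"},
    {"연번": 2, "사업체명칭": "나한", "업종": "출판사",
     "관리기관명": "양주시 문화관광과", "데이터기준일자": "2023-12-29"},
    {"연번": 192, "사업체명칭": "재성인쇄", "업종": "인쇄사",
     "관리기관명": "양주시 문화관광과", "데이터기준일자": "2023-12-29"},
]


def column_alerts(records: List[dict]) -> Dict[str, str]:
    """Hypothetical helper: flag columns with one distinct value or all-distinct values."""
    alerts = {}
    for column in records[0]:
        values = [record[column] for record in records]
        distinct = len(set(values))
        if distinct == 1:
            alerts[column] = "Constant"
        elif distinct == len(values):
            alerts[column] = "Unique"
    return alerts


alerts = column_alerts(rows)
for column, alert in alerts.items():
    print(f"{column}: {alert}")
```

On this three-row subset 사업체명칭 is also flagged as Unique. On the full 201 rows the report counts 190 distinct names, so only 연번 receives that alert there.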

Reproduction

Analysis started: 2024-03-14 12:08:30.604115
Analysis finished: 2024-03-14 12:08:32.246664
Duration: 1.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 201
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 101
Minimum: 1
Maximum: 201
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 11
Q1: 51
Median: 101
Q3: 151
95-th percentile: 191
Maximum: 201
Range: 200
Interquartile range (IQR): 100
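The quantile figures above can be checked by hand. The sketch below assumes that 연번 is simply the integer sequence 1 to 201, which is consistent with the report (201 distinct values, minimum 1, maximum 201, strictly increasing) but is not stated outright. It uses linear interpolation between the sample minimum and maximum, which is an assumption about how the report computes percentiles.

```python
from statistics import quantiles

# Assumption: 연번 is the sequence 1..201 (see the lead-in).
values = list(range(1, 202))

# method="inclusive" treats the data as the whole population and interpolates
# linearly between the minimum and maximum.
q1, q2, q3 = quantiles(values, n=4, method="inclusive")
twentieths = quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

value_range = max(values) - min(values)
iqr = q3 - q1

print("5-th percentile:", p5)
print("Q1:", q1, "median:", q2, "Q3:", q3)
print("95-th percentile:", p95)
print("Range:", value_range, "IQR:", iqr)
```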

Descriptive statistics

Standard deviation: 58.167861
Coefficient of variation (CV): 0.57591941
Kurtosis: -1.2
Mean: 101
Median Absolute Deviation (MAD): 50
Skewness: 0
Sum: 20301
Variance: 3383.5
Monotonicity: Strictly increasing
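Most of the descriptive statistics above follow directly from 연번 being a running number. The sketch below assumes the column is exactly the sequence 1 to 201 and that the variance uses the sample (n - 1) denominator. Both assumptions are inferred from the reported figures rather than stated in the report.

```python
from statistics import mean, median, stdev, variance

# Assumption: 연번 is the sequence 1..201.
values = list(range(1, 202))

total = sum(values)
avg = mean(values)
var = variance(values)   # sample variance, n - 1 denominator
std = stdev(values)      # square root of the sample variance
cv = std / avg           # coefficient of variation

# Median absolute deviation: median distance from the median.
center = median(values)
mad = median(abs(v - center) for v in values)

# Strictly increasing: every value is larger than the one before it.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, avg, var, round(std, 6), round(cv, 8), mad, strictly_increasing)
```

For a sequence 1..n the sample variance is n(n + 1)/12, which gives 201 × 202 / 12 = 3383.5 and matches the reported value.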
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.5%
139 | 1 | 0.5%
129 | 1 | 0.5%
130 | 1 | 0.5%
131 | 1 | 0.5%
132 | 1 | 0.5%
133 | 1 | 0.5%
134 | 1 | 0.5%
135 | 1 | 0.5%
136 | 1 | 0.5%
Other values (191) | 191 | 95.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.5%
2 | 1 | 0.5%
3 | 1 | 0.5%
4 | 1 | 0.5%
5 | 1 | 0.5%
6 | 1 | 0.5%
7 | 1 | 0.5%
8 | 1 | 0.5%
9 | 1 | 0.5%
10 | 1 | 0.5%

Maximum 10 values

Value | Count | Frequency (%)
201 | 1 | 0.5%
200 | 1 | 0.5%
199 | 1 | 0.5%
198 | 1 | 0.5%
197 | 1 | 0.5%
196 | 1 | 0.5%
195 | 1 | 0.5%
194 | 1 | 0.5%
193 | 1 | 0.5%
192 | 1 | 0.5%

사업체명칭
Text

Distinct: 190
Distinct (%): 94.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 25
Median length: 17
Mean length: 6.2288557
Min length: 2

Characters and Unicode

Total characters: 1252
Distinct characters: 324
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
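As an illustration of the category breakdown used in this section, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It runs only on the five sample values of 사업체명칭 listed in this report, so its counts are for that subset and not for the full column. The `CATEGORY_NAMES` mapping is a helper introduced here. Script and block lookups are not available in `unicodedata` and are left out.

```python
import unicodedata
from collections import Counter

# The five sample values of 사업체명칭 shown in this report.
names = ["예일", "나한", "홍익가족", "아소비출판사", "(주)에듀비타"]

# Two-letter general category codes mapped to the long names the report uses.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
}

# Count the category of every character across all names.
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for name in names
    for ch in name
)

lengths = [len(name) for name in names]

for category, count in category_counts.most_common():
    print(f"{category}: {count}")
print("Min length:", min(lengths), "Max length:", max(lengths))
```

Hangul syllables fall under Other Letter (`Lo`), which is why that category dominates the full-column figures as well.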

Unique

Unique: 179
Unique (%): 89.1%

Sample

1st row: 예일
2nd row: 나한
3rd row: 홍익가족
4th row: 아소비출판사
5th row: (주)에듀비타
Value | Count | Frequency (%)
도서출판 | 13 | 4.7%
주식회사 | 10 | 3.6%
연구소 | 3 | 1.1%
 | 3 | 1.1%
주)우진디피피 | 2 | 0.7%
fne | 2 | 0.7%
원어성서원 | 2 | 0.7%
아드란 | 2 | 0.7%
인터니즈 | 2 | 0.7%
주)미래융합전략연구소 | 2 | 0.7%
Other values (228) | 234 | 85.1%

Most occurring characters

ValueCountFrequency (%)
74
 
5.9%
36
 
2.9%
35
 
2.8%
30
 
2.4%
25
 
2.0%
25
 
2.0%
( 21
 
1.7%
) 21
 
1.7%
21
 
1.7%
20
 
1.6%
Other values (314) 944
75.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1000 | 79.9%
Space Separator | 74 | 5.9%
Lowercase Letter | 62 | 5.0%
Uppercase Letter | 60 | 4.8%
Open Punctuation | 21 | 1.7%
Close Punctuation | 21 | 1.7%
Decimal Number | 12 | 1.0%
Other Punctuation | 2 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
3.6%
35
 
3.5%
30
 
3.0%
25
 
2.5%
25
 
2.5%
21
 
2.1%
20
 
2.0%
19
 
1.9%
16
 
1.6%
15
 
1.5%
Other values (264) 758
75.8%
Uppercase Letter
ValueCountFrequency (%)
E 7
11.7%
C 6
 
10.0%
S 6
 
10.0%
B 5
 
8.3%
N 4
 
6.7%
A 4
 
6.7%
F 3
 
5.0%
T 3
 
5.0%
M 3
 
5.0%
K 3
 
5.0%
Other values (12) 16
26.7%
Lowercase Letter
ValueCountFrequency (%)
n 9
14.5%
o 7
11.3%
a 7
11.3%
u 6
9.7%
i 5
8.1%
d 5
8.1%
m 4
 
6.5%
c 3
 
4.8%
e 2
 
3.2%
p 2
 
3.2%
Other values (8) 12
19.4%
Decimal Number
ValueCountFrequency (%)
1 5
41.7%
2 3
25.0%
0 2
 
16.7%
3 1
 
8.3%
6 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
/ 1
50.0%
& 1
50.0%
Space Separator
ValueCountFrequency (%)
74
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1000 | 79.9%
Common | 130 | 10.4%
Latin | 122 | 9.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
3.6%
35
 
3.5%
30
 
3.0%
25
 
2.5%
25
 
2.5%
21
 
2.1%
20
 
2.0%
19
 
1.9%
16
 
1.6%
15
 
1.5%
Other values (264) 758
75.8%
Latin
ValueCountFrequency (%)
n 9
 
7.4%
E 7
 
5.7%
o 7
 
5.7%
a 7
 
5.7%
u 6
 
4.9%
C 6
 
4.9%
S 6
 
4.9%
i 5
 
4.1%
B 5
 
4.1%
d 5
 
4.1%
Other values (30) 59
48.4%
Common
ValueCountFrequency (%)
74
56.9%
( 21
 
16.2%
) 21
 
16.2%
1 5
 
3.8%
2 3
 
2.3%
0 2
 
1.5%
/ 1
 
0.8%
3 1
 
0.8%
6 1
 
0.8%
& 1
 
0.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1000 | 79.9%
ASCII | 252 | 20.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
74
29.4%
( 21
 
8.3%
) 21
 
8.3%
n 9
 
3.6%
E 7
 
2.8%
o 7
 
2.8%
a 7
 
2.8%
u 6
 
2.4%
C 6
 
2.4%
S 6
 
2.4%
Other values (40) 88
34.9%
Hangul
ValueCountFrequency (%)
36
 
3.6%
35
 
3.5%
30
 
3.0%
25
 
2.5%
25
 
2.5%
21
 
2.1%
20
 
2.0%
19
 
1.9%
16
 
1.6%
15
 
1.5%
Other values (264) 758
75.8%

사업체소재지(도로명)
Text

Distinct: 191
Distinct (%): 95.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 58
Median length: 46
Mean length: 33.60199
Min length: 16

Characters and Unicode

Total characters: 6754
Distinct characters: 216
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 181
Unique (%): 90.0%

Sample

1st row: 경기도 양주시 만송동 356-3
2nd row: 경기도 양주시 장흥면 석굴암길 519
3rd row: 경기도 양주시 장흥면 호국로 176-16
4th row: 경기도 양주시 광사동 240-3
5th row: 경기도 양주시 장흥면 가마골로147번길 53
Value | Count | Frequency (%)
경기도 | 201 | 14.9%
양주시 | 201 | 14.9%
옥정동 | 38 | 2.8%
장흥면 | 27 | 2.0%
부흥로 | 24 | 1.8%
백석읍 | 22 | 1.6%
광적면 | 20 | 1.5%
광사동 | 17 | 1.3%
삼숭동 | 11 | 0.8%
덕정동 | 11 | 0.8%
Other values (459) | 779 | 57.7%

Most occurring characters

ValueCountFrequency (%)
1163
 
17.2%
1 342
 
5.1%
230
 
3.4%
230
 
3.4%
0 224
 
3.3%
217
 
3.2%
216
 
3.2%
204
 
3.0%
201
 
3.0%
201
 
3.0%
Other values (206) 3526
52.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3550 | 52.6%
Decimal Number | 1503 | 22.3%
Space Separator | 1163 | 17.2%
Other Punctuation | 183 | 2.7%
Open Punctuation | 136 | 2.0%
Close Punctuation | 136 | 2.0%
Dash Punctuation | 71 | 1.1%
Uppercase Letter | 8 | 0.1%
Letter Number | 3 | < 0.1%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
230
 
6.5%
230
 
6.5%
217
 
6.1%
216
 
6.1%
204
 
5.7%
201
 
5.7%
201
 
5.7%
186
 
5.2%
112
 
3.2%
100
 
2.8%
Other values (183) 1653
46.6%
Decimal Number
ValueCountFrequency (%)
1 342
22.8%
0 224
14.9%
2 193
12.8%
3 165
11.0%
5 118
 
7.9%
4 112
 
7.5%
8 95
 
6.3%
7 91
 
6.1%
9 85
 
5.7%
6 78
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
A 3
37.5%
S 2
25.0%
T 1
 
12.5%
P 1
 
12.5%
G 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 182
99.5%
@ 1
 
0.5%
Space Separator
ValueCountFrequency (%)
1163
100.0%
Open Punctuation
ValueCountFrequency (%)
( 136
100.0%
Close Punctuation
ValueCountFrequency (%)
) 136
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 71
100.0%
Letter Number
ValueCountFrequency (%)
3
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3550 | 52.6%
Common | 3192 | 47.3%
Latin | 12 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
230
 
6.5%
230
 
6.5%
217
 
6.1%
216
 
6.1%
204
 
5.7%
201
 
5.7%
201
 
5.7%
186
 
5.2%
112
 
3.2%
100
 
2.8%
Other values (183) 1653
46.6%
Common
ValueCountFrequency (%)
1163
36.4%
1 342
 
10.7%
0 224
 
7.0%
2 193
 
6.0%
, 182
 
5.7%
3 165
 
5.2%
( 136
 
4.3%
) 136
 
4.3%
5 118
 
3.7%
4 112
 
3.5%
Other values (6) 421
 
13.2%
Latin
ValueCountFrequency (%)
A 3
25.0%
3
25.0%
S 2
16.7%
T 1
 
8.3%
P 1
 
8.3%
G 1
 
8.3%
e 1
 
8.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3550 | 52.6%
ASCII | 3201 | 47.4%
Number Forms | 3 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1163
36.3%
1 342
 
10.7%
0 224
 
7.0%
2 193
 
6.0%
, 182
 
5.7%
3 165
 
5.2%
( 136
 
4.2%
) 136
 
4.2%
5 118
 
3.7%
4 112
 
3.5%
Other values (12) 430
 
13.4%
Hangul
ValueCountFrequency (%)
230
 
6.5%
230
 
6.5%
217
 
6.1%
216
 
6.1%
204
 
5.7%
201
 
5.7%
201
 
5.7%
186
 
5.2%
112
 
3.2%
100
 
2.8%
Other values (183) 1653
46.6%
Number Forms
ValueCountFrequency (%)
3
100.0%

업종
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB
출판사: 164
인쇄사: 37

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 출판사
2nd row: 출판사
3rd row: 출판사
4th row: 출판사
5th row: 출판사

Common Values

Value | Count | Frequency (%)
출판사 | 164 | 81.6%
인쇄사 | 37 | 18.4%
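The frequency percentages in this table are each count divided by the 201 observations, rounded to one decimal place. A minimal sketch using the counts reported above (the variable names are illustrative):

```python
# Counts of each 업종 value as reported above.
counts = {"출판사": 164, "인쇄사": 37}

total = sum(counts.values())

# Share of each category as a percentage of all observations.
shares = {label: round(100 * n / total, 1) for label, n in counts.items()}

# Distinct (%) is the number of distinct values over the number of observations.
distinct_pct = round(100 * len(counts) / total, 1)

print(total, shares, distinct_pct)
```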

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
출판사 | 164 | 81.6%
인쇄사 | 37 | 18.4%

관리기관명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB
양주시 문화관광과: 201

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 양주시 문화관광과
2nd row: 양주시 문화관광과
3rd row: 양주시 문화관광과
4th row: 양주시 문화관광과
5th row: 양주시 문화관광과

Common Values

Value | Count | Frequency (%)
양주시 문화관광과 | 201 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
양주시 | 201 | 50.0%
문화관광과 | 201 | 50.0%

관리기관 전화번호
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB
031-8082-5654: 201

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 031-8082-5654
2nd row: 031-8082-5654
3rd row: 031-8082-5654
4th row: 031-8082-5654
5th row: 031-8082-5654

Common Values

Value | Count | Frequency (%)
031-8082-5654 | 201 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
031-8082-5654 | 201 | 100.0%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB
Minimum: 2023-12-29 00:00:00
Maximum: 2023-12-29 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

 | 연번 | 업종
연번 | 1.000 | 0.997
업종 | 0.997 | 1.000

 | 연번 | 업종
연번 | 1.000 | 0.935
업종 | 0.935 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

 | 연번 | 사업체명칭 | 사업체소재지(도로명) | 업종 | 관리기관명 | 관리기관 전화번호 | 데이터기준일자
0 | 1 | 예일 | 경기도 양주시 만송동 356-3 | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
1 | 2 | 나한 | 경기도 양주시 장흥면 석굴암길 519 | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
2 | 3 | 홍익가족 | 경기도 양주시 장흥면 호국로 176-16 | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
3 | 4 | 아소비출판사 | 경기도 양주시 광사동 240-3 | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
4 | 5 | (주)에듀비타 | 경기도 양주시 장흥면 가마골로147번길 53 | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
5 | 6 | 광문사 | 경기도 양주시 장흥면 권율로309번길 202 | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
6 | 7 | 중앙아카데미 | 경기도 양주시 장흥면 가마골로147번길 53 | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
7 | 8 | 도서출판 필룩스 | 경기도 양주시 광적면 광적로 235-48 | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
8 | 9 | ABBA Communication | 경기도 양주시 장흥면 호국로473번길 11-12, 101동 505호 (우리마을APT) | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
9 | 10 | 한국문화사 | 경기도 양주시 옥정동 280-8 | 출판사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29

Last rows

 | 연번 | 사업체명칭 | 사업체소재지(도로명) | 업종 | 관리기관명 | 관리기관 전화번호 | 데이터기준일자
191 | 192 | 재성인쇄 | 경기도 양주시 평화로 1444 (덕계동) | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
192 | 193 | 양주타일 애드스토리 | 경기도 양주시 광적면 부흥로 877 | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
193 | 194 | (주)미래융합전략연구소 | 경기도 양주시 평화로1233번길 38-6 (산북동) | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
194 | 195 | 드림북 | 경기도 양주시 광적면 부흥로 847, 양주테크노시티 422호 | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
195 | 196 | 윈프로 | 경기도 양주시 부흥로 1936, 다온프라자 405호 (광사동) | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
196 | 197 | 성광피앤피 | 경기도 양주시 백석읍 권율로1398번길 165-15 | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
197 | 198 | 배다201 | 경기도 양주시 부흥로 1932, 5층 201호 (광사동) | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
198 | 199 | 체리슈 | 경기도 양주시 부흥로 1533, 별동 3층 창업사무실(양주시청년센터) (남방동) | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
199 | 200 | 해피라인 | 경기도 양주시 평화로1416번길 58-1, 2층 (덕계동) | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29
200 | 201 | 주식회사 인터니즈 | 경기도 양주시 화합로1710번길 12, 양주옥정 듀클래스Ⅰ 258호 (옥정동) | 인쇄사 | 양주시 문화관광과 | 031-8082-5654 | 2023-12-29