Overview

Dataset statistics

Number of variables8
Number of observations213
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.9 KiB
Average record size in memory66.6 B

Variable types

Categorical4
Text1
DateTime1
Numeric2

Dataset

Description제주특별자치도 서귀포시 관내 출판인쇄업 현황에 관한 데이터로 구분, 업체명, 신고일자, 소재지 등 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15056352/fileData.do

Alerts

영업상태 has constant value ""Constant
데이터기준일자 has constant value ""Constant
경도 is highly overall correlated with 소재지High correlation
소재지 is highly overall correlated with 경도High correlation

Reproduction

Analysis started2023-12-12 23:25:17.972903
Analysis finished2023-12-12 23:25:18.982434
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
출판사
179 
인쇄사
34 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row출판사
2nd row출판사
3rd row출판사
4th row출판사
5th row출판사

Common Values

ValueCountFrequency (%)
출판사 179
84.0%
인쇄사 34
 
16.0%

Length

2023-12-13T08:25:19.041041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:25:19.152974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
출판사 179
84.0%
인쇄사 34
 
16.0%
Distinct189
Distinct (%)88.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T08:25:19.482262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length19
Mean length6.5868545
Min length1

Characters and Unicode

Total characters1403
Distinct characters323
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique167 ?
Unique (%)78.4%

Sample

1st row유경문화인쇄사
2nd row도서출판서울문화사
3rd row에스케이핀크스 주식회사
4th row가시아히
5th row도서출판 제라헌
ValueCountFrequency (%)
도서출판 11
 
3.9%
주식회사 8
 
2.8%
한국hsg 3
 
1.1%
스튜디오 3
 
1.1%
마주보기 3
 
1.1%
재주상회 3
 
1.1%
오디콤 3
 
1.1%
books 2
 
0.7%
출판사 2
 
0.7%
곶자왈 2
 
0.7%
Other values (217) 241
85.8%
2023-12-13T08:25:19.973202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
68
 
4.8%
37
 
2.6%
37
 
2.6%
36
 
2.6%
34
 
2.4%
30
 
2.1%
25
 
1.8%
25
 
1.8%
) 24
 
1.7%
24
 
1.7%
Other values (313) 1063
75.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1073
76.5%
Uppercase Letter 121
 
8.6%
Lowercase Letter 89
 
6.3%
Space Separator 68
 
4.8%
Close Punctuation 24
 
1.7%
Open Punctuation 24
 
1.7%
Other Punctuation 3
 
0.2%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
 
3.4%
37
 
3.4%
36
 
3.4%
34
 
3.2%
30
 
2.8%
25
 
2.3%
25
 
2.3%
24
 
2.2%
21
 
2.0%
20
 
1.9%
Other values (261) 784
73.1%
Uppercase Letter
ValueCountFrequency (%)
E 14
 
11.6%
S 11
 
9.1%
M 10
 
8.3%
O 8
 
6.6%
N 8
 
6.6%
A 7
 
5.8%
R 7
 
5.8%
G 7
 
5.8%
U 6
 
5.0%
I 5
 
4.1%
Other values (14) 38
31.4%
Lowercase Letter
ValueCountFrequency (%)
o 11
 
12.4%
e 9
 
10.1%
s 7
 
7.9%
r 7
 
7.9%
i 6
 
6.7%
n 5
 
5.6%
t 4
 
4.5%
u 4
 
4.5%
d 4
 
4.5%
l 4
 
4.5%
Other values (12) 28
31.5%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Space Separator
ValueCountFrequency (%)
68
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1069
76.2%
Latin 210
 
15.0%
Common 120
 
8.6%
Han 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
 
3.5%
37
 
3.5%
36
 
3.4%
34
 
3.2%
30
 
2.8%
25
 
2.3%
25
 
2.3%
24
 
2.2%
21
 
2.0%
20
 
1.9%
Other values (257) 780
73.0%
Latin
ValueCountFrequency (%)
E 14
 
6.7%
o 11
 
5.2%
S 11
 
5.2%
M 10
 
4.8%
e 9
 
4.3%
O 8
 
3.8%
N 8
 
3.8%
A 7
 
3.3%
s 7
 
3.3%
R 7
 
3.3%
Other values (36) 118
56.2%
Common
ValueCountFrequency (%)
68
56.7%
) 24
 
20.0%
( 24
 
20.0%
. 2
 
1.7%
& 1
 
0.8%
- 1
 
0.8%
Han
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1069
76.2%
ASCII 330
 
23.5%
CJK 4
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
68
20.6%
) 24
 
7.3%
( 24
 
7.3%
E 14
 
4.2%
o 11
 
3.3%
S 11
 
3.3%
M 10
 
3.0%
e 9
 
2.7%
O 8
 
2.4%
N 8
 
2.4%
Other values (42) 143
43.3%
Hangul
ValueCountFrequency (%)
37
 
3.5%
37
 
3.5%
36
 
3.4%
34
 
3.2%
30
 
2.8%
25
 
2.3%
25
 
2.3%
24
 
2.2%
21
 
2.0%
20
 
1.9%
Other values (257) 780
73.0%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct194
Distinct (%)91.1%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
Minimum1977-02-05 00:00:00
Maximum2022-08-05 00:00:00
2023-12-13T08:25:20.103105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:25:20.224931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

소재지
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
제주특별자치도 서귀포시 동홍동
30 
제주특별자치도 서귀포시 대정읍
29 
제주특별자치도 서귀포시 안덕면
23 
제주특별자치도 서귀포시 서귀동
22 
제주특별자치도 서귀포시 표선면
13 
Other values (17)
96 

Length

Max length16
Median length16
Mean length16
Min length16

Unique

Unique2 ?
Unique (%)0.9%

Sample

1st row제주특별자치도 서귀포시 서귀동
2nd row제주특별자치도 서귀포시 서귀동
3rd row제주특별자치도 서귀포시 안덕면
4th row제주특별자치도 서귀포시 대정읍
5th row제주특별자치도 서귀포시 서홍동

Common Values

ValueCountFrequency (%)
제주특별자치도 서귀포시 동홍동 30
14.1%
제주특별자치도 서귀포시 대정읍 29
13.6%
제주특별자치도 서귀포시 안덕면 23
10.8%
제주특별자치도 서귀포시 서귀동 22
10.3%
제주특별자치도 서귀포시 표선면 13
 
6.1%
제주특별자치도 서귀포시 서호동 12
 
5.6%
제주특별자치도 서귀포시 남원읍 12
 
5.6%
제주특별자치도 서귀포시 성산읍 11
 
5.2%
제주특별자치도 서귀포시 중문동 11
 
5.2%
제주특별자치도 서귀포시 서홍동 10
 
4.7%
Other values (12) 40
18.8%

Length

2023-12-13T08:25:20.425437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제주특별자치도 213
33.3%
서귀포시 213
33.3%
동홍동 30
 
4.7%
대정읍 29
 
4.5%
안덕면 23
 
3.6%
서귀동 22
 
3.4%
표선면 13
 
2.0%
서호동 12
 
1.9%
남원읍 12
 
1.9%
성산읍 11
 
1.7%
Other values (14) 61
 
9.5%

영업상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
영업중
213 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 213
100.0%

Length

2023-12-13T08:25:20.551087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:25:20.637123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 213
100.0%

위도
Real number (ℝ)

Distinct166
Distinct (%)77.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.272948
Minimum33.214041
Maximum33.450982
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-13T08:25:20.732265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33.214041
5-th percentile33.233064
Q133.251369
median33.256306
Q333.278054
95-th percentile33.359992
Maximum33.450982
Range0.2369413
Interquartile range (IQR)0.026685

Descriptive statistics

Standard deviation0.043585956
Coefficient of variation (CV)0.0013099517
Kurtosis5.9784949
Mean33.272948
Median Absolute Deviation (MAD)0.00814387
Skewness2.3993566
Sum7087.138
Variance0.0018997355
MonotonicityNot monotonic
2023-12-13T08:25:20.861973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33.25254932 6
 
2.8%
33.28795197 4
 
1.9%
33.2790421 4
 
1.9%
33.34113366 4
 
1.9%
33.26062081 4
 
1.9%
33.25365162 3
 
1.4%
33.25490166 3
 
1.4%
33.25558176 3
 
1.4%
33.25259872 3
 
1.4%
33.25422173 3
 
1.4%
Other values (156) 176
82.6%
ValueCountFrequency (%)
33.2140409 1
0.5%
33.22819103 1
0.5%
33.22831213 1
0.5%
33.22843162 1
0.5%
33.23055599 1
0.5%
33.23109048 1
0.5%
33.23119767 1
0.5%
33.23141378 1
0.5%
33.23153716 2
0.9%
33.23206558 1
0.5%
ValueCountFrequency (%)
33.4509822 1
0.5%
33.44723293 1
0.5%
33.4461279 1
0.5%
33.4450987 1
0.5%
33.43731038 1
0.5%
33.43615675 1
0.5%
33.41371766 1
0.5%
33.41266125 1
0.5%
33.38836209 1
0.5%
33.37850568 1
0.5%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct166
Distinct (%)77.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.51489
Minimum126.17152
Maximum126.91795
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-13T08:25:20.978833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.17152
5-th percentile126.2733
Q1126.38915
median126.52423
Q3126.57039
95-th percentile126.84054
Maximum126.91795
Range0.7464247
Interquartile range (IQR)0.1812416

Descriptive statistics

Standard deviation0.17175316
Coefficient of variation (CV)0.0013575727
Kurtosis-0.18871831
Mean126.51489
Median Absolute Deviation (MAD)0.0927493
Skewness0.33323032
Sum26947.671
Variance0.029499148
MonotonicityNot monotonic
2023-12-13T08:25:21.373023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.5186144 6
 
2.8%
126.281324 4
 
1.9%
126.2743237 4
 
1.9%
126.8142501 4
 
1.9%
126.5672113 4
 
1.9%
126.4314815 3
 
1.4%
126.4294805 3
 
1.4%
126.5105274 3
 
1.4%
126.5673617 3
 
1.4%
126.5611312 3
 
1.4%
Other values (156) 176
82.6%
ValueCountFrequency (%)
126.1715205 2
0.9%
126.1902533 1
0.5%
126.235423 1
0.5%
126.2423651 1
0.5%
126.2531226 1
0.5%
126.2592064 1
0.5%
126.2632412 2
0.9%
126.2703511 1
0.5%
126.2719082 1
0.5%
126.2742291 1
0.5%
ValueCountFrequency (%)
126.9179452 1
0.5%
126.9157673 1
0.5%
126.9143928 1
0.5%
126.9102547 1
0.5%
126.9076043 1
0.5%
126.8969399 1
0.5%
126.8877948 1
0.5%
126.8866227 1
0.5%
126.8740077 1
0.5%
126.8492464 1
0.5%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-08-28
213 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-28
2nd row2023-08-28
3rd row2023-08-28
4th row2023-08-28
5th row2023-08-28

Common Values

ValueCountFrequency (%)
2023-08-28 213
100.0%

Length

2023-12-13T08:25:21.510274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:25:21.595426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-28 213
100.0%

Interactions

2023-12-13T08:25:18.508873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:25:18.318871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:25:18.617415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:25:18.411336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:25:21.660355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분소재지위도경도
구분1.0000.6140.2350.552
소재지0.6141.0000.8150.962
위도0.2350.8151.0000.890
경도0.5520.9620.8901.000
2023-12-13T08:25:21.739106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지구분
소재지1.0000.467
구분0.4671.000
2023-12-13T08:25:21.837114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도구분소재지
위도1.0000.3550.1760.457
경도0.3551.0000.4170.783
구분0.1760.4171.0000.467
소재지0.4570.7830.4671.000

Missing values

2023-12-13T08:25:18.792600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:25:18.921835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분업체명신고일자소재지영업상태위도경도데이터기준일자
0출판사유경문화인쇄사1996-05-14제주특별자치도 서귀포시 서귀동영업중33.249019126.568632023-08-28
1출판사도서출판서울문화사1998-06-01제주특별자치도 서귀포시 서귀동영업중33.249882126.5585352023-08-28
2출판사에스케이핀크스 주식회사1999-04-08제주특별자치도 서귀포시 안덕면영업중33.30536126.3936852023-08-28
3출판사가시아히2000-10-20제주특별자치도 서귀포시 대정읍영업중33.265093126.2354232023-08-28
4출판사도서출판 제라헌2001-07-25제주특별자치도 서귀포시 서홍동영업중33.264802126.5564932023-08-28
5출판사월간베스트 제주2003-08-08제주특별자치도 서귀포시 서귀동영업중33.249238126.5650872023-08-28
6출판사그린피스2004-03-10제주특별자치도 서귀포시 표선면영업중33.355234126.7748322023-08-28
7출판사오디콤2005-11-14제주특별자치도 서귀포시 서홍동영업중33.254469126.5605442023-08-28
8출판사나그네길2005-09-05제주특별자치도 서귀포시 안덕면영업중33.307829126.3818742023-08-28
9출판사도서출판돌의나라2005-11-29제주특별자치도 서귀포시 안덕면영업중33.283673126.3230952023-08-28
구분업체명신고일자소재지영업상태위도경도데이터기준일자
203인쇄사(주)에스엠산업2017-08-11제주특별자치도 서귀포시 대포동영업중33.261757126.4506412023-08-28
204인쇄사오디콤2017-11-27제주특별자치도 서귀포시 서홍동영업중33.254469126.5605442023-08-28
205인쇄사남도기획디자인2017-12-11제주특별자치도 서귀포시 동홍동영업중33.260621126.5672112023-08-28
206인쇄사신세계광고2018-06-07제주특별자치도 서귀포시 토평동영업중33.256298126.581132023-08-28
207인쇄사제주광고인쇄타운2019-03-11제주특별자치도 서귀포시 하예동영업중33.243819126.3891512023-08-28
208인쇄사끌로드2019-07-29제주특별자치도 서귀포시 안덕면영업중33.301386126.3175252023-08-28
209인쇄사한국HSG2019-08-16제주특별자치도 서귀포시 동홍동영업중33.254222126.5611312023-08-28
210인쇄사아정기획2019-09-24제주특별자치도 서귀포시 동홍동영업중33.258898126.5715162023-08-28
211인쇄사한국HSG2019-12-16제주특별자치도 서귀포시 동홍동영업중33.254222126.5611312023-08-28
212인쇄사아르떼2020-01-23제주특별자치도 서귀포시 동홍동영업중33.252599126.5673622023-08-28