Overview

Dataset statistics

Number of variables6
Number of observations129
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.4 KiB
Average record size in memory51.0 B

Variable types

Numeric2
Categorical1
Text2
DateTime1

Dataset

Description경기도 고양시 정기간행물 등록 현황에 대한 데이터로 연번, 시군구, 등록번호, 제호(간행물명), 발행인, 등록일자 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15118893/fileData.do

Alerts

시군구 has constant value ""Constant
연번 is highly overall correlated with 등록번호High correlation
등록번호 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
등록번호 has unique valuesUnique
제호 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:36:00.598338
Analysis finished2023-12-11 23:36:01.528275
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct129
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65
Minimum1
Maximum129
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T08:36:01.672783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.4
Q133
median65
Q397
95-th percentile122.6
Maximum129
Range128
Interquartile range (IQR)64

Descriptive statistics

Standard deviation37.383151
Coefficient of variation (CV)0.5751254
Kurtosis-1.2
Mean65
Median Absolute Deviation (MAD)32
Skewness0
Sum8385
Variance1397.5
MonotonicityStrictly increasing
2023-12-12T08:36:01.829443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
98 1
 
0.8%
96 1
 
0.8%
95 1
 
0.8%
94 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
91 1
 
0.8%
90 1
 
0.8%
89 1
 
0.8%
Other values (119) 119
92.2%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
129 1
0.8%
128 1
0.8%
127 1
0.8%
126 1
0.8%
125 1
0.8%
124 1
0.8%
123 1
0.8%
122 1
0.8%
121 1
0.8%
120 1
0.8%

시군구
Categorical

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
고양시
129 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고양시
2nd row고양시
3rd row고양시
4th row고양시
5th row고양시

Common Values

ValueCountFrequency (%)
고양시 129
100.0%

Length

2023-12-12T08:36:02.011856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:36:02.116215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고양시 129
100.0%

등록번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct129
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65
Minimum1
Maximum129
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T08:36:02.248506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.4
Q133
median65
Q397
95-th percentile122.6
Maximum129
Range128
Interquartile range (IQR)64

Descriptive statistics

Standard deviation37.383151
Coefficient of variation (CV)0.5751254
Kurtosis-1.2
Mean65
Median Absolute Deviation (MAD)32
Skewness0
Sum8385
Variance1397.5
MonotonicityStrictly decreasing
2023-12-12T08:36:02.441491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
129 1
 
0.8%
32 1
 
0.8%
34 1
 
0.8%
35 1
 
0.8%
36 1
 
0.8%
37 1
 
0.8%
38 1
 
0.8%
39 1
 
0.8%
40 1
 
0.8%
41 1
 
0.8%
Other values (119) 119
92.2%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
129 1
0.8%
128 1
0.8%
127 1
0.8%
126 1
0.8%
125 1
0.8%
124 1
0.8%
123 1
0.8%
122 1
0.8%
121 1
0.8%
120 1
0.8%

제호
Text

UNIQUE 

Distinct129
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T08:36:02.810657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length19
Mean length8.7054264
Min length2

Characters and Unicode

Total characters1123
Distinct characters285
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique129 ?
Unique (%)100.0%

Sample

1st row월간 국민 필라테스
2nd row예수바라기
3rd row아토포스
4th row문화정원
5th rowSayaka
ValueCountFrequency (%)
월간 8
 
3.8%
media 3
 
1.4%
매거진 2
 
1.0%
파란 2
 
1.0%
위즈보우 2
 
1.0%
플래너 2
 
1.0%
news 2
 
1.0%
플러스 2
 
1.0%
리포트 2
 
1.0%
sayaka 2
 
1.0%
Other values (182) 182
87.1%
2023-12-12T08:36:03.335467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
80
 
7.1%
e 26
 
2.3%
23
 
2.0%
19
 
1.7%
a 19
 
1.7%
) 17
 
1.5%
( 17
 
1.5%
C 16
 
1.4%
E 15
 
1.3%
i 14
 
1.2%
Other values (275) 877
78.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 652
58.1%
Lowercase Letter 177
 
15.8%
Uppercase Letter 154
 
13.7%
Space Separator 80
 
7.1%
Close Punctuation 17
 
1.5%
Open Punctuation 17
 
1.5%
Other Punctuation 13
 
1.2%
Decimal Number 13
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
3.5%
19
 
2.9%
14
 
2.1%
13
 
2.0%
13
 
2.0%
13
 
2.0%
13
 
2.0%
13
 
2.0%
11
 
1.7%
9
 
1.4%
Other values (216) 511
78.4%
Lowercase Letter
ValueCountFrequency (%)
e 26
14.7%
a 19
 
10.7%
i 14
 
7.9%
o 14
 
7.9%
s 10
 
5.6%
n 10
 
5.6%
u 9
 
5.1%
r 9
 
5.1%
c 9
 
5.1%
t 8
 
4.5%
Other values (13) 49
27.7%
Uppercase Letter
ValueCountFrequency (%)
C 16
 
10.4%
E 15
 
9.7%
R 11
 
7.1%
A 11
 
7.1%
I 11
 
7.1%
S 11
 
7.1%
O 10
 
6.5%
P 9
 
5.8%
M 8
 
5.2%
L 7
 
4.5%
Other values (12) 45
29.2%
Decimal Number
ValueCountFrequency (%)
0 3
23.1%
5 2
15.4%
3 2
15.4%
6 2
15.4%
1 2
15.4%
2 1
 
7.7%
4 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
. 6
46.2%
& 4
30.8%
· 2
 
15.4%
, 1
 
7.7%
Space Separator
ValueCountFrequency (%)
80
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 652
58.1%
Latin 331
29.5%
Common 140
 
12.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
3.5%
19
 
2.9%
14
 
2.1%
13
 
2.0%
13
 
2.0%
13
 
2.0%
13
 
2.0%
13
 
2.0%
11
 
1.7%
9
 
1.4%
Other values (216) 511
78.4%
Latin
ValueCountFrequency (%)
e 26
 
7.9%
a 19
 
5.7%
C 16
 
4.8%
E 15
 
4.5%
i 14
 
4.2%
o 14
 
4.2%
R 11
 
3.3%
A 11
 
3.3%
I 11
 
3.3%
S 11
 
3.3%
Other values (35) 183
55.3%
Common
ValueCountFrequency (%)
80
57.1%
) 17
 
12.1%
( 17
 
12.1%
. 6
 
4.3%
& 4
 
2.9%
0 3
 
2.1%
5 2
 
1.4%
· 2
 
1.4%
3 2
 
1.4%
6 2
 
1.4%
Other values (4) 5
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 652
58.1%
ASCII 469
41.8%
None 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
80
 
17.1%
e 26
 
5.5%
a 19
 
4.1%
) 17
 
3.6%
( 17
 
3.6%
C 16
 
3.4%
E 15
 
3.2%
i 14
 
3.0%
o 14
 
3.0%
R 11
 
2.3%
Other values (48) 240
51.2%
Hangul
ValueCountFrequency (%)
23
 
3.5%
19
 
2.9%
14
 
2.1%
13
 
2.0%
13
 
2.0%
13
 
2.0%
13
 
2.0%
13
 
2.0%
11
 
1.7%
9
 
1.4%
Other values (216) 511
78.4%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct111
Distinct (%)86.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-12T08:36:03.709765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.0775194
Min length2

Characters and Unicode

Total characters397
Distinct characters114
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)75.2%

Sample

1st row황종선
2nd row예수바라기(예바)
3rd row문근식
4th row김혜성
5th row김금산
ValueCountFrequency (%)
한종수 4
 
3.1%
성애리 3
 
2.3%
박지형 3
 
2.3%
김수미 2
 
1.6%
안세희 2
 
1.6%
임경호 2
 
1.6%
김금산 2
 
1.6%
안덕찬 2
 
1.6%
신동우 2
 
1.6%
박병건 2
 
1.6%
Other values (101) 105
81.4%
2023-12-12T08:36:04.593648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
 
5.3%
20
 
5.0%
14
 
3.5%
13
 
3.3%
13
 
3.3%
10
 
2.5%
9
 
2.3%
9
 
2.3%
8
 
2.0%
8
 
2.0%
Other values (104) 272
68.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 393
99.0%
Open Punctuation 2
 
0.5%
Close Punctuation 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
5.3%
20
 
5.1%
14
 
3.6%
13
 
3.3%
13
 
3.3%
10
 
2.5%
9
 
2.3%
9
 
2.3%
8
 
2.0%
8
 
2.0%
Other values (102) 268
68.2%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 393
99.0%
Common 4
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
5.3%
20
 
5.1%
14
 
3.6%
13
 
3.3%
13
 
3.3%
10
 
2.5%
9
 
2.3%
9
 
2.3%
8
 
2.0%
8
 
2.0%
Other values (102) 268
68.2%
Common
ValueCountFrequency (%)
( 2
50.0%
) 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 393
99.0%
ASCII 4
 
1.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
21
 
5.3%
20
 
5.1%
14
 
3.6%
13
 
3.3%
13
 
3.3%
10
 
2.5%
9
 
2.3%
9
 
2.3%
8
 
2.0%
8
 
2.0%
Other values (102) 268
68.2%
ASCII
ValueCountFrequency (%)
( 2
50.0%
) 2
50.0%
Distinct121
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum1988-05-28 00:00:00
Maximum2023-12-11 00:00:00
2023-12-12T08:36:04.770196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:36:04.917298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T08:36:01.067797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:36:00.874029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:36:01.177022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:36:00.974766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:36:05.034352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록번호
연번1.0001.000
등록번호1.0001.000
2023-12-12T08:36:05.116091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록번호
연번1.000-1.000
등록번호-1.0001.000

Missing values

2023-12-12T08:36:01.327127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:36:01.483335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시군구등록번호제호발행인등록일자
01고양시129월간 국민 필라테스황종선2023-04-28
12고양시128예수바라기예수바라기(예바)2023-02-01
23고양시127아토포스문근식2022-11-01
34고양시126문화정원김혜성2022-05-17
45고양시125Sayaka김금산2022-05-03
56고양시124Sayaka 한국어판김금산2022-05-03
67고양시123한국노인복지장기요양안덕찬2022-05-03
78고양시122춘추대한문학이향희2022-03-08
89고양시121과학기술과 사회조영남2021-12-14
910고양시120경기도 주민자치회안덕찬2021-11-24
연번시군구등록번호제호발행인등록일자
119120고양시10피에이한종수2000-12-29
120121고양시9A.P.C뉴스은효진1999-10-20
121122고양시8사닥다리김송삼1998-12-18
122123고양시7온세상위하여최홍석1998-05-01
123124고양시6여럿이함께신유나1997-09-04
124125고양시5일하는제자들박현민1993-04-13
125126고양시4(주)고양.파주 벼룩시장양선일1993-03-05
126127고양시3고양.파주 알림방최상호1992-04-17
127128고양시2공동체(Community News)이희영1990-05-24
128129고양시1역사비평정순구1988-05-28