Overview

Dataset statistics

Number of variables3
Number of observations945
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.1 KiB
Average record size in memory26.1 B

Variable types

Numeric2
Text1

Dataset

Description강원도 홍천군_미디어관리시스템_태그정보
Author강원도 홍천군
URLhttps://www.data.go.kr/data/15072421/fileData.do

Alerts

연번 is highly overall correlated with 집계일자High correlation
집계일자 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
태그 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:22:16.472970
Analysis finished2023-12-12 16:22:17.414799
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct945
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean473
Minimum1
Maximum945
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2023-12-13T01:22:17.510695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48.2
Q1237
median473
Q3709
95-th percentile897.8
Maximum945
Range944
Interquartile range (IQR)472

Descriptive statistics

Standard deviation272.9423
Coefficient of variation (CV)0.57704504
Kurtosis-1.2
Mean473
Median Absolute Deviation (MAD)236
Skewness0
Sum446985
Variance74497.5
MonotonicityStrictly increasing
2023-12-13T01:22:17.677373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
636 1
 
0.1%
624 1
 
0.1%
625 1
 
0.1%
626 1
 
0.1%
627 1
 
0.1%
628 1
 
0.1%
629 1
 
0.1%
630 1
 
0.1%
631 1
 
0.1%
Other values (935) 935
98.9%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
945 1
0.1%
944 1
0.1%
943 1
0.1%
942 1
0.1%
941 1
0.1%
940 1
0.1%
939 1
0.1%
938 1
0.1%
937 1
0.1%
936 1
0.1%

집계일자
Real number (ℝ)

HIGH CORRELATION 

Distinct77
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20200393
Minimum20200102
Maximum20200803
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.4 KiB
2023-12-13T01:22:17.824323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20200102
5-th percentile20200107
Q120200210
median20200413
Q320200602
95-th percentile20200722
Maximum20200803
Range701
Interquartile range (IQR)392

Descriptive statistics

Standard deviation211.71446
Coefficient of variation (CV)1.048071 × 10-5
Kurtosis-1.2837133
Mean20200393
Median Absolute Deviation (MAD)199
Skewness0.10503429
Sum1.9089372 × 1010
Variance44823.011
MonotonicityIncreasing
2023-12-13T01:22:18.015718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20200414 38
 
4.0%
20200113 35
 
3.7%
20200616 29
 
3.1%
20200102 28
 
3.0%
20200519 27
 
2.9%
20200513 26
 
2.8%
20200324 24
 
2.5%
20200722 24
 
2.5%
20200713 23
 
2.4%
20200501 22
 
2.3%
Other values (67) 669
70.8%
ValueCountFrequency (%)
20200102 28
3.0%
20200107 22
2.3%
20200108 8
 
0.8%
20200113 35
3.7%
20200114 8
 
0.8%
20200115 20
2.1%
20200116 11
 
1.2%
20200117 12
 
1.3%
20200120 13
 
1.4%
20200121 13
 
1.4%
ValueCountFrequency (%)
20200803 13
1.4%
20200730 6
 
0.6%
20200728 11
1.2%
20200727 6
 
0.6%
20200722 24
2.5%
20200713 23
2.4%
20200708 9
 
1.0%
20200705 12
1.3%
20200702 15
1.6%
20200628 10
1.1%

태그
Text

UNIQUE 

Distinct945
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size7.5 KiB
2023-12-13T01:22:18.375629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length6.5534392
Min length1

Characters and Unicode

Total characters6193
Distinct characters461
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique945 ?
Unique (%)100.0%

Sample

1st row가리산 휴양림 위탁관리 체결식
2nd row산림조합
3rd row가리산
4th row무궁화장학금
5th row 산림조합
ValueCountFrequency (%)
코로나 17
 
1.2%
19 14
 
1.0%
협약식 12
 
0.8%
마스크 12
 
0.8%
방문 10
 
0.7%
전달 9
 
0.6%
코로나19 8
 
0.6%
이웃돕기 8
 
0.6%
성금 8
 
0.6%
상품권 8
 
0.6%
Other values (916) 1345
92.7%
2023-12-13T01:22:18.857702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
860
 
13.9%
159
 
2.6%
106
 
1.7%
91
 
1.5%
91
 
1.5%
86
 
1.4%
78
 
1.3%
77
 
1.2%
69
 
1.1%
66
 
1.1%
Other values (451) 4510
72.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5135
82.9%
Space Separator 860
 
13.9%
Decimal Number 141
 
2.3%
Uppercase Letter 25
 
0.4%
Lowercase Letter 22
 
0.4%
Dash Punctuation 5
 
0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
159
 
3.1%
106
 
2.1%
91
 
1.8%
91
 
1.8%
86
 
1.7%
78
 
1.5%
77
 
1.5%
69
 
1.3%
66
 
1.3%
64
 
1.2%
Other values (414) 4248
82.7%
Uppercase Letter
ValueCountFrequency (%)
K 4
16.0%
S 3
12.0%
T 3
12.0%
C 3
12.0%
F 2
8.0%
B 2
8.0%
H 2
8.0%
R 1
 
4.0%
A 1
 
4.0%
V 1
 
4.0%
Other values (3) 3
12.0%
Decimal Number
ValueCountFrequency (%)
0 36
25.5%
1 35
24.8%
9 23
16.3%
2 18
12.8%
5 9
 
6.4%
8 5
 
3.5%
3 4
 
2.8%
6 4
 
2.8%
7 4
 
2.8%
4 3
 
2.1%
Lowercase Letter
ValueCountFrequency (%)
s 4
18.2%
t 4
18.2%
g 3
13.6%
r 2
9.1%
y 2
9.1%
a 2
9.1%
o 2
9.1%
n 2
9.1%
k 1
 
4.5%
Space Separator
ValueCountFrequency (%)
860
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5135
82.9%
Common 1011
 
16.3%
Latin 47
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
159
 
3.1%
106
 
2.1%
91
 
1.8%
91
 
1.8%
86
 
1.7%
78
 
1.5%
77
 
1.5%
69
 
1.3%
66
 
1.3%
64
 
1.2%
Other values (414) 4248
82.7%
Latin
ValueCountFrequency (%)
s 4
 
8.5%
t 4
 
8.5%
K 4
 
8.5%
S 3
 
6.4%
T 3
 
6.4%
g 3
 
6.4%
C 3
 
6.4%
F 2
 
4.3%
r 2
 
4.3%
y 2
 
4.3%
Other values (12) 17
36.2%
Common
ValueCountFrequency (%)
860
85.1%
0 36
 
3.6%
1 35
 
3.5%
9 23
 
2.3%
2 18
 
1.8%
5 9
 
0.9%
- 5
 
0.5%
8 5
 
0.5%
3 4
 
0.4%
6 4
 
0.4%
Other values (5) 12
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5135
82.9%
ASCII 1058
 
17.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
860
81.3%
0 36
 
3.4%
1 35
 
3.3%
9 23
 
2.2%
2 18
 
1.7%
5 9
 
0.9%
- 5
 
0.5%
8 5
 
0.5%
s 4
 
0.4%
t 4
 
0.4%
Other values (27) 59
 
5.6%
Hangul
ValueCountFrequency (%)
159
 
3.1%
106
 
2.1%
91
 
1.8%
91
 
1.8%
86
 
1.7%
78
 
1.5%
77
 
1.5%
69
 
1.3%
66
 
1.3%
64
 
1.2%
Other values (414) 4248
82.7%

Interactions

2023-12-13T01:22:17.016481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:16.800418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:17.139346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:16.891325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:22:18.956064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번집계일자
연번1.0000.978
집계일자0.9781.000
2023-12-13T01:22:19.046161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번집계일자
연번1.0001.000
집계일자1.0001.000

Missing values

2023-12-13T01:22:17.287570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:22:17.376056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번집계일자태그
0120200102가리산 휴양림 위탁관리 체결식
1220200102산림조합
2320200102가리산
3420200102무궁화장학금
4520200102산림조합
5620200102문화재단 종무식
6720200102문화재단
7820200102종무식
8920200102종무식
91020200102홍천군청
연번집계일자태그
93593620200803양수발전
93693720200803동아리
93793820200803한땀한땀
93893920200803진달래로터리클럽
93994020200803파크골프 먼지털이기 설치 방문
94094120200803파크골프
94194220200803노일리 코스모단지 밀밭
94294320200803노일리
94394420200803코스모스
94494520200803밀밭