Overview

Dataset statistics

Number of variables5
Number of observations176
Missing cells370
Missing cells (%)42.0%
Duplicate rows1
Duplicate rows (%)0.6%
Total size in memory7.7 KiB
Average record size in memory44.8 B

Variable types

DateTime1
Numeric2
Unsupported2

Dataset

Description충청남도 홈페이지의 회원가입현황으로써 월별 가입자 수 데이터 입니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=377&beforeMenuCd=DOM_000000201001001000&publicdatapk=15063379

Alerts

Dataset has 1 (0.6%) duplicate rowsDuplicates
가입건수 is highly overall correlated with 누적가입자수 High correlation
누적가입자수 is highly overall correlated with 가입건수High correlation
년월 has 6 (3.4%) missing valuesMissing
가입건수 has 6 (3.4%) missing valuesMissing
누적가입자수 has 6 (3.4%) missing valuesMissing
Unnamed: 3 has 176 (100.0%) missing valuesMissing
Unnamed: 4 has 176 (100.0%) missing valuesMissing
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-01-09 20:05:23.587985
Analysis finished2024-01-09 20:05:25.140648
Duration1.55 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년월
Date

MISSING 

Distinct170
Distinct (%)100.0%
Missing6
Missing (%)3.4%
Memory size1.5 KiB
Minimum2008-07-31 00:00:00
Maximum2022-08-31 00:00:00
2024-01-10T05:05:25.237764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:05:25.405943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

가입건수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct114
Distinct (%)67.1%
Missing6
Missing (%)3.4%
Infinite0
Infinite (%)0.0%
Mean109.91765
Minimum1
Maximum764
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-01-10T05:05:25.573365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.45
Q122
median88.5
Q3165
95-th percentile306
Maximum764
Range763
Interquartile range (IQR)143

Descriptive statistics

Standard deviation109.28993
Coefficient of variation (CV)0.99428922
Kurtosis6.8310811
Mean109.91765
Median Absolute Deviation (MAD)67.5
Skewness1.8952389
Sum18686
Variance11944.289
MonotonicityNot monotonic
2024-01-10T05:05:25.766878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17 6
 
3.4%
1 5
 
2.8%
9 5
 
2.8%
21 5
 
2.8%
25 4
 
2.3%
33 4
 
2.3%
143 3
 
1.7%
12 3
 
1.7%
19 3
 
1.7%
22 3
 
1.7%
Other values (104) 129
73.3%
(Missing) 6
 
3.4%
ValueCountFrequency (%)
1 5
2.8%
4 1
 
0.6%
5 1
 
0.6%
6 1
 
0.6%
7 1
 
0.6%
8 1
 
0.6%
9 5
2.8%
10 1
 
0.6%
11 1
 
0.6%
12 3
1.7%
ValueCountFrequency (%)
764 1
0.6%
438 1
0.6%
395 1
0.6%
357 1
0.6%
339 1
0.6%
324 1
0.6%
316 1
0.6%
309 1
0.6%
306 2
1.1%
300 1
0.6%

누적가입자수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct170
Distinct (%)100.0%
Missing6
Missing (%)3.4%
Infinite0
Infinite (%)0.0%
Mean5781.1824
Minimum1
Maximum18686
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-01-10T05:05:25.944857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile882.05
Q11676.25
median3316
Q38970.25
95-th percentile16229.7
Maximum18686
Range18685
Interquartile range (IQR)7294

Descriptive statistics

Standard deviation5141.2779
Coefficient of variation (CV)0.88931254
Kurtosis-0.32690213
Mean5781.1824
Median Absolute Deviation (MAD)2197.5
Skewness0.95362179
Sum982801
Variance26432739
MonotonicityStrictly decreasing
2024-01-10T05:05:26.139841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1901 1
 
0.6%
2187 1
 
0.6%
2138 1
 
0.6%
2109 1
 
0.6%
2080 1
 
0.6%
1992 1
 
0.6%
1958 1
 
0.6%
1933 1
 
0.6%
1920 1
 
0.6%
1884 1
 
0.6%
Other values (160) 160
90.9%
(Missing) 6
 
3.4%
ValueCountFrequency (%)
1 1
0.6%
765 1
0.6%
766 1
0.6%
767 1
0.6%
768 1
0.6%
769 1
0.6%
774 1
0.6%
783 1
0.6%
842 1
0.6%
931 1
0.6%
ValueCountFrequency (%)
18686 1
0.6%
18472 1
0.6%
18220 1
0.6%
17939 1
0.6%
17501 1
0.6%
17195 1
0.6%
16838 1
0.6%
16522 1
0.6%
16326 1
0.6%
16112 1
0.6%

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing176
Missing (%)100.0%
Memory size1.7 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing176
Missing (%)100.0%
Memory size1.7 KiB

Interactions

2024-01-10T05:05:24.467643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:05:24.178320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:05:24.615304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:05:24.306054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:05:26.271216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가입건수누적가입자수
가입건수1.0000.704
누적가입자수0.7041.000
2024-01-10T05:05:26.415771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가입건수누적가입자수
가입건수1.0000.850
누적가입자수0.8501.000

Missing values

2024-01-10T05:05:24.785630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:05:24.916202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T05:05:25.059727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

년월가입건수누적가입자수Unnamed: 3Unnamed: 4
02022-08-3121418686<NA><NA>
12022-07-3125218472<NA><NA>
22022-06-3028118220<NA><NA>
32022-05-3143817939<NA><NA>
42022-04-3030617501<NA><NA>
52022-03-3135717195<NA><NA>
62022-02-2831616838<NA><NA>
72022-01-3119616522<NA><NA>
82021-12-3121416326<NA><NA>
92021-11-3023616112<NA><NA>
년월가입건수누적가입자수Unnamed: 3Unnamed: 4
1662008-10-311767<NA><NA>
1672008-09-301766<NA><NA>
1682008-08-31764765<NA><NA>
1692008-07-3111<NA><NA>
170<NA><NA><NA><NA><NA>
171<NA><NA><NA><NA><NA>
172<NA><NA><NA><NA><NA>
173<NA><NA><NA><NA><NA>
174<NA><NA><NA><NA><NA>
175<NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

년월가입건수누적가입자수# duplicates
0<NA><NA><NA>6