Overview

Dataset statistics

Number of variables6
Number of observations500
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.5 KiB
Average record size in memory50.3 B

Variable types

Numeric2
Categorical3
Text1

Dataset

Description샘플 데이터
Author다음소프트
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=57

Reproduction

Analysis started2023-12-10 14:54:22.182263
Analysis finished2023-12-10 14:54:23.011172
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

DOC_DATE(DATE)
Real number (ℝ)

Distinct395
Distinct (%)79.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20181134
Minimum20170103
Maximum20191226
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-10T23:54:23.084261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20170103
5-th percentile20170304
Q120170883
median20180621
Q320190426
95-th percentile20191101
Maximum20191226
Range21123
Interquartile range (IQR)19542.25

Descriptive statistics

Standard deviation8302.3993
Coefficient of variation (CV)0.0004113941
Kurtosis-1.5306408
Mean20181134
Median Absolute Deviation (MAD)9800.5
Skewness-0.090393198
Sum1.0090567 × 1010
Variance68929835
MonotonicityNot monotonic
2023-12-10T23:54:23.211230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20170716 5
 
1.0%
20190118 3
 
0.6%
20190715 3
 
0.6%
20191111 3
 
0.6%
20180524 3
 
0.6%
20170620 3
 
0.6%
20170326 3
 
0.6%
20181217 3
 
0.6%
20190222 3
 
0.6%
20181107 3
 
0.6%
Other values (385) 468
93.6%
ValueCountFrequency (%)
20170103 2
0.4%
20170108 1
0.2%
20170109 1
0.2%
20170111 1
0.2%
20170112 1
0.2%
20170116 2
0.4%
20170119 1
0.2%
20170123 1
0.2%
20170128 2
0.4%
20170203 1
0.2%
ValueCountFrequency (%)
20191226 2
0.4%
20191224 1
0.2%
20191219 1
0.2%
20191215 1
0.2%
20191214 1
0.2%
20191213 2
0.4%
20191212 1
0.2%
20191206 1
0.2%
20191205 1
0.2%
20191204 2
0.4%
Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
커뮤니티블로그
379 
트위터
121 

Length

Max length7
Median length7
Mean length6.032
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row커뮤니티블로그
2nd row커뮤니티블로그
3rd row트위터
4th row커뮤니티블로그
5th row커뮤니티블로그

Common Values

ValueCountFrequency (%)
커뮤니티블로그 379
75.8%
트위터 121
 
24.2%

Length

2023-12-10T23:54:23.351819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T23:54:23.448766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
커뮤니티블로그 379
75.8%
트위터 121
 
24.2%
Distinct275
Distinct (%)55.0%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-10T23:54:23.719376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length3.164
Min length2

Characters and Unicode

Total characters1582
Distinct characters213
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique176 ?
Unique (%)35.2%

Sample

1st row국립현대미술관
2nd row역삼역
3rd row해양박물관
4th row석파정서울미술관
5th row마포
ValueCountFrequency (%)
이태원 13
 
2.6%
강남 10
 
2.0%
한남동 8
 
1.6%
서울 8
 
1.6%
합정 7
 
1.4%
마포구 7
 
1.4%
여의도 7
 
1.4%
강변 6
 
1.2%
홍대 6
 
1.2%
익선동 5
 
1.0%
Other values (265) 423
84.6%
2023-12-10T23:54:24.160248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
115
 
7.3%
94
 
5.9%
61
 
3.9%
43
 
2.7%
38
 
2.4%
37
 
2.3%
32
 
2.0%
32
 
2.0%
30
 
1.9%
26
 
1.6%
Other values (203) 1074
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1551
98.0%
Lowercase Letter 28
 
1.8%
Decimal Number 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
 
7.4%
94
 
6.1%
61
 
3.9%
43
 
2.8%
38
 
2.5%
37
 
2.4%
32
 
2.1%
32
 
2.1%
30
 
1.9%
26
 
1.7%
Other values (194) 1043
67.2%
Lowercase Letter
ValueCountFrequency (%)
c 8
28.6%
v 5
17.9%
g 5
17.9%
i 4
14.3%
n 2
 
7.1%
f 2
 
7.1%
p 1
 
3.6%
k 1
 
3.6%
Decimal Number
ValueCountFrequency (%)
3 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1551
98.0%
Latin 28
 
1.8%
Common 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
 
7.4%
94
 
6.1%
61
 
3.9%
43
 
2.8%
38
 
2.5%
37
 
2.4%
32
 
2.1%
32
 
2.1%
30
 
1.9%
26
 
1.7%
Other values (194) 1043
67.2%
Latin
ValueCountFrequency (%)
c 8
28.6%
v 5
17.9%
g 5
17.9%
i 4
14.3%
n 2
 
7.1%
f 2
 
7.1%
p 1
 
3.6%
k 1
 
3.6%
Common
ValueCountFrequency (%)
3 3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1551
98.0%
ASCII 31
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
115
 
7.4%
94
 
6.1%
61
 
3.9%
43
 
2.8%
38
 
2.5%
37
 
2.4%
32
 
2.1%
32
 
2.1%
30
 
1.9%
26
 
1.7%
Other values (194) 1043
67.2%
ASCII
ValueCountFrequency (%)
c 8
25.8%
v 5
16.1%
g 5
16.1%
i 4
12.9%
3 3
 
9.7%
n 2
 
6.5%
f 2
 
6.5%
p 1
 
3.2%
k 1
 
3.2%

행정구(GU_NM)
Categorical

Distinct25
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
강남구
73 
종로구
64 
마포구
48 
용산구
47 
송파구
36 
Other values (20)
232 

Length

Max length4
Median length3
Mean length2.978
Min length2

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row종로구
2nd row강남구
3rd row서초구
4th row광진구
5th row관악구

Common Values

ValueCountFrequency (%)
강남구 73
14.6%
종로구 64
12.8%
마포구 48
 
9.6%
용산구 47
 
9.4%
송파구 36
 
7.2%
중구 32
 
6.4%
관악구 22
 
4.4%
광진구 21
 
4.2%
서초구 18
 
3.6%
성동구 17
 
3.4%
Other values (15) 122
24.4%

Length

2023-12-10T23:54:24.359668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강남구 73
14.6%
종로구 64
12.8%
마포구 48
 
9.6%
용산구 47
 
9.4%
송파구 36
 
7.2%
중구 32
 
6.4%
관악구 22
 
4.4%
광진구 21
 
4.2%
서초구 18
 
3.6%
성동구 17
 
3.4%
Other values (15) 122
24.4%
Distinct34
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
미식
124 
분위기
58 
비쥬얼
33 
인스타그램
25 
입맛
 
21
Other values (29)
239 

Length

Max length6
Median length5
Mean length2.804
Min length2

Unique

Unique3 ?
Unique (%)0.6%

Sample

1st row분위기
2nd row미식
3rd row백종원
4th row식감
5th row존맛

Common Values

ValueCountFrequency (%)
미식 124
24.8%
분위기 58
 
11.6%
비쥬얼 33
 
6.6%
인스타그램 25
 
5.0%
입맛 21
 
4.2%
인테리어 20
 
4.0%
재료 19
 
3.8%
이색메뉴 19
 
3.8%
감성 17
 
3.4%
식감 16
 
3.2%
Other values (24) 148
29.6%

Length

2023-12-10T23:54:24.507445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미식 124
24.8%
분위기 58
 
11.6%
비쥬얼 33
 
6.6%
인스타그램 25
 
5.0%
입맛 21
 
4.2%
인테리어 20
 
4.0%
재료 19
 
3.8%
이색메뉴 19
 
3.8%
감성 17
 
3.4%
식감 16
 
3.2%
Other values (24) 148
29.6%

FREQ(FREQ)
Real number (ℝ)

Distinct18
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.128
Minimum1
Maximum44
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-10T23:54:24.637729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile6
Maximum44
Range43
Interquartile range (IQR)1

Descriptive statistics

Standard deviation4.0817812
Coefficient of variation (CV)1.9181303
Kurtosis63.567244
Mean2.128
Median Absolute Deviation (MAD)0
Skewness7.4000844
Sum1064
Variance16.660938
MonotonicityNot monotonic
2023-12-10T23:54:24.777500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1 350
70.0%
2 72
 
14.4%
3 27
 
5.4%
4 16
 
3.2%
6 12
 
2.4%
5 7
 
1.4%
7 3
 
0.6%
14 2
 
0.4%
17 2
 
0.4%
12 1
 
0.2%
Other values (8) 8
 
1.6%
ValueCountFrequency (%)
1 350
70.0%
2 72
 
14.4%
3 27
 
5.4%
4 16
 
3.2%
5 7
 
1.4%
6 12
 
2.4%
7 3
 
0.6%
9 1
 
0.2%
11 1
 
0.2%
12 1
 
0.2%
ValueCountFrequency (%)
44 1
0.2%
42 1
0.2%
41 1
0.2%
30 1
0.2%
27 1
0.2%
19 1
0.2%
17 2
0.4%
14 2
0.4%
12 1
0.2%
11 1
0.2%

Interactions

2023-12-10T23:54:22.670945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:54:22.511197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:54:22.765079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:54:22.593098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:54:24.889547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
DOC_DATE(DATE)수집소스(SOURCE)행정구(GU_NM)세부키워드(KEYWORD_DETAIL)FREQ(FREQ)
DOC_DATE(DATE)1.0000.0860.1280.2630.000
수집소스(SOURCE)0.0861.0000.1940.0700.092
행정구(GU_NM)0.1280.1941.0000.2730.046
세부키워드(KEYWORD_DETAIL)0.2630.0700.2731.0000.000
FREQ(FREQ)0.0000.0920.0460.0001.000
2023-12-10T23:54:25.023885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세부키워드(KEYWORD_DETAIL)수집소스(SOURCE)행정구(GU_NM)
세부키워드(KEYWORD_DETAIL)1.0000.0530.066
수집소스(SOURCE)0.0531.0000.164
행정구(GU_NM)0.0660.1641.000
2023-12-10T23:54:25.132848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
DOC_DATE(DATE)FREQ(FREQ)수집소스(SOURCE)행정구(GU_NM)세부키워드(KEYWORD_DETAIL)
DOC_DATE(DATE)1.000-0.0300.0560.0740.132
FREQ(FREQ)-0.0301.0000.0970.0160.000
수집소스(SOURCE)0.0560.0971.0000.1640.053
행정구(GU_NM)0.0740.0160.1641.0000.066
세부키워드(KEYWORD_DETAIL)0.1320.0000.0530.0661.000

Missing values

2023-12-10T23:54:22.867551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:54:22.967043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

DOC_DATE(DATE)수집소스(SOURCE)행정동(DONG_NM)행정구(GU_NM)세부키워드(KEYWORD_DETAIL)FREQ(FREQ)
020170622커뮤니티블로그국립현대미술관종로구분위기1
120180305커뮤니티블로그역삼역강남구미식2
220190702트위터해양박물관서초구백종원1
320180518커뮤니티블로그석파정서울미술관광진구식감1
420190518커뮤니티블로그마포관악구존맛1
520170402커뮤니티블로그종각송파구사르르2
620190616커뮤니티블로그용산마포구입맛44
720190118커뮤니티블로그남부터미널강남구미식1
820191008커뮤니티블로그이태원강남구포토존1
920191007트위터홍대입구역강남구인테리어2
DOC_DATE(DATE)수집소스(SOURCE)행정동(DONG_NM)행정구(GU_NM)세부키워드(KEYWORD_DETAIL)FREQ(FREQ)
49020170716트위터강남중구입맛6
49120171126커뮤니티블로그디큐브시티종로구미식1
49220190909커뮤니티블로그이촌동종로구갬성3
49320190301커뮤니티블로그종각역마포구분위기1
49420190112트위터성수용산구입맛7
49520170203커뮤니티블로그용산구종로구미식1
49620180113커뮤니티블로그가로수길강남구sns1
49720190723커뮤니티블로그여의도강남구이색메뉴2
49820171108커뮤니티블로그역사박물관관악구미식1
49920181227커뮤니티블로그화곡동종로구미식1