Overview

Dataset statistics

Number of variables5
Number of observations1500
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory64.6 KiB
Average record size in memory44.1 B

Variable types

Categorical2
Numeric3

Dataset

Description처리일자,청소년유해업소업종코드,청소년유해업소업종명,건수
Author중구
URLhttps://data.seoul.go.kr/dataList/OA-10193/S/1/datasetView.do

Alerts

추천시도코드 has constant value ""Constant
청소년유해업소업종코드 is highly overall correlated with 청소년유해업소업종명High correlation
건수 is highly overall correlated with 청소년유해업소업종명High correlation
청소년유해업소업종명 is highly overall correlated with 청소년유해업소업종코드 and 1 other fieldsHigh correlation

Reproduction

Analysis started2024-05-11 04:47:01.818611
Analysis finished2024-05-11 04:47:06.557303
Duration4.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

추천시도코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.8 KiB
3010000
1500 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3010000
2nd row3010000
3rd row3010000
4th row3010000
5th row3010000

Common Values

ValueCountFrequency (%)
3010000 1500
100.0%

Length

2024-05-11T04:47:06.924816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T04:47:07.355713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3010000 1500
100.0%

처리일자
Real number (ℝ)

Distinct30
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20240449
Minimum20240411
Maximum20240510
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2024-05-11T04:47:07.768918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20240411
5-th percentile20240412
Q120240418
median20240426
Q320240503
95-th percentile20240509
Maximum20240510
Range99
Interquartile range (IQR)85

Descriptive statistics

Standard deviation40.392573
Coefficient of variation (CV)1.9956362 × 10-6
Kurtosis-1.4858409
Mean20240449
Median Absolute Deviation (MAD)10
Skewness0.67011251
Sum3.0360673 × 1010
Variance1631.5599
MonotonicityDecreasing
2024-05-11T04:47:08.362521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
20240510 50
 
3.3%
20240424 50
 
3.3%
20240411 50
 
3.3%
20240412 50
 
3.3%
20240413 50
 
3.3%
20240414 50
 
3.3%
20240415 50
 
3.3%
20240416 50
 
3.3%
20240417 50
 
3.3%
20240418 50
 
3.3%
Other values (20) 1000
66.7%
ValueCountFrequency (%)
20240411 50
3.3%
20240412 50
3.3%
20240413 50
3.3%
20240414 50
3.3%
20240415 50
3.3%
20240416 50
3.3%
20240417 50
3.3%
20240418 50
3.3%
20240419 50
3.3%
20240420 50
3.3%
ValueCountFrequency (%)
20240510 50
3.3%
20240509 50
3.3%
20240508 50
3.3%
20240507 50
3.3%
20240506 50
3.3%
20240505 50
3.3%
20240504 50
3.3%
20240503 50
3.3%
20240502 50
3.3%
20240501 50
3.3%

청소년유해업소업종코드
Real number (ℝ)

HIGH CORRELATION 

Distinct50
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13326.9
Minimum10101
Maximum30111
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2024-05-11T04:47:09.064538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10101
5-th percentile10103
Q110114
median10254.5
Q310499
95-th percentile24205
Maximum30111
Range20010
Interquartile range (IQR)385

Descriptive statistics

Standard deviation5795.0163
Coefficient of variation (CV)0.43483603
Kurtosis1.0968539
Mean13326.9
Median Absolute Deviation (MAD)149
Skewness1.5759076
Sum19990350
Variance33582214
MonotonicityNot monotonic
2024-05-11T04:47:09.873764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10301 30
 
2.0%
10113 30
 
2.0%
10206 30
 
2.0%
10205 30
 
2.0%
10201 30
 
2.0%
10199 30
 
2.0%
10119 30
 
2.0%
10118 30
 
2.0%
10117 30
 
2.0%
10116 30
 
2.0%
Other values (40) 1200
80.0%
ValueCountFrequency (%)
10101 30
2.0%
10102 30
2.0%
10103 30
2.0%
10104 30
2.0%
10105 30
2.0%
10106 30
2.0%
10107 30
2.0%
10108 30
2.0%
10110 30
2.0%
10111 30
2.0%
ValueCountFrequency (%)
30111 30
2.0%
30110 30
2.0%
24205 30
2.0%
24201 30
2.0%
24113 30
2.0%
24101 30
2.0%
20301 30
2.0%
20199 30
2.0%
20107 30
2.0%
20105 30
2.0%

청소년유해업소업종명
Categorical

HIGH CORRELATION 

Distinct45
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size11.8 KiB
기타
 
90
패스트푸드
 
60
전통찻집
 
60
관광호텔
 
60
여인숙업
 
30
Other values (40)
1200 

Length

Max length11
Median length7.5
Mean length4.16
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row단란주점
2nd row무도학원업
3rd row무도장업
4th row노래연습장업
5th row비디오물감상실업

Common Values

ValueCountFrequency (%)
기타 90
 
6.0%
패스트푸드 60
 
4.0%
전통찻집 60
 
4.0%
관광호텔 60
 
4.0%
여인숙업 30
 
2.0%
간이주점 30
 
2.0%
무도장업 30
 
2.0%
노래연습장업 30
 
2.0%
비디오물감상실업 30
 
2.0%
일반게임제공업 30
 
2.0%
Other values (35) 1050
70.0%

Length

2024-05-11T04:47:10.317651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
기타 120
 
7.8%
전통찻집 60
 
3.9%
관광호텔 60
 
3.9%
패스트푸드 60
 
3.9%
김밥(도시락 30
 
2.0%
경양식 30
 
2.0%
비어(바)살롱 30
 
2.0%
극장식당 30
 
2.0%
카바레 30
 
2.0%
탕류 30
 
2.0%
Other values (35) 1050
68.6%

건수
Real number (ℝ)

HIGH CORRELATION 

Distinct92
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean173.30733
Minimum1
Maximum2659
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.3 KiB
2024-05-11T04:47:10.790087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q17
median45
Q3133
95-th percentile877
Maximum2659
Range2658
Interquartile range (IQR)126

Descriptive statistics

Standard deviation407.94223
Coefficient of variation (CV)2.353866
Kurtosis24.944734
Mean173.30733
Median Absolute Deviation (MAD)41
Skewness4.7307112
Sum259961
Variance166416.86
MonotonicityNot monotonic
2024-05-11T04:47:11.432436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 92
 
6.1%
45 60
 
4.0%
8 60
 
4.0%
2 60
 
4.0%
14 60
 
4.0%
1 60
 
4.0%
3 60
 
4.0%
7 60
 
4.0%
5 58
 
3.9%
84 36
 
2.4%
Other values (82) 894
59.6%
ValueCountFrequency (%)
1 60
4.0%
2 60
4.0%
3 60
4.0%
4 92
6.1%
5 58
3.9%
6 30
 
2.0%
7 60
4.0%
8 60
4.0%
10 30
 
2.0%
11 30
 
2.0%
ValueCountFrequency (%)
2659 2
 
0.1%
2658 1
 
0.1%
2657 5
0.3%
2655 6
0.4%
2654 5
0.3%
2653 6
0.4%
2652 3
0.2%
2651 2
 
0.1%
930 4
0.3%
929 2
 
0.1%

Interactions

2024-05-11T04:47:04.609720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T04:47:02.154347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T04:47:03.316825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T04:47:04.977736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T04:47:02.516331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T04:47:03.724345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T04:47:05.511883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T04:47:02.923865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T04:47:04.171818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T04:47:11.703317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리일자청소년유해업소업종코드청소년유해업소업종명건수
처리일자1.0000.0000.0000.000
청소년유해업소업종코드0.0001.0000.9980.285
청소년유해업소업종명0.0000.9981.0000.990
건수0.0000.2850.9901.000
2024-05-11T04:47:11.968358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리일자청소년유해업소업종코드건수청소년유해업소업종명
처리일자1.0000.0000.0000.000
청소년유해업소업종코드0.0001.000-0.2020.968
건수0.000-0.2021.0000.925
청소년유해업소업종명0.0000.9680.9251.000

Missing values

2024-05-11T04:47:06.075990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T04:47:06.424959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

추천시도코드처리일자청소년유해업소업종코드청소년유해업소업종명건수
030100002024051010301단란주점60
130100002024051030111무도학원업14
230100002024051030110무도장업2
330100002024051024205노래연습장업64
430100002024051024201비디오물감상실업1
530100002024051024113일반게임제공업5
630100002024051024101게임제공업3
730100002024051020301일반이용업69
830100002024051020199숙박업 기타22
930100002024051020107여인숙업11
추천시도코드처리일자청소년유해업소업종코드청소년유해업소업종명건수
149030100002024041120101관광호텔87
149130100002024041120102일반호텔62
149230100002024041120105여관업134
149330100002024041130110무도장업2
149430100002024041120107여인숙업11
149530100002024041120199숙박업 기타21
149630100002024041120301일반이용업70
149730100002024041124101게임제공업3
149830100002024041124113일반게임제공업5
149930100002024041124201비디오물감상실업1