Overview

Dataset statistics

Number of variables: 18
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 763
Duplicate rows (%): 7.6%
Total size in memory: 1.6 MiB
Average record size in memory: 163.0 B
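
As a quick consistency check on the two memory figures, the average record size multiplied by the number of observations should land near the reported total. A minimal Python sketch, using only the rounded values shown above (the variable names are illustrative):

```python
# Values copied from the dataset statistics above (both are rounded in the report).
n_observations = 10_000
avg_record_bytes = 163.0

# 1 MiB = 2**20 bytes.
total_mib = n_observations * avg_record_bytes / 2**20
print(f"{total_mib:.2f} MiB")
```

The result is slightly below 1.6 MiB, which is what one would expect if the reported total was rounded to one decimal place.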

Variable types

Categorical: 12
Text: 1
Numeric: 5

Dataset

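Description: Counseling-record data for clients of the National Smoking Cessation Support Service (국가금연지원서비스), listing the nicotine replacement aids dispensed, such as nicotine patches, gum, and candy. Covers public health centers and regional smoking cessation support centers nationwide.
Author: 한국건강증진개발원 (Korea Health Promotion Institute)
URL: https://www.data.go.kr/data/15092410/fileData.do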

Alerts

껌1mg_지급갯수 has constant value "" [Constant]
껌3mg_지급갯수 has constant value "" [Constant]
껌5mg_지급갯수 has constant value "" [Constant]
껌6mg_지급갯수 has constant value "" [Constant]
캔디2mg_지급갯수 has constant value "" [Constant]
Dataset has 763 (7.6%) duplicate rows [Duplicates]
등록유형 is highly overall correlated with 기관유형 and 1 other field [High correlation]
서비스구분 is highly overall correlated with 기관유형 and 1 other field [High correlation]
기관유형 is highly overall correlated with 서비스구분 and 1 other field [High correlation]
기관유형 is highly imbalanced (72.0%) [Imbalance]
서비스구분 is highly imbalanced (80.0%) [Imbalance]
등록유형 is highly imbalanced (75.2%) [Imbalance]
껌4mg_지급갯수 is highly imbalanced (99.7%) [Imbalance]
패치1단계_지급갯수 has 6396 (64.0%) zeros [Zeros]
패치2단계_지급갯수 has 5099 (51.0%) zeros [Zeros]
패치3단계_지급갯수 has 6474 (64.7%) zeros [Zeros]
껌2mg_지급갯수 has 6702 (67.0%) zeros [Zeros]
캔디1mg_지급갯수 has 8143 (81.4%) zeros [Zeros]
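
The imbalance percentages in these alerts appear consistent with a normalized-entropy score: 1 minus the Shannon entropy of the class frequencies, divided by the maximum entropy possible for that number of classes. The sketch below is an assumption about how the figure can be derived, not a statement of the profiler's implementation. The function name `imbalance_score` is illustrative, and the counts are copied from the variable sections of this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/H_max for a list of class counts (0 = balanced, 1 = one class only)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 1.0  # assumed convention for a single class
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts as listed under each variable in this report.
examples = {
    "기관유형": [9514, 486],
    "서비스구분": [9514, 379, 107],
    "껌4mg_지급갯수": [9998, 2],
}
for name, counts in examples.items():
    print(name, f"{imbalance_score(counts):.1%}")
```

Under this assumption the three variables come out at 72.0%, 80.0% and 99.7%, matching the alert values. 등록유형 is left out because two of its twelve category counts are only reported in aggregate.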

Reproduction

Analysis started: 2023-12-12 13:01:25.854973
Analysis finished: 2023-12-12 13:01:30.357029
Duration: 4.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

기관유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
보건소
9514 
금연지원센터
 
486

Length

Max length6
Median length3
Mean length3.1458
Min length3
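
The length statistics above follow directly from the category counts, since the variable has only two distinct values. A small sketch of that arithmetic, assuming lengths are counted in Unicode code points (the variable names are illustrative):

```python
# Category counts for 기관유형, copied from this section.
counts = {"보건소": 9514, "금연지원센터": 486}

total = sum(counts.values())
# len() counts code points, so each precomposed Hangul syllable counts as 1.
mean_length = sum(len(value) * n for value, n in counts.items()) / total
min_length = min(len(value) for value in counts)
max_length = max(len(value) for value in counts)
print(min_length, max_length, round(mean_length, 4))
```

With 9514 values of length 3 and 486 of length 6, the weighted mean is 31458 / 10000 = 3.1458, in line with the reported figure.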

Unique

Unique0
Unique (%)0.0%

Sample

1st row보건소
2nd row보건소
3rd row보건소
4th row보건소
5th row보건소

Common Values

ValueCountFrequency (%)
보건소 9514
95.1%
금연지원센터 486
 
4.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
보건소 9514
95.1%
금연지원센터 486
 
4.9%

지역
Categorical

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도
2375 
서울특별시
1314 
경상남도
694 
경상북도
669 
전라남도
596 
Other values (12)
4352 

Length

Max length7
Median length5
Mean length4.1598
Min length3

Unique

Unique0
Unique (%)0.0%

Sample

1st row세종특별자치시
2nd row경상북도
3rd row서울특별시
4th row광주광역시
5th row서울특별시

Common Values

ValueCountFrequency (%)
경기도 2375
23.8%
서울특별시 1314
13.1%
경상남도 694
 
6.9%
경상북도 669
 
6.7%
전라남도 596
 
6.0%
부산광역시 571
 
5.7%
대구광역시 489
 
4.9%
충청남도 473
 
4.7%
강원도 456
 
4.6%
충청북도 450
 
4.5%
Other values (7) 1913
19.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
경기도 2375
23.8%
서울특별시 1314
13.1%
경상남도 694
 
6.9%
경상북도 669
 
6.7%
전라남도 596
 
6.0%
부산광역시 571
 
5.7%
대구광역시 489
 
4.9%
충청남도 473
 
4.7%
강원도 456
 
4.6%
충청북도 450
 
4.5%
Other values (7) 1913
19.1%

서비스구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
보건소 금연클리닉
9514 
찾아가는 금연서비스
 
379
단기금연캠프
 
107

Length

Max length10
Median length9
Mean length9.0058
Min length6

Unique

Unique0
Unique (%)0.0%

Sample

1st row보건소 금연클리닉
2nd row보건소 금연클리닉
3rd row보건소 금연클리닉
4th row보건소 금연클리닉
5th row보건소 금연클리닉

Common Values

ValueCountFrequency (%)
보건소 금연클리닉 9514
95.1%
찾아가는 금연서비스 379
 
3.8%
단기금연캠프 107
 
1.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

ValueCountFrequency (%)
보건소 9514
47.8%
금연클리닉 9514
47.8%
찾아가는 379
 
1.9%
금연서비스 379
 
1.9%
단기금연캠프 107
 
0.5%

제공기관
Text

Distinct275
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB

Length

Max length14
Median length9
Mean length9.5768
Min length8

Characters and Unicode

Total characters95768
Distinct characters153
Distinct categories2
Distinct scripts2
Distinct blocks2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
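
As an illustration of the character properties mentioned above, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories over the five sample values listed for this variable in the report; it is a minimal example, not the profiler's own code.

```python
import unicodedata
from collections import Counter

# The five sample values listed for this variable in the report.
samples = [
    "세종특별자치시보건소",
    "경북 칠곡군보건소",
    "서울 도봉구보건소",
    "광주 서구보건소",
    "서울 영등포구보건소",
]

# unicodedata.category returns the two-letter general category,
# e.g. "Lo" (Other Letter) for Hangul syllables and "Zs" (Space Separator).
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)
print(category_counts)
```

Only two categories show up, "Lo" and "Zs", which corresponds to the two distinct categories (Other Letter and Space Separator) reported for the full column.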

Unique

Unique0
Unique (%)0.0%

Sample

1st row세종특별자치시보건소
2nd row경북 칠곡군보건소
3rd row서울 도봉구보건소
4th row광주 서구보건소
5th row서울 영등포구보건소
ValueCountFrequency (%)
경기 2273
 
10.7%
서울 1285
 
6.0%
경남 649
 
3.0%
경북 642
 
3.0%
전남 581
 
2.7%
부산 554
 
2.6%
충남 463
 
2.2%
대구 463
 
2.2%
충북 422
 
2.0%
강원 415
 
1.9%
Other values (273) 13595
63.7%

Most occurring characters

ValueCountFrequency (%)
11342
 
11.8%
9563
 
10.0%
9514
 
9.9%
9354
 
9.8%
4962
 
5.2%
4527
 
4.7%
3855
 
4.0%
2638
 
2.8%
2485
 
2.6%
2213
 
2.3%
Other values (143) 35315
36.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 84426
88.2%
Space Separator 11342
 
11.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9563
 
11.3%
9514
 
11.3%
9354
 
11.1%
4962
 
5.9%
4527
 
5.4%
3855
 
4.6%
2638
 
3.1%
2485
 
2.9%
2213
 
2.6%
1996
 
2.4%
Other values (142) 33319
39.5%
Space Separator
ValueCountFrequency (%)
11342
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 84426
88.2%
Common 11342
 
11.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9563
 
11.3%
9514
 
11.3%
9354
 
11.1%
4962
 
5.9%
4527
 
5.4%
3855
 
4.6%
2638
 
3.1%
2485
 
2.9%
2213
 
2.6%
1996
 
2.4%
Other values (142) 33319
39.5%
Common
ValueCountFrequency (%)
11342
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 84426
88.2%
ASCII 11342
 
11.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11342
100.0%
Hangul
ValueCountFrequency (%)
9563
 
11.3%
9514
 
11.3%
9354
 
11.1%
4962
 
5.9%
4527
 
5.4%
3855
 
4.6%
2638
 
3.1%
2485
 
2.9%
2213
 
2.6%
1996
 
2.4%
Other values (142) 33319
39.5%

출생년도
Categorical

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1970~1979
2231 
1960~1969
2166 
1980~1989
1816 
1990~1999
1542 
1950~1959
1470 
Other values (3)
775 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0
Unique (%)0.0%

Sample

1st row1980~1989
2nd row2000~2009
3rd row1950~1959
4th row1940~1949
5th row1950~1959

Common Values

ValueCountFrequency (%)
1970~1979 2231
22.3%
1960~1969 2166
21.7%
1980~1989 1816
18.2%
1990~1999 1542
15.4%
1950~1959 1470
14.7%
1940~1949 423
 
4.2%
2000~2009 324
 
3.2%
1930~1939 28
 
0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
1970~1979 2231
22.3%
1960~1969 2166
21.7%
1980~1989 1816
18.2%
1990~1999 1542
15.4%
1950~1959 1470
14.7%
1940~1949 423
 
4.2%
2000~2009 324
 
3.2%
1930~1939 28
 
0.3%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
8785 
1215 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
8785
87.8%
1215
 
12.2%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
8785
87.8%
1215
 
12.2%

등록유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
보건소
8517 
출장
967 
소규모 사업장
 
173
여성
 
78
입원환자
 
59
Other values (7)
 
206

Length

Max length7
Median length3
Mean length2.9792
Min length2

Unique

Unique0
Unique (%)0.0%

Sample

1st row보건소
2nd row출장
3rd row보건소
4th row보건소
5th row보건소

Common Values

ValueCountFrequency (%)
보건소 8517
85.2%
출장 967
 
9.7%
소규모 사업장 173
 
1.7%
여성 78
 
0.8%
입원환자 59
 
0.6%
치료형 48
 
0.5%
지역자율 44
 
0.4%
장애인 40
 
0.4%
저소득층 40
 
0.4%
캠페인 30
 
0.3%
Other values (2) 4
 
< 0.1%

Length

Histogram of lengths of the category

Words

ValueCountFrequency (%)
보건소 8517
83.7%
출장 967
 
9.5%
소규모 173
 
1.7%
사업장 173
 
1.7%
여성 78
 
0.8%
입원환자 59
 
0.6%
치료형 48
 
0.5%
지역자율 44
 
0.4%
장애인 40
 
0.4%
저소득층 40
 
0.4%
Other values (3) 34
 
0.3%

패치1단계_지급갯수
Real number (ℝ)

ZEROS 

Distinct30
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.4741
Minimum0
Maximum84
Zeros6396
Zeros (%)64.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q314
95-th percentile28
Maximum84
Range84
Interquartile range (IQR)14

Descriptive statistics

Standard deviation10.691172
Coefficient of variation (CV)1.6513758
Kurtosis5.7545626
Mean6.4741
Median Absolute Deviation (MAD)0
Skewness2.0746876
Sum64741
Variance114.30116
MonotonicityNot monotonic
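
The descriptive statistics above are related to each other by simple identities, which makes them easy to cross-check. A short sketch using only figures reported in this section (the variable names are illustrative, and small differences are expected because the inputs are rounded):

```python
# Figures reported for 패치1단계_지급갯수 in this section.
n = 10_000
mean = 6.4741
std = 10.691172

total = mean * n      # compare with the reported Sum (64741)
variance = std ** 2   # compare with the reported Variance (114.30116)
cv = std / mean       # compare with the reported CV (1.6513758)

print(round(total), round(variance, 5), round(cv, 5))
```

The same three identities (sum = mean × n, variance = standard deviation squared, CV = standard deviation / mean) can be applied to the other numeric variables in the report.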
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
0 6396
64.0%
14 1435
 
14.3%
21 718
 
7.2%
7 685
 
6.9%
28 409
 
4.1%
42 115
 
1.1%
35 106
 
1.1%
49 23
 
0.2%
56 19
 
0.2%
2 16
 
0.2%
Other values (20) 78
 
0.8%
ValueCountFrequency (%)
0 6396
64.0%
1 7
 
0.1%
2 16
 
0.2%
3 15
 
0.1%
4 7
 
0.1%
5 2
 
< 0.1%
6 4
 
< 0.1%
7 685
 
6.9%
9 1
 
< 0.1%
10 1
 
< 0.1%
ValueCountFrequency (%)
84 6
 
0.1%
77 4
 
< 0.1%
70 6
 
0.1%
63 14
 
0.1%
56 19
 
0.2%
49 23
 
0.2%
45 1
 
< 0.1%
44 1
 
< 0.1%
43 2
 
< 0.1%
42 115
1.1%

패치2단계_지급갯수
Real number (ℝ)

ZEROS 

Distinct32
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.3016
Minimum0
Maximum84
Zeros5099
Zeros (%)51.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q314
95-th percentile35
Maximum84
Range84
Interquartile range (IQR)14

Descriptive statistics

Standard deviation12.482669
Coefficient of variation (CV)1.3419916
Kurtosis3.5258854
Mean9.3016
Median Absolute Deviation (MAD)0
Skewness1.6718357
Sum93016
Variance155.81702
MonotonicityNot monotonic
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
0 5099
51.0%
14 1879
 
18.8%
21 954
 
9.5%
7 874
 
8.7%
28 562
 
5.6%
35 186
 
1.9%
42 185
 
1.8%
49 74
 
0.7%
56 48
 
0.5%
2 32
 
0.3%
Other values (22) 107
 
1.1%
ValueCountFrequency (%)
0 5099
51.0%
1 8
 
0.1%
2 32
 
0.3%
3 14
 
0.1%
4 8
 
0.1%
5 2
 
< 0.1%
6 3
 
< 0.1%
7 874
 
8.7%
8 1
 
< 0.1%
9 4
 
< 0.1%
ValueCountFrequency (%)
84 5
 
0.1%
81 1
 
< 0.1%
77 6
 
0.1%
70 18
 
0.2%
63 24
 
0.2%
56 48
 
0.5%
49 74
 
0.7%
42 185
1.8%
35 186
1.9%
34 1
 
< 0.1%

패치3단계_지급갯수
Real number (ℝ)

ZEROS 

Distinct29
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.1715
Minimum0
Maximum105
Zeros6474
Zeros (%)64.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q314
95-th percentile35
Maximum105
Range105
Interquartile range (IQR)14

Descriptive statistics

Standard deviation12.807533
Coefficient of variation (CV)1.7858931
Kurtosis7.8690194
Mean7.1715
Median Absolute Deviation (MAD)0
Skewness2.4681332
Sum71715
Variance164.03289
MonotonicityNot monotonic
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
0 6474
64.7%
14 1135
 
11.3%
21 725
 
7.2%
7 718
 
7.2%
28 343
 
3.4%
42 196
 
2.0%
35 167
 
1.7%
56 56
 
0.6%
49 48
 
0.5%
84 27
 
0.3%
Other values (19) 111
 
1.1%
ValueCountFrequency (%)
0 6474
64.7%
1 8
 
0.1%
2 13
 
0.1%
3 10
 
0.1%
4 9
 
0.1%
5 4
 
< 0.1%
6 2
 
< 0.1%
7 718
 
7.2%
8 1
 
< 0.1%
9 1
 
< 0.1%
ValueCountFrequency (%)
105 3
 
< 0.1%
84 27
 
0.3%
77 11
 
0.1%
70 17
 
0.2%
63 22
 
0.2%
56 56
 
0.6%
49 48
 
0.5%
42 196
2.0%
38 1
 
< 0.1%
37 1
 
< 0.1%

껌1mg_지급갯수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 10000
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 10000
100.0%

껌2mg_지급갯수
Real number (ℝ)

ZEROS 

Distinct110
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.0668
Minimum0
Maximum1320
Zeros6702
Zeros (%)67.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q324
95-th percentile96
Maximum1320
Range1320
Interquartile range (IQR)24

Descriptive statistics

Standard deviation60.631688
Coefficient of variation (CV)2.747643
Kurtosis110.96834
Mean22.0668
Median Absolute Deviation (MAD)0
Skewness7.9446108
Sum220668
Variance3676.2016
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 6702
67.0%
30 834
 
8.3%
24 489
 
4.9%
60 430
 
4.3%
48 224
 
2.2%
90 197
 
2.0%
15 129
 
1.3%
120 128
 
1.3%
10 82
 
0.8%
72 76
 
0.8%
Other values (100) 709
 
7.1%
ValueCountFrequency (%)
0 6702
67.0%
1 12
 
0.1%
2 15
 
0.1%
3 20
 
0.2%
4 6
 
0.1%
5 7
 
0.1%
6 11
 
0.1%
7 2
 
< 0.1%
10 82
 
0.8%
12 61
 
0.6%
ValueCountFrequency (%)
1320 1
 
< 0.1%
1308 1
 
< 0.1%
1230 1
 
< 0.1%
1224 1
 
< 0.1%
1220 1
 
< 0.1%
720 3
< 0.1%
696 1
 
< 0.1%
630 2
< 0.1%
600 2
< 0.1%
576 1
 
< 0.1%

껌3mg_지급갯수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 10000
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 10000
100.0%

껌4mg_지급갯수
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9998 
30
 
2

Length

Max length2
Median length1
Mean length1.0002
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9998
> 99.9%
30 2
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 9998
> 99.9%
30 2
 
< 0.1%

껌5mg_지급갯수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 10000
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 10000
100.0%

껌6mg_지급갯수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 10000
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 10000
100.0%

캔디1mg_지급갯수
Real number (ℝ)

ZEROS 

Distinct47
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.1929
Minimum0
Maximum1188
Zeros8143
Zeros (%)81.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile72
Maximum1188
Range1188
Interquartile range (IQR)0

Descriptive statistics

Standard deviation45.830795
Coefficient of variation (CV)3.75881
Kurtosis153.29796
Mean12.1929
Median Absolute Deviation (MAD)0
Skewness9.911673
Sum121929
Variance2100.4617
MonotonicityNot monotonic
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
0 8143
81.4%
36 845
 
8.5%
72 320
 
3.2%
12 220
 
2.2%
24 113
 
1.1%
108 85
 
0.9%
144 61
 
0.6%
48 28
 
0.3%
180 27
 
0.3%
216 27
 
0.3%
Other values (37) 131
 
1.3%
ValueCountFrequency (%)
0 8143
81.4%
1 8
 
0.1%
2 4
 
< 0.1%
4 1
 
< 0.1%
6 1
 
< 0.1%
12 220
 
2.2%
16 2
 
< 0.1%
18 1
 
< 0.1%
24 113
 
1.1%
25 1
 
< 0.1%
ValueCountFrequency (%)
1188 1
 
< 0.1%
972 1
 
< 0.1%
936 2
< 0.1%
792 1
 
< 0.1%
720 3
< 0.1%
700 1
 
< 0.1%
636 1
 
< 0.1%
576 1
 
< 0.1%
540 2
< 0.1%
504 2
< 0.1%

캔디2mg_지급갯수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 10000
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 10000
100.0%

Interactions

[25 interaction plots; images not preserved]

Correlations

Variable 기관유형 지역 서비스구분 출생년도 성별 등록유형 패치1단계_지급갯수 패치2단계_지급갯수 패치3단계_지급갯수 껌2mg_지급갯수 껌4mg_지급갯수 캔디1mg_지급갯수
기관유형 1.000 0.123 1.000 0.009 0.087 1.000 0.056 0.077 0.063 0.000 0.000 0.000
지역 0.123 1.000 0.183 0.115 0.038 0.175 0.134 0.121 0.118 0.084 0.057 0.097
서비스구분 1.000 0.183 1.000 0.071 0.038 1.000 0.045 0.068 0.098 0.000 0.000 0.000
출생년도 0.009 0.115 0.071 1.000 0.104 0.180 0.086 0.081 0.090 0.011 0.000 0.063
성별 0.087 0.038 0.038 0.104 1.000 0.319 0.084 0.035 0.024 0.000 0.000 0.000
등록유형 1.000 0.175 1.000 0.180 0.319 1.000 0.038 0.049 0.029 0.000 0.000 0.000
패치1단계_지급갯수 0.056 0.134 0.045 0.086 0.084 0.038 1.000 0.209 0.145 0.120 0.000 0.184
패치2단계_지급갯수 0.077 0.121 0.068 0.081 0.035 0.049 0.209 1.000 0.136 0.073 0.000 0.179
패치3단계_지급갯수 0.063 0.118 0.098 0.090 0.024 0.029 0.145 0.136 1.000 0.052 0.000 0.101
껌2mg_지급갯수 0.000 0.084 0.000 0.011 0.000 0.000 0.120 0.073 0.052 1.000 0.000 0.066
껌4mg_지급갯수 0.000 0.057 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 1.000 0.000
캔디1mg_지급갯수 0.000 0.097 0.000 0.063 0.000 0.000 0.184 0.179 0.101 0.066 0.000 1.000

Variable 등록유형 서비스구분 껌4mg_지급갯수 성별 출생년도 기관유형 지역
등록유형 1.000 1.000 0.000 0.247 0.077 0.999 0.064
서비스구분 1.000 1.000 0.000 0.062 0.045 1.000 0.099
껌4mg_지급갯수 0.000 0.000 1.000 0.000 0.000 0.000 0.051
성별 0.247 0.062 0.000 1.000 0.078 0.055 0.034
출생년도 0.077 0.045 0.000 0.078 1.000 0.007 0.048
기관유형 0.999 1.000 0.000 0.055 0.007 1.000 0.110
지역 0.064 0.099 0.051 0.034 0.048 0.110 1.000

Variable 패치1단계_지급갯수 패치2단계_지급갯수 패치3단계_지급갯수 껌2mg_지급갯수 캔디1mg_지급갯수 기관유형 지역 서비스구분 출생년도 성별 등록유형 껌4mg_지급갯수
패치1단계_지급갯수 1.000 -0.126 -0.266 -0.010 -0.041 0.043 0.052 0.027 0.041 0.065 0.016 0.000
패치2단계_지급갯수 -0.126 1.000 0.013 -0.097 -0.049 0.059 0.047 0.040 0.038 0.027 0.021 0.000
패치3단계_지급갯수 -0.266 0.013 1.000 -0.106 -0.022 0.063 0.045 0.043 0.046 0.024 0.005 0.000
껌2mg_지급갯수 -0.010 -0.097 -0.106 1.000 0.078 0.000 0.038 0.000 0.006 0.000 0.000 0.000
캔디1mg_지급갯수 -0.041 -0.049 -0.022 0.078 1.000 0.000 0.038 0.000 0.030 0.000 0.000 0.000
기관유형 0.043 0.059 0.063 0.000 0.000 1.000 0.110 1.000 0.007 0.055 0.999 0.000
지역 0.052 0.047 0.045 0.038 0.038 0.110 1.000 0.099 0.048 0.034 0.064 0.051
서비스구분 0.027 0.040 0.043 0.000 0.000 1.000 0.099 1.000 0.045 0.062 1.000 0.000
출생년도 0.041 0.038 0.046 0.006 0.030 0.007 0.048 0.045 1.000 0.078 0.077 0.000
성별 0.065 0.027 0.024 0.000 0.000 0.055 0.034 0.062 0.078 1.000 0.247 0.000
등록유형 0.016 0.021 0.005 0.000 0.000 0.999 0.064 1.000 0.077 0.247 1.000 0.000
껌4mg_지급갯수 0.000 0.000 0.000 0.000 0.000 0.000 0.051 0.000 0.000 0.000 0.000 1.000
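
The coefficients of 1.000 between 기관유형 and 서비스구분 are what one would expect if each service type belongs to exactly one institution type. The report does not state which coefficient each matrix uses, so the sketch below uses plain (uncorrected) Cramér's V as an assumption. The cross-tabulation is also an assumption, inferred from the matching marginal counts (9514 for 보건소 and 보건소 금연클리닉; 379 + 107 = 486 for 금연지원센터); the report itself does not show it.

```python
import math

def cramers_v(table):
    """Plain (uncorrected) Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Assumed cross-tabulation of 기관유형 (rows) by 서비스구분 (columns):
# 보건소 금연클리닉, 찾아가는 금연서비스, 단기금연캠프.
table = [
    [9514, 0, 0],   # 보건소
    [0, 379, 107],  # 금연지원센터
]
print(round(cramers_v(table), 3))
```

When one variable fully determines the other, as in this assumed table, the statistic reaches its maximum of 1. This would also explain why the alerts flag these columns as highly correlated with each other.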

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관유형지역서비스구분제공기관출생년도성별등록유형패치1단계_지급갯수패치2단계_지급갯수패치3단계_지급갯수껌1mg_지급갯수껌2mg_지급갯수껌3mg_지급갯수껌4mg_지급갯수껌5mg_지급갯수껌6mg_지급갯수캔디1mg_지급갯수캔디2mg_지급갯수
50411보건소세종특별자치시보건소 금연클리닉세종특별자치시보건소1980~1989보건소035210120000000
68557보건소경상북도보건소 금연클리닉경북 칠곡군보건소2000~2009출장070024000000
51349보건소서울특별시보건소 금연클리닉서울 도봉구보건소1950~1959보건소014000000000
28478보건소광주광역시보건소 금연클리닉광주 서구보건소1940~1949보건소056140000001320
10053보건소서울특별시보건소 금연클리닉서울 영등포구보건소1950~1959보건소021000000000
13279보건소서울특별시보건소 금연클리닉서울 중구보건소1970~1979출장07000000000
31167보건소경상남도보건소 금연클리닉경남 김해시보건소1970~1979보건소1400000000360
41471보건소부산광역시보건소 금연클리닉부산 동구보건소1950~1959보건소20000000000
35671보건소경상북도보건소 금연클리닉경북 울진군 보건소1970~1979보건소020000000360
59102보건소경기도보건소 금연클리닉경기 시흥시보건소1980~1989보건소000015000001440
기관유형지역서비스구분제공기관출생년도성별등록유형패치1단계_지급갯수패치2단계_지급갯수패치3단계_지급갯수껌1mg_지급갯수껌2mg_지급갯수껌3mg_지급갯수껌4mg_지급갯수껌5mg_지급갯수껌6mg_지급갯수캔디1mg_지급갯수캔디2mg_지급갯수
46706보건소전라남도보건소 금연클리닉전남 여수시보건소1980~1989보건소002100000000
55931보건소전라남도보건소 금연클리닉전남 구례군보건의료원1960~1969보건소002100000000
49635보건소경기도보건소 금연클리닉경기 포천시보건소1970~1979보건소003500000000
40868보건소경기도보건소 금연클리닉경기 화성시 동탄보건소2000~2009보건소0140030000000
8103보건소충청북도보건소 금연클리닉충북 충주시보건소1960~1969보건소214200144000000
19439보건소인천광역시보건소 금연클리닉인천 옹진군보건소1980~1989보건소770030000000
9502보건소충청북도보건소 금연클리닉충북 충주시보건소1960~1969보건소0210000000720
4180보건소강원도보건소 금연클리닉강원 정선군보건소1940~1949보건소28014000000360
62119보건소인천광역시보건소 금연클리닉인천 남동구보건소1990~1999보건소140000000000
70063보건소전라남도보건소 금연클리닉전남 여수시보건소1960~1969보건소210000000000

Duplicate rows

Most frequently occurring

기관유형지역서비스구분제공기관출생년도성별등록유형패치1단계_지급갯수패치2단계_지급갯수패치3단계_지급갯수껌1mg_지급갯수껌2mg_지급갯수껌3mg_지급갯수껌4mg_지급갯수껌5mg_지급갯수껌6mg_지급갯수캔디1mg_지급갯수캔디2mg_지급갯수# duplicates
363보건소광주광역시보건소 금연클리닉광주 북구보건소2000~2009출장070000000008
139보건소경기도보건소 금연클리닉경기 시흥시보건소1970~1979보건소1400000000006
364보건소광주광역시보건소 금연클리닉광주 북구보건소2000~2009출장700000000006
35보건소강원도보건소 금연클리닉강원 춘천시보건소1960~1969보건소14000000007205
89보건소경기도보건소 금연클리닉경기 남양주보건소1970~1979보건소0140000000005
229보건소경기도보건소 금연클리닉경기 평택시 송탄보건소1970~1979보건소14000150000005
239보건소경기도보건소 금연클리닉경기 평택시 평택보건소1970~1979보건소1400000000005
304보건소경상남도보건소 금연클리닉경남 창원시 창원보건소1970~1979보건소0014000000005
350보건소광주광역시보건소 금연클리닉광주 광산구보건소1970~1979보건소0140000000005
409보건소대전광역시보건소 금연클리닉대전 대덕구보건소1970~1979보건소2100000000005
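
A table like the one above can be produced by grouping identical records and counting each group. The sketch below uses hypothetical toy records (not taken from the dataset) and the standard library only. The report does not state how its duplicate total is derived, so two common ways of counting are shown.

```python
from collections import Counter

# Hypothetical toy records standing in for dataset rows (not taken from the dataset).
rows = [
    ("보건소", "경기도", 14),
    ("보건소", "경기도", 14),
    ("보건소", "경기도", 14),
    ("보건소", "강원도", 7),
    ("보건소", "강원도", 7),
    ("보건소", "서울특별시", 21),
]

occurrences = Counter(rows)
# Keep only records that occur more than once, with their occurrence counts.
duplicated = {row: n for row, n in occurrences.items() if n > 1}
n_groups = len(duplicated)                           # distinct records that repeat
n_surplus = sum(n - 1 for n in duplicated.values())  # copies beyond the first

# List the most frequently occurring records first, as in the table above.
for row, n in sorted(duplicated.items(), key=lambda item: -item[1]):
    print(row, n)
print(n_groups, n_surplus)
```

Whether duplicates should be dropped before analysis depends on whether identical records are distinct counseling events that happen to share the same coarse attributes, which this report alone cannot settle.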