Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

Text1
Categorical4

Dataset

Description경기도 광주시 흡연단속시스템 전수조사 현황에 관한 데이터로 단속키, 법정동코드, 조사내용, 조사구분 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15122186/fileData.do

Alerts

법정동코드 has constant value ""Constant
조사내용 is highly overall correlated with 전수조사내용키High correlation
전수조사내용키 is highly overall correlated with 조사내용High correlation

Reproduction

Analysis started2023-12-12 12:52:30.140615
Analysis finished2023-12-12 12:52:30.609796
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct8120
Distinct (%)81.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T21:52:30.751373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length24
Mean length24
Min length24

Characters and Unicode

Total characters240000
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6369 ?
Unique (%)63.7%

Sample

1st rowS00599999920210521110415
2nd rowS00299999920230102120747
3rd rowS00599999920211124123909
4th rowS00299999920210407103157
5th rowS00399999920200129230400
ValueCountFrequency (%)
s00299999920210427131842 3
 
< 0.1%
s00499999920210514201738 3
 
< 0.1%
s00499999920230214095727 3
 
< 0.1%
s00599999920211108111058 3
 
< 0.1%
s00399999920230310111513 3
 
< 0.1%
s00599999920210809114826 3
 
< 0.1%
s00599999920210624165104 3
 
< 0.1%
s00299999920230130113439 3
 
< 0.1%
s00499999920230303102634 3
 
< 0.1%
s00299999920230308120515 3
 
< 0.1%
Other values (8110) 9970
99.7%
2023-12-12T21:52:31.066120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 64824
27.0%
0 52154
21.7%
2 33663
14.0%
1 30789
12.8%
5 13580
 
5.7%
3 13206
 
5.5%
S 10000
 
4.2%
4 9968
 
4.2%
8 4043
 
1.7%
6 4029
 
1.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 230000
95.8%
Uppercase Letter 10000
 
4.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 64824
28.2%
0 52154
22.7%
2 33663
14.6%
1 30789
13.4%
5 13580
 
5.9%
3 13206
 
5.7%
4 9968
 
4.3%
8 4043
 
1.8%
6 4029
 
1.8%
7 3744
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
S 10000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 230000
95.8%
Latin 10000
 
4.2%

Most frequent character per script

Common
ValueCountFrequency (%)
9 64824
28.2%
0 52154
22.7%
2 33663
14.6%
1 30789
13.4%
5 13580
 
5.9%
3 13206
 
5.7%
4 9968
 
4.3%
8 4043
 
1.8%
6 4029
 
1.8%
7 3744
 
1.6%
Latin
ValueCountFrequency (%)
S 10000
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 240000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 64824
27.0%
0 52154
21.7%
2 33663
14.0%
1 30789
12.8%
5 13580
 
5.7%
3 13206
 
5.5%
S 10000
 
4.2%
4 9968
 
4.2%
8 4043
 
1.7%
6 4029
 
1.7%

법정동코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
4161000000
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4161000000
2nd row4161000000
3rd row4161000000
4th row4161000000
5th row4161000000

Common Values

ValueCountFrequency (%)
4161000000 10000
100.0%

Length

2023-12-12T21:52:31.207229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:52:31.295363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4161000000 10000
100.0%

전수조사내용키
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
JOSA001
2941 
JOSA002
2887 
JOSA005
2821 
JOSA006
465 
JOSA003
456 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowJOSA005
2nd rowJOSA001
3rd rowJOSA005
4th rowJOSA006
5th rowJOSA005

Common Values

ValueCountFrequency (%)
JOSA001 2941
29.4%
JOSA002 2887
28.9%
JOSA005 2821
28.2%
JOSA006 465
 
4.7%
JOSA003 456
 
4.6%
JOSA004 430
 
4.3%

Length

2023-12-12T21:52:31.392444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:52:31.492321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
josa001 2941
29.4%
josa002 2887
28.9%
josa005 2821
28.2%
josa006 465
 
4.7%
josa003 456
 
4.6%
josa004 430
 
4.3%

조사내용
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
흡연자-금연구역에서 흡연행위
3406 
흡연실 시설기준
3343 
흡연실 설치여부
3251 

Length

Max length15
Median length8
Mean length10.3842
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row흡연실 설치여부
2nd row흡연자-금연구역에서 흡연행위
3rd row흡연실 설치여부
4th row흡연자-금연구역에서 흡연행위
5th row흡연실 설치여부

Common Values

ValueCountFrequency (%)
흡연자-금연구역에서 흡연행위 3406
34.1%
흡연실 시설기준 3343
33.4%
흡연실 설치여부 3251
32.5%

Length

2023-12-12T21:52:31.624849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:52:31.738081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
흡연실 6594
33.0%
흡연자-금연구역에서 3406
17.0%
흡연행위 3406
17.0%
시설기준 3343
16.7%
설치여부 3251
16.3%

조사구분
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
X
6952 
Y
3045 
N
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowX
2nd rowX
3rd rowY
4th rowX
5th rowY

Common Values

ValueCountFrequency (%)
X 6952
69.5%
Y 3045
30.4%
N 3
 
< 0.1%

Length

2023-12-12T21:52:31.907062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:52:32.008209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
x 6952
69.5%
y 3045
30.4%
n 3
 
< 0.1%

Correlations

2023-12-12T21:52:32.071070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전수조사내용키조사내용조사구분
전수조사내용키1.0001.0000.404
조사내용1.0001.0000.000
조사구분0.4040.0001.000
2023-12-12T21:52:32.175478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
조사내용전수조사내용키조사구분
조사내용1.0001.0000.000
전수조사내용키1.0001.0000.183
조사구분0.0000.1831.000
2023-12-12T21:52:32.270928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전수조사내용키조사내용조사구분
전수조사내용키1.0001.0000.183
조사내용1.0001.0000.000
조사구분0.1830.0001.000

Missing values

2023-12-12T21:52:30.477286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:52:30.563937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

단속키법정동코드전수조사내용키조사내용조사구분
33113S005999999202105211104154161000000JOSA005흡연실 설치여부X
10371S002999999202301021207474161000000JOSA001흡연자-금연구역에서 흡연행위X
49853S005999999202111241239094161000000JOSA005흡연실 설치여부Y
6899S002999999202104071031574161000000JOSA006흡연자-금연구역에서 흡연행위X
13238S003999999202001292304004161000000JOSA005흡연실 설치여부Y
45913S005999999202110131210154161000000JOSA002흡연실 시설기준Y
43892S005999999202109141230054161000000JOSA005흡연실 설치여부Y
28539S005999999202104091122074161000000JOSA001흡연자-금연구역에서 흡연행위Y
33436S005999999202105251334024161000000JOSA002흡연실 시설기준X
11206S002999999202302161156344161000000JOSA002흡연실 시설기준X
단속키법정동코드전수조사내용키조사내용조사구분
5104S001999999202305010906404161000000JOSA002흡연실 시설기준X
20496S004999999202302091048504161000000JOSA001흡연자-금연구역에서 흡연행위X
14715S003999999202105201040024161000000JOSA001흡연자-금연구역에서 흡연행위X
25882S005999999202103221103524161000000JOSA002흡연실 시설기준X
9493S002999999202210051219084161000000JOSA002흡연실 시설기준X
19440S004999999202104121932124161000000JOSA003흡연실 시설기준X
18169S003999999202304210951374161000000JOSA002흡연실 시설기준X
5239S001999999202305031009464161000000JOSA004흡연실 설치여부X
7747S002999999202105251416054161000000JOSA002흡연실 시설기준X
35587S005999999202106151141334161000000JOSA002흡연실 시설기준Y