Overview

Dataset statistics

Number of variables4
Number of observations49
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory35.7 B

Variable types

Numeric1
Categorical1
Text1
DateTime1

Dataset

Description인천광역시 남동구 폐업신고 원스톱서비스 대상업종에 대한 데이터로 연번, 분류, 업종, 데이터기준일자 항목을 제공합니다.
Author인천광역시 남동구
URLhttps://www.data.go.kr/data/15091166/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 분류High correlation
분류 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
업종 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:08:56.606202
Analysis finished2023-12-12 08:08:56.970016
Duration0.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25
Minimum1
Maximum49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-12T17:08:57.039764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.4
Q113
median25
Q337
95-th percentile46.6
Maximum49
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.57154761
Kurtosis-1.2
Mean25
Median Absolute Deviation (MAD)12
Skewness0
Sum1225
Variance204.16667
MonotonicityStrictly increasing
2023-12-12T17:08:57.215734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
1 1
 
2.0%
38 1
 
2.0%
28 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%
40 1
2.0%

분류
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)20.4%
Missing0
Missing (%)0.0%
Memory size524.0 B
농림축산분야
12 
기타분야
문화체육분야
여성가족분야
보건복지분야
Other values (5)
13 

Length

Max length6
Median length6
Mean length5.5918367
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건설교통분야
2nd row건설교통분야
3rd row농림축산분야
4th row농림축산분야
5th row농림축산분야

Common Values

ValueCountFrequency (%)
농림축산분야 12
24.5%
기타분야 8
16.3%
문화체육분야 7
14.3%
여성가족분야 5
10.2%
보건복지분야 4
 
8.2%
식품위생분야 4
 
8.2%
해양수산분야 3
 
6.1%
건설교통분야 2
 
4.1%
산업자원분야 2
 
4.1%
환경분야 2
 
4.1%

Length

2023-12-12T17:08:57.376818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:08:57.520131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농림축산분야 12
24.5%
기타분야 8
16.3%
문화체육분야 7
14.3%
여성가족분야 5
10.2%
보건복지분야 4
 
8.2%
식품위생분야 4
 
8.2%
해양수산분야 3
 
6.1%
건설교통분야 2
 
4.1%
산업자원분야 2
 
4.1%
환경분야 2
 
4.1%

업종
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-12T17:08:57.800868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length6.5306122
Min length3

Characters and Unicode

Total characters320
Distinct characters114
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st row건설기계사업
2nd row자동차관리사업
3rd row가축거래상인
4th row가축사육업
5th row가축인공수정소
ValueCountFrequency (%)
3
 
5.4%
건설기계사업 1
 
1.8%
낚시어선업 1
 
1.8%
계량기사업 1
 
1.8%
석탄가공업 1
 
1.8%
건강기능식품영업 1
 
1.8%
공중위생업 1
 
1.8%
소독업 1
 
1.8%
식품위생업 1
 
1.8%
가정폭력피해자보호시설 1
 
1.8%
Other values (44) 44
78.6%
2023-12-12T17:08:58.179587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
40
 
12.5%
11
 
3.4%
10
 
3.1%
9
 
2.8%
7
 
2.2%
7
 
2.2%
7
 
2.2%
7
 
2.2%
6
 
1.9%
6
 
1.9%
Other values (104) 210
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 308
96.2%
Space Separator 7
 
2.2%
Other Punctuation 3
 
0.9%
Open Punctuation 1
 
0.3%
Close Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
13.0%
11
 
3.6%
10
 
3.2%
9
 
2.9%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
1.9%
6
 
1.9%
6
 
1.9%
Other values (100) 199
64.6%
Space Separator
ValueCountFrequency (%)
7
100.0%
Other Punctuation
ValueCountFrequency (%)
· 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 308
96.2%
Common 12
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
13.0%
11
 
3.6%
10
 
3.2%
9
 
2.9%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
1.9%
6
 
1.9%
6
 
1.9%
Other values (100) 199
64.6%
Common
ValueCountFrequency (%)
7
58.3%
· 3
25.0%
( 1
 
8.3%
) 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 308
96.2%
ASCII 9
 
2.8%
None 3
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
40
 
13.0%
11
 
3.6%
10
 
3.2%
9
 
2.9%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
1.9%
6
 
1.9%
6
 
1.9%
Other values (100) 199
64.6%
ASCII
ValueCountFrequency (%)
7
77.8%
( 1
 
11.1%
) 1
 
11.1%
None
ValueCountFrequency (%)
· 3
100.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
Minimum2023-09-19 00:00:00
Maximum2023-09-19 00:00:00
2023-12-12T17:08:58.322234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:08:58.554519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T17:08:56.753431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:08:58.670460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분류업종
연번1.0000.9591.000
분류0.9591.0001.000
업종1.0001.0001.000
2023-12-12T17:08:58.766845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분류
연번1.0000.686
분류0.6861.000

Missing values

2023-12-12T17:08:56.856054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:08:56.936967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번분류업종데이터기준일자
01건설교통분야건설기계사업2023-09-19
12건설교통분야자동차관리사업2023-09-19
23농림축산분야가축거래상인2023-09-19
34농림축산분야가축사육업2023-09-19
45농림축산분야가축인공수정소2023-09-19
56농림축산분야농어촌관광휴양사업2023-09-19
67농림축산분야동물병원2023-09-19
78농림축산분야동물판매업2023-09-19
89농림축산분야동물용의약품 등 제조업2023-09-19
910농림축산분야부화업2023-09-19
연번분류업종데이터기준일자
3940환경분야가축분뇨관련영업2023-09-19
4041환경분야분뇨·하수관련업2023-09-19
4142기타분야국내직업소개사업2023-09-19
4243기타분야담배소매업2023-09-19
4344기타분야담배판매업2023-09-19
4445기타분야방문판매신고업2023-09-19
4546기타분야옥외광고업2023-09-19
4647기타분야전화권유판매업2023-09-19
4748기타분야통신판매업2023-09-19
4849기타분야행정사2023-09-19