Overview

Dataset statistics

Number of variables7
Number of observations77
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.5 KiB
Average record size in memory59.7 B

Variable types

Numeric2
Categorical3
Text2

Dataset

Description인천광역시 남동구 물가조사현황에 대한 데이터로 (연번, 품목구분, 품목명, 규격 및 단위, 가격(원),물가기준일, 데이터기준일)을 제공합니다.
Author인천광역시 남동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15067700&srcSe=7661IVAWM27C61E190

Alerts

물가기준일 has constant value ""Constant
데이터기준일 has constant value ""Constant
연번 is highly overall correlated with 품목구분High correlation
품목구분 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
가격(원) has 1 (1.3%) zerosZeros

Reproduction

Analysis started2024-04-17 09:19:56.878055
Analysis finished2024-04-17 09:19:58.093935
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct77
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39
Minimum1
Maximum77
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size825.0 B
2024-04-17T18:19:58.158180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.8
Q120
median39
Q358
95-th percentile73.2
Maximum77
Range76
Interquartile range (IQR)38

Descriptive statistics

Standard deviation22.371857
Coefficient of variation (CV)0.57363737
Kurtosis-1.2
Mean39
Median Absolute Deviation (MAD)19
Skewness0
Sum3003
Variance500.5
MonotonicityStrictly increasing
2024-04-17T18:19:58.288442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.3%
50 1
 
1.3%
57 1
 
1.3%
56 1
 
1.3%
55 1
 
1.3%
54 1
 
1.3%
53 1
 
1.3%
52 1
 
1.3%
51 1
 
1.3%
49 1
 
1.3%
Other values (67) 67
87.0%
ValueCountFrequency (%)
1 1
1.3%
2 1
1.3%
3 1
1.3%
4 1
1.3%
5 1
1.3%
6 1
1.3%
7 1
1.3%
8 1
1.3%
9 1
1.3%
10 1
1.3%
ValueCountFrequency (%)
77 1
1.3%
76 1
1.3%
75 1
1.3%
74 1
1.3%
73 1
1.3%
72 1
1.3%
71 1
1.3%
70 1
1.3%
69 1
1.3%
68 1
1.3%

품목구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size748.0 B
생필품
34 
외식
24 
기타서비스
19 

Length

Max length5
Median length3
Mean length3.1818182
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row외식
2nd row외식
3rd row외식
4th row외식
5th row외식

Common Values

ValueCountFrequency (%)
생필품 34
44.2%
외식 24
31.2%
기타서비스 19
24.7%

Length

2024-04-17T18:19:58.417539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:19:58.514057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
생필품 34
44.2%
외식 24
31.2%
기타서비스 19
24.7%
Distinct76
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size748.0 B
2024-04-17T18:19:58.721401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length3.4545455
Min length1

Characters and Unicode

Total characters266
Distinct characters133
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)97.4%

Sample

1st row설렁탕
2nd row냉면
3rd row비빔밥
4th row갈비탕
5th row삼계탕
ValueCountFrequency (%)
쇠고기 2
 
2.6%
1
 
1.3%
배추 1
 
1.3%
달걀 1
 
1.3%
닭고기 1
 
1.3%
돼지고기 1
 
1.3%
멸치 1
 
1.3%
고등어 1
 
1.3%
사과 1
 
1.3%
감자 1
 
1.3%
Other values (66) 66
85.7%
2024-04-17T18:19:59.084762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
 
6.8%
11
 
4.1%
9
 
3.4%
8
 
3.0%
8
 
3.0%
6
 
2.3%
( 5
 
1.9%
5
 
1.9%
) 5
 
1.9%
5
 
1.9%
Other values (123) 186
69.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 254
95.5%
Open Punctuation 5
 
1.9%
Close Punctuation 5
 
1.9%
Uppercase Letter 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
7.1%
11
 
4.3%
9
 
3.5%
8
 
3.1%
8
 
3.1%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
Other values (119) 176
69.3%
Uppercase Letter
ValueCountFrequency (%)
C 1
50.0%
P 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 254
95.5%
Common 10
 
3.8%
Latin 2
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
7.1%
11
 
4.3%
9
 
3.5%
8
 
3.1%
8
 
3.1%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
Other values (119) 176
69.3%
Common
ValueCountFrequency (%)
( 5
50.0%
) 5
50.0%
Latin
ValueCountFrequency (%)
C 1
50.0%
P 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 254
95.5%
ASCII 12
 
4.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
18
 
7.1%
11
 
4.3%
9
 
3.5%
8
 
3.1%
8
 
3.1%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
Other values (119) 176
69.3%
ASCII
ValueCountFrequency (%)
( 5
41.7%
) 5
41.7%
C 1
 
8.3%
P 1
 
8.3%
Distinct63
Distinct (%)81.8%
Missing0
Missing (%)0.0%
Memory size748.0 B
2024-04-17T18:19:59.315564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length8.7272727
Min length3

Characters and Unicode

Total characters672
Distinct characters180
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)72.7%

Sample

1st row1인분(보통)
2nd row물냉면, 1인분(보통)
3rd row1인분(보통)
4th row1인분(보통)
5th row1인분(보통)
ValueCountFrequency (%)
1킬로 9
 
7.3%
1인분(보통 8
 
6.5%
포함 3
 
2.4%
1리터 3
 
2.4%
1인분(200그램 3
 
2.4%
600그램 3
 
2.4%
sk 3
 
2.4%
400그램 2
 
1.6%
3킬로 2
 
1.6%
1인분(공기밥 2
 
1.6%
Other values (83) 85
69.1%
2024-04-17T18:19:59.715232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 65
 
9.7%
46
 
6.8%
( 45
 
6.7%
) 45
 
6.7%
0 33
 
4.9%
26
 
3.9%
21
 
3.1%
15
 
2.2%
15
 
2.2%
3 11
 
1.6%
Other values (170) 350
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 377
56.1%
Decimal Number 133
 
19.8%
Space Separator 46
 
6.8%
Open Punctuation 45
 
6.7%
Close Punctuation 45
 
6.7%
Other Punctuation 12
 
1.8%
Uppercase Letter 10
 
1.5%
Lowercase Letter 2
 
0.3%
Math Symbol 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
6.9%
21
 
5.6%
15
 
4.0%
15
 
4.0%
11
 
2.9%
11
 
2.9%
10
 
2.7%
9
 
2.4%
8
 
2.1%
8
 
2.1%
Other values (147) 243
64.5%
Decimal Number
ValueCountFrequency (%)
1 65
48.9%
0 33
24.8%
3 11
 
8.3%
2 10
 
7.5%
5 4
 
3.0%
4 3
 
2.3%
6 3
 
2.3%
8 2
 
1.5%
9 1
 
0.8%
7 1
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
S 3
30.0%
K 3
30.0%
O 1
 
10.0%
X 1
 
10.0%
J 1
 
10.0%
C 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
, 10
83.3%
. 2
 
16.7%
Space Separator
ValueCountFrequency (%)
46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Lowercase Letter
ValueCountFrequency (%)
x 2
100.0%
Math Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 377
56.1%
Common 283
42.1%
Latin 12
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
6.9%
21
 
5.6%
15
 
4.0%
15
 
4.0%
11
 
2.9%
11
 
2.9%
10
 
2.7%
9
 
2.4%
8
 
2.1%
8
 
2.1%
Other values (147) 243
64.5%
Common
ValueCountFrequency (%)
1 65
23.0%
46
16.3%
( 45
15.9%
) 45
15.9%
0 33
11.7%
3 11
 
3.9%
, 10
 
3.5%
2 10
 
3.5%
5 4
 
1.4%
4 3
 
1.1%
Other values (6) 11
 
3.9%
Latin
ValueCountFrequency (%)
S 3
25.0%
K 3
25.0%
x 2
16.7%
O 1
 
8.3%
X 1
 
8.3%
J 1
 
8.3%
C 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 377
56.1%
ASCII 293
43.6%
Math Operators 2
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 65
22.2%
46
15.7%
( 45
15.4%
) 45
15.4%
0 33
11.3%
3 11
 
3.8%
, 10
 
3.4%
2 10
 
3.4%
5 4
 
1.4%
4 3
 
1.0%
Other values (12) 21
 
7.2%
Hangul
ValueCountFrequency (%)
26
 
6.9%
21
 
5.6%
15
 
4.0%
15
 
4.0%
11
 
2.9%
11
 
2.9%
10
 
2.7%
9
 
2.4%
8
 
2.1%
8
 
2.1%
Other values (147) 243
64.5%
Math Operators
ValueCountFrequency (%)
2
100.0%

가격(원)
Real number (ℝ)

ZEROS 

Distinct75
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15403.857
Minimum0
Maximum134022
Zeros1
Zeros (%)1.3%
Negative0
Negative (%)0.0%
Memory size825.0 B
2024-04-17T18:19:59.845808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1394
Q13327
median7286
Q313952
95-th percentile70773.6
Maximum134022
Range134022
Interquartile range (IQR)10625

Descriptive statistics

Standard deviation24028.103
Coefficient of variation (CV)1.5598758
Kurtosis10.451396
Mean15403.857
Median Absolute Deviation (MAD)4904
Skewness3.1141649
Sum1186097
Variance5.7734976 × 108
MonotonicityNot monotonic
2024-04-17T18:19:59.979780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
29700 2
 
2.6%
5667 2
 
2.6%
9545 1
 
1.3%
8950 1
 
1.3%
7730 1
 
1.3%
6950 1
 
1.3%
15000 1
 
1.3%
69467 1
 
1.3%
7280 1
 
1.3%
4217 1
 
1.3%
Other values (65) 65
84.4%
ValueCountFrequency (%)
0 1
1.3%
781 1
1.3%
1100 1
1.3%
1250 1
1.3%
1430 1
1.3%
1508 1
1.3%
1627 1
1.3%
1633 1
1.3%
1747 1
1.3%
1847 1
1.3%
ValueCountFrequency (%)
134022 1
1.3%
108000 1
1.3%
78750 1
1.3%
76000 1
1.3%
69467 1
1.3%
65117 1
1.3%
43500 1
1.3%
38333 1
1.3%
35611 1
1.3%
29700 2
2.6%

물가기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size748.0 B
2023-08-31
77 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-31
2nd row2023-08-31
3rd row2023-08-31
4th row2023-08-31
5th row2023-08-31

Common Values

ValueCountFrequency (%)
2023-08-31 77
100.0%

Length

2024-04-17T18:20:00.101068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:20:00.185430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-31 77
100.0%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size748.0 B
2023-10-06
77 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-06
2nd row2023-10-06
3rd row2023-10-06
4th row2023-10-06
5th row2023-10-06

Common Values

ValueCountFrequency (%)
2023-10-06 77
100.0%

Length

2024-04-17T18:20:00.268517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:20:00.353626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-06 77
100.0%

Interactions

2024-04-17T18:19:57.444534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:19:57.281805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:19:57.527061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:19:57.362983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T18:20:00.411419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번품목구분품목명규격 및 단위가격(원)
연번1.0000.9460.9430.9790.173
품목구분0.9461.0000.7961.0000.338
품목명0.9430.7961.0000.9900.000
규격 및 단위0.9791.0000.9901.0000.992
가격(원)0.1730.3380.0000.9921.000
2024-04-17T18:20:00.499511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번가격(원)품목구분
연번1.000-0.2530.889
가격(원)-0.2531.0000.218
품목구분0.8890.2181.000

Missing values

2024-04-17T18:19:57.951631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T18:19:58.054772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번품목구분품목명규격 및 단위가격(원)물가기준일데이터기준일
01외식설렁탕1인분(보통)95452023-08-312023-10-06
12외식냉면물냉면, 1인분(보통)72862023-08-312023-10-06
23외식비빔밥1인분(보통)66752023-08-312023-10-06
34외식갈비탕1인분(보통)120712023-08-312023-10-06
45외식삼계탕1인분(보통)143502023-08-312023-10-06
56외식김치찌개1인분(공기밥 포함)71672023-08-312023-10-06
67외식된장찌개1인분(공기밥 포함)68332023-08-312023-10-06
78외식칼국수1인분(보통)63332023-08-312023-10-06
89외식라면(외식)1인분(보통)38102023-08-312023-10-06
910외식자장면1인분(홀기준)56672023-08-312023-10-06
연번품목구분품목명규격 및 단위가격(원)물가기준일데이터기준일
6768생필품분유남양XO 800그램 1단계297002023-08-312023-10-06
6869생필품두부손두부 1모23332023-08-312023-10-06
6970생필품고추장해찬들 1킬로122002023-08-312023-10-06
7071생필품소주참이슬 1병15082023-08-312023-10-06
7172생필품세제비트(리필) 2.7킬로93402023-08-312023-10-06
7273생필품샴푸엘라스틴 600그램83302023-08-312023-10-06
7374생필품화장지뽀삐 30롤217672023-08-312023-10-06
7475생필품휘발유SK 1리터17472023-08-312023-10-06
7576생필품등유SK 1리터02023-08-312023-10-06
7677생필품경유SK 1리터16272023-08-312023-10-06