Amino acid dipepetide frequency for Emiliania huxleyi virus 86 (isolate United Kingdom/English Channel/1999) (EhV-86)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.13AlaAla: 5.13 ± 0.257
1.057AlaCys: 1.057 ± 0.096
2.919AlaAsp: 2.919 ± 0.155
3.244AlaGlu: 3.244 ± 0.193
2.618AlaPhe: 2.618 ± 0.133
3.447AlaGly: 3.447 ± 0.205
1.642AlaHis: 1.642 ± 0.136
4.943AlaIle: 4.943 ± 0.214
3.52AlaLys: 3.52 ± 0.2
5.138AlaLeu: 5.138 ± 0.221
1.967AlaMet: 1.967 ± 0.155
3.179AlaAsn: 3.179 ± 0.198
3.26AlaPro: 3.26 ± 0.243
1.911AlaGln: 1.911 ± 0.123
3.244AlaArg: 3.244 ± 0.17
5.081AlaSer: 5.081 ± 0.182
4.106AlaThr: 4.106 ± 0.184
4.284AlaVal: 4.284 ± 0.226
0.707AlaTrp: 0.707 ± 0.072
2.537AlaTyr: 2.537 ± 0.162
0.0AlaXaa: 0.0 ± 0.0
Cys
1.106CysAla: 1.106 ± 0.104
0.537CysCys: 0.537 ± 0.094
0.951CysAsp: 0.951 ± 0.095
1.0CysGlu: 1.0 ± 0.129
0.634CysPhe: 0.634 ± 0.072
1.358CysGly: 1.358 ± 0.152
0.309CysHis: 0.309 ± 0.053
1.618CysIle: 1.618 ± 0.113
1.187CysLys: 1.187 ± 0.126
1.089CysLeu: 1.089 ± 0.136
0.691CysMet: 0.691 ± 0.08
1.341CysAsn: 1.341 ± 0.116
0.862CysPro: 0.862 ± 0.097
0.431CysGln: 0.431 ± 0.063
0.732CysArg: 0.732 ± 0.099
1.285CysSer: 1.285 ± 0.141
1.26CysThr: 1.26 ± 0.134
1.163CysVal: 1.163 ± 0.114
0.13CysTrp: 0.13 ± 0.036
0.61CysTyr: 0.61 ± 0.07
0.0CysXaa: 0.0 ± 0.0
Asp
4.252AspAla: 4.252 ± 0.244
0.919AspCys: 0.919 ± 0.113
4.358AspAsp: 4.358 ± 0.229
4.187AspGlu: 4.187 ± 0.214
2.138AspPhe: 2.138 ± 0.136
3.195AspGly: 3.195 ± 0.178
1.163AspHis: 1.163 ± 0.1
4.667AspIle: 4.667 ± 0.209
2.886AspLys: 2.886 ± 0.199
3.618AspLeu: 3.618 ± 0.171
2.073AspMet: 2.073 ± 0.141
3.041AspAsn: 3.041 ± 0.181
2.496AspPro: 2.496 ± 0.158
1.528AspGln: 1.528 ± 0.112
2.496AspArg: 2.496 ± 0.164
3.488AspSer: 3.488 ± 0.18
4.138AspThr: 4.138 ± 0.185
3.959AspVal: 3.959 ± 0.213
0.569AspTrp: 0.569 ± 0.062
1.992AspTyr: 1.992 ± 0.144
0.0AspXaa: 0.0 ± 0.0
Glu
2.658GluAla: 2.658 ± 0.167
1.146GluCys: 1.146 ± 0.103
2.618GluAsp: 2.618 ± 0.173
3.154GluGlu: 3.154 ± 0.219
2.179GluPhe: 2.179 ± 0.15
1.902GluGly: 1.902 ± 0.138
1.561GluHis: 1.561 ± 0.13
3.618GluIle: 3.618 ± 0.191
2.87GluLys: 2.87 ± 0.214
4.927GluLeu: 4.927 ± 0.267
1.74GluMet: 1.74 ± 0.131
3.236GluAsn: 3.236 ± 0.152
2.089GluPro: 2.089 ± 0.175
2.195GluGln: 2.195 ± 0.119
2.35GluArg: 2.35 ± 0.141
3.057GluSer: 3.057 ± 0.153
3.39GluThr: 3.39 ± 0.189
2.756GluVal: 2.756 ± 0.158
0.642GluTrp: 0.642 ± 0.076
2.854GluTyr: 2.854 ± 0.142
0.0GluXaa: 0.0 ± 0.0
Phe
2.602PheAla: 2.602 ± 0.145
0.813PheCys: 0.813 ± 0.104
2.504PheAsp: 2.504 ± 0.162
2.008PheGlu: 2.008 ± 0.125
1.423PhePhe: 1.423 ± 0.132
2.423PheGly: 2.423 ± 0.135
0.927PheHis: 0.927 ± 0.097
3.154PheIle: 3.154 ± 0.184
1.886PheLys: 1.886 ± 0.14
2.837PheLeu: 2.837 ± 0.184
1.52PheMet: 1.52 ± 0.108
2.236PheAsn: 2.236 ± 0.139
1.748PhePro: 1.748 ± 0.153
0.732PheGln: 0.732 ± 0.072
1.463PheArg: 1.463 ± 0.099
2.707PheSer: 2.707 ± 0.162
2.65PheThr: 2.65 ± 0.154
2.829PheVal: 2.829 ± 0.185
0.415PheTrp: 0.415 ± 0.061
1.455PheTyr: 1.455 ± 0.097
0.0PheXaa: 0.0 ± 0.0
Gly
3.39GlyAla: 3.39 ± 0.187
0.951GlyCys: 0.951 ± 0.089
3.398GlyAsp: 3.398 ± 0.166
2.236GlyGlu: 2.236 ± 0.148
2.195GlyPhe: 2.195 ± 0.128
3.455GlyGly: 3.455 ± 0.278
1.146GlyHis: 1.146 ± 0.109
4.415GlyIle: 4.415 ± 0.187
3.48GlyLys: 3.48 ± 0.221
3.569GlyLeu: 3.569 ± 0.187
1.61GlyMet: 1.61 ± 0.119
3.081GlyAsn: 3.081 ± 0.219
1.65GlyPro: 1.65 ± 0.124
1.423GlyGln: 1.423 ± 0.119
2.325GlyArg: 2.325 ± 0.13
3.642GlySer: 3.642 ± 0.22
3.821GlyThr: 3.821 ± 0.199
3.593GlyVal: 3.593 ± 0.204
0.764GlyTrp: 0.764 ± 0.129
2.39GlyTyr: 2.39 ± 0.162
0.0GlyXaa: 0.0 ± 0.0
His
1.902HisAla: 1.902 ± 0.152
0.504HisCys: 0.504 ± 0.06
1.358HisAsp: 1.358 ± 0.109
1.358HisGlu: 1.358 ± 0.102
0.911HisPhe: 0.911 ± 0.089
1.577HisGly: 1.577 ± 0.123
0.813HisHis: 0.813 ± 0.088
2.049HisIle: 2.049 ± 0.132
1.472HisLys: 1.472 ± 0.12
1.74HisLeu: 1.74 ± 0.132
0.862HisMet: 0.862 ± 0.084
1.374HisAsn: 1.374 ± 0.116
1.415HisPro: 1.415 ± 0.101
0.748HisGln: 0.748 ± 0.082
1.041HisArg: 1.041 ± 0.079
1.423HisSer: 1.423 ± 0.107
1.74HisThr: 1.74 ± 0.121
1.886HisVal: 1.886 ± 0.132
0.195HisTrp: 0.195 ± 0.037
0.959HisTyr: 0.959 ± 0.097
0.0HisXaa: 0.0 ± 0.0
Ile
5.138IleAla: 5.138 ± 0.213
1.398IleCys: 1.398 ± 0.099
4.293IleAsp: 4.293 ± 0.188
3.691IleGlu: 3.691 ± 0.213
2.472IlePhe: 2.472 ± 0.166
4.382IleGly: 4.382 ± 0.225
1.878IleHis: 1.878 ± 0.137
5.642IleIle: 5.642 ± 0.267
4.203IleLys: 4.203 ± 0.246
5.748IleLeu: 5.748 ± 0.237
1.837IleMet: 1.837 ± 0.143
3.845IleAsn: 3.845 ± 0.193
3.471IlePro: 3.471 ± 0.17
2.219IleGln: 2.219 ± 0.128
3.252IleArg: 3.252 ± 0.178
5.179IleSer: 5.179 ± 0.251
5.041IleThr: 5.041 ± 0.226
5.276IleVal: 5.276 ± 0.243
0.707IleTrp: 0.707 ± 0.082
2.919IleTyr: 2.919 ± 0.161
0.0IleXaa: 0.0 ± 0.0
Lys
2.756LysAla: 2.756 ± 0.219
1.106LysCys: 1.106 ± 0.107
2.528LysAsp: 2.528 ± 0.169
2.829LysGlu: 2.829 ± 0.185
2.276LysPhe: 2.276 ± 0.151
1.935LysGly: 1.935 ± 0.156
2.024LysHis: 2.024 ± 0.138
3.528LysIle: 3.528 ± 0.197
4.927LysLys: 4.927 ± 0.299
4.935LysLeu: 4.935 ± 0.291
1.78LysMet: 1.78 ± 0.099
3.829LysAsn: 3.829 ± 0.208
2.398LysPro: 2.398 ± 0.172
2.203LysGln: 2.203 ± 0.159
3.894LysArg: 3.894 ± 0.265
3.545LysSer: 3.545 ± 0.208
4.065LysThr: 4.065 ± 0.192
2.984LysVal: 2.984 ± 0.178
0.724LysTrp: 0.724 ± 0.08
3.415LysTyr: 3.415 ± 0.186
0.0LysXaa: 0.0 ± 0.0
Leu
5.179LeuAla: 5.179 ± 0.219
1.512LeuCys: 1.512 ± 0.141
3.683LeuAsp: 3.683 ± 0.179
3.545LeuGlu: 3.545 ± 0.175
3.504LeuPhe: 3.504 ± 0.188
3.707LeuGly: 3.707 ± 0.23
2.057LeuHis: 2.057 ± 0.113
5.236LeuIle: 5.236 ± 0.253
4.268LeuLys: 4.268 ± 0.231
6.203LeuLeu: 6.203 ± 0.256
2.293LeuMet: 2.293 ± 0.14
3.748LeuAsn: 3.748 ± 0.194
4.528LeuPro: 4.528 ± 0.432
2.309LeuGln: 2.309 ± 0.138
3.382LeuArg: 3.382 ± 0.183
5.976LeuSer: 5.976 ± 0.244
4.902LeuThr: 4.902 ± 0.204
4.317LeuVal: 4.317 ± 0.184
0.813LeuTrp: 0.813 ± 0.075
3.455LeuTyr: 3.455 ± 0.207
0.0LeuXaa: 0.0 ± 0.0
Met
1.959MetAla: 1.959 ± 0.147
0.463MetCys: 0.463 ± 0.052
1.821MetAsp: 1.821 ± 0.129
1.659MetGlu: 1.659 ± 0.12
1.472MetPhe: 1.472 ± 0.127
1.366MetGly: 1.366 ± 0.102
0.894MetHis: 0.894 ± 0.089
1.756MetIle: 1.756 ± 0.129
1.772MetLys: 1.772 ± 0.136
2.423MetLeu: 2.423 ± 0.173
1.089MetMet: 1.089 ± 0.095
1.602MetAsn: 1.602 ± 0.114
1.618MetPro: 1.618 ± 0.131
1.138MetGln: 1.138 ± 0.097
1.317MetArg: 1.317 ± 0.113
2.715MetSer: 2.715 ± 0.161
2.0MetThr: 2.0 ± 0.134
1.537MetVal: 1.537 ± 0.136
0.382MetTrp: 0.382 ± 0.047
1.707MetTyr: 1.707 ± 0.137
0.0MetXaa: 0.0 ± 0.0
Asn
4.065AsnAla: 4.065 ± 0.189
0.772AsnCys: 0.772 ± 0.1
3.772AsnAsp: 3.772 ± 0.184
2.894AsnGlu: 2.894 ± 0.18
1.61AsnPhe: 1.61 ± 0.101
3.789AsnGly: 3.789 ± 0.201
1.195AsnHis: 1.195 ± 0.12
4.203AsnIle: 4.203 ± 0.194
3.675AsnLys: 3.675 ± 0.218
3.593AsnLeu: 3.593 ± 0.187
1.797AsnMet: 1.797 ± 0.121
3.675AsnAsn: 3.675 ± 0.179
2.715AsnPro: 2.715 ± 0.154
1.366AsnGln: 1.366 ± 0.096
2.585AsnArg: 2.585 ± 0.143
3.415AsnSer: 3.415 ± 0.162
4.106AsnThr: 4.106 ± 0.189
3.764AsnVal: 3.764 ± 0.198
0.553AsnTrp: 0.553 ± 0.063
1.984AsnTyr: 1.984 ± 0.149
0.0AsnXaa: 0.0 ± 0.0
Pro
2.707ProAla: 2.707 ± 0.198
0.74ProCys: 0.74 ± 0.078
2.87ProAsp: 2.87 ± 0.169
2.463ProGlu: 2.463 ± 0.163
1.845ProPhe: 1.845 ± 0.129
2.618ProGly: 2.618 ± 0.193
1.033ProHis: 1.033 ± 0.084
3.089ProIle: 3.089 ± 0.162
2.211ProLys: 2.211 ± 0.159
3.78ProLeu: 3.78 ± 0.435
1.203ProMet: 1.203 ± 0.105
2.398ProAsn: 2.398 ± 0.166
16.764ProPro: 16.764 ± 5.05
1.398ProGln: 1.398 ± 0.103
1.886ProArg: 1.886 ± 0.114
8.26ProSer: 8.26 ± 1.458
3.593ProThr: 3.593 ± 0.22
3.74ProVal: 3.74 ± 0.22
0.455ProTrp: 0.455 ± 0.069
1.756ProTyr: 1.756 ± 0.117
0.0ProXaa: 0.0 ± 0.0
Gln
1.732GlnAla: 1.732 ± 0.12
0.878GlnCys: 0.878 ± 0.13
1.244GlnAsp: 1.244 ± 0.098
1.423GlnGlu: 1.423 ± 0.104
1.398GlnPhe: 1.398 ± 0.101
1.154GlnGly: 1.154 ± 0.094
0.967GlnHis: 0.967 ± 0.088
2.073GlnIle: 2.073 ± 0.121
1.967GlnLys: 1.967 ± 0.171
2.577GlnLeu: 2.577 ± 0.156
1.0GlnMet: 1.0 ± 0.092
1.545GlnAsn: 1.545 ± 0.122
1.39GlnPro: 1.39 ± 0.126
1.325GlnGln: 1.325 ± 0.113
1.748GlnArg: 1.748 ± 0.119
2.122GlnSer: 2.122 ± 0.137
2.016GlnThr: 2.016 ± 0.152
1.837GlnVal: 1.837 ± 0.123
0.398GlnTrp: 0.398 ± 0.057
1.382GlnTyr: 1.382 ± 0.117
0.0GlnXaa: 0.0 ± 0.0
Arg
2.992ArgAla: 2.992 ± 0.167
0.748ArgCys: 0.748 ± 0.075
2.764ArgAsp: 2.764 ± 0.18
2.293ArgGlu: 2.293 ± 0.142
1.675ArgPhe: 1.675 ± 0.119
2.187ArgGly: 2.187 ± 0.13
1.195ArgHis: 1.195 ± 0.089
3.488ArgIle: 3.488 ± 0.171
3.106ArgLys: 3.106 ± 0.207
3.203ArgLeu: 3.203 ± 0.196
1.683ArgMet: 1.683 ± 0.156
2.976ArgAsn: 2.976 ± 0.177
1.902ArgPro: 1.902 ± 0.128
1.878ArgGln: 1.878 ± 0.133
3.439ArgArg: 3.439 ± 0.303
3.333ArgSer: 3.333 ± 0.204
2.951ArgThr: 2.951 ± 0.176
2.943ArgVal: 2.943 ± 0.167
0.455ArgTrp: 0.455 ± 0.061
1.829ArgTyr: 1.829 ± 0.124
0.0ArgXaa: 0.0 ± 0.0
Ser
4.707SerAla: 4.707 ± 0.214
1.252SerCys: 1.252 ± 0.119
4.26SerAsp: 4.26 ± 0.211
3.569SerGlu: 3.569 ± 0.177
2.715SerPhe: 2.715 ± 0.155
3.935SerGly: 3.935 ± 0.213
1.829SerHis: 1.829 ± 0.129
4.927SerIle: 4.927 ± 0.216
3.919SerLys: 3.919 ± 0.17
5.065SerLeu: 5.065 ± 0.216
2.049SerMet: 2.049 ± 0.132
3.667SerAsn: 3.667 ± 0.188
7.154SerPro: 7.154 ± 1.456
1.967SerGln: 1.967 ± 0.131
3.65SerArg: 3.65 ± 0.221
6.179SerSer: 6.179 ± 0.297
5.146SerThr: 5.146 ± 0.24
4.984SerVal: 4.984 ± 0.19
0.87SerTrp: 0.87 ± 0.092
3.089SerTyr: 3.089 ± 0.149
0.0SerXaa: 0.0 ± 0.0
Thr
4.301ThrAla: 4.301 ± 0.22
1.406ThrCys: 1.406 ± 0.123
4.032ThrAsp: 4.032 ± 0.2
3.683ThrGlu: 3.683 ± 0.212
2.577ThrPhe: 2.577 ± 0.138
3.707ThrGly: 3.707 ± 0.194
1.805ThrHis: 1.805 ± 0.124
4.813ThrIle: 4.813 ± 0.236
3.797ThrLys: 3.797 ± 0.204
5.382ThrLeu: 5.382 ± 0.23
1.789ThrMet: 1.789 ± 0.13
3.496ThrAsn: 3.496 ± 0.172
3.967ThrPro: 3.967 ± 0.224
2.154ThrGln: 2.154 ± 0.155
3.358ThrArg: 3.358 ± 0.191
4.943ThrSer: 4.943 ± 0.201
5.13ThrThr: 5.13 ± 0.251
4.301ThrVal: 4.301 ± 0.157
0.805ThrTrp: 0.805 ± 0.082
2.602ThrTyr: 2.602 ± 0.174
0.0ThrXaa: 0.0 ± 0.0
Val
4.211ValAla: 4.211 ± 0.212
1.195ValCys: 1.195 ± 0.099
4.024ValAsp: 4.024 ± 0.221
3.146ValGlu: 3.146 ± 0.168
2.48ValPhe: 2.48 ± 0.162
3.488ValGly: 3.488 ± 0.216
1.504ValHis: 1.504 ± 0.117
5.122ValIle: 5.122 ± 0.26
3.423ValLys: 3.423 ± 0.223
5.065ValLeu: 5.065 ± 0.216
2.057ValMet: 2.057 ± 0.129
3.496ValAsn: 3.496 ± 0.179
3.455ValPro: 3.455 ± 0.192
1.789ValGln: 1.789 ± 0.126
2.691ValArg: 2.691 ± 0.145
4.772ValSer: 4.772 ± 0.208
4.195ValThr: 4.195 ± 0.168
4.471ValVal: 4.471 ± 0.249
0.691ValTrp: 0.691 ± 0.082
2.886ValTyr: 2.886 ± 0.177
0.0ValXaa: 0.0 ± 0.0
Trp
0.577TrpAla: 0.577 ± 0.078
0.195TrpCys: 0.195 ± 0.041
0.789TrpAsp: 0.789 ± 0.114
0.431TrpGlu: 0.431 ± 0.065
0.553TrpPhe: 0.553 ± 0.065
0.659TrpGly: 0.659 ± 0.069
0.358TrpHis: 0.358 ± 0.057
0.732TrpIle: 0.732 ± 0.083
0.772TrpLys: 0.772 ± 0.081
0.837TrpLeu: 0.837 ± 0.087
0.301TrpMet: 0.301 ± 0.057
0.659TrpAsn: 0.659 ± 0.07
0.382TrpPro: 0.382 ± 0.059
0.35TrpGln: 0.35 ± 0.062
0.447TrpArg: 0.447 ± 0.063
0.797TrpSer: 0.797 ± 0.076
0.667TrpThr: 0.667 ± 0.079
0.593TrpVal: 0.593 ± 0.061
0.236TrpTrp: 0.236 ± 0.046
0.585TrpTyr: 0.585 ± 0.064
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.569TyrAla: 2.569 ± 0.154
0.699TyrCys: 0.699 ± 0.089
2.976TyrAsp: 2.976 ± 0.162
2.13TyrGlu: 2.13 ± 0.129
1.659TyrPhe: 1.659 ± 0.125
2.252TyrGly: 2.252 ± 0.15
1.024TyrHis: 1.024 ± 0.092
3.52TyrIle: 3.52 ± 0.178
2.341TyrLys: 2.341 ± 0.166
2.886TyrLeu: 2.886 ± 0.161
1.341TyrMet: 1.341 ± 0.115
3.106TyrAsn: 3.106 ± 0.173
1.545TyrPro: 1.545 ± 0.13
1.114TyrGln: 1.114 ± 0.08
1.74TyrArg: 1.74 ± 0.156
3.016TyrSer: 3.016 ± 0.156
3.138TyrThr: 3.138 ± 0.16
2.943TyrVal: 2.943 ± 0.182
0.406TyrTrp: 0.406 ± 0.057
1.87TyrTyr: 1.87 ± 0.15
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 472 proteins (123003 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski