Amino acid dipeptide frequency for Port-miou virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
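As a minimal sketch of how such frequencies could be computed (the function name, the one-letter toy sequences, and the normalization by the total number of overlapping pairs are illustrative assumptions, not the database's documented procedure):

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned separately, so pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

toy = ["MKKLE", "KKA"]  # hypothetical one-letter sequences
freqs = dipeptide_per_mille(toy)
```

Here "KK" occurs in 2 of the 6 overlapping pairs, so its frequency is one third of all pairs. The table below uses three-letter codes (LysLys) for the same kind of quantity.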

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
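The uniform expectation and the red/blue labelling can be sketched as follows. This is only an illustration: the function names are hypothetical, and it is an assumption that the page compares each value directly against the uniform expectation without taking the error estimate into account.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"

print(expected_per_mille(20))            # 2.5
print(round(expected_per_mille(21), 3))  # 2.268
print(classify(11.389))                  # LysLys value from the table: over
print(classify(0.174))                   # MetTrp value from the table: under
```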

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
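For example, the unit conversion can be written as two small helpers (the names are illustrative, not part of the database):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

# AlaAla is listed as 3.635 per mille, i.e. about 0.003635 as a fraction
# and about 0.3635 percent.
ala_ala_fraction = per_mille_to_fraction(3.635)
ala_ala_percent = per_mille_to_percent(3.635)
```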

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.635 ± 0.295
AlaCys: 1.15 ± 0.105
AlaAsp: 2.204 ± 0.149
AlaGlu: 3.867 ± 0.193
AlaPhe: 2.755 ± 0.194
AlaGly: 2.398 ± 0.17
AlaHis: 0.87 ± 0.092
AlaIle: 2.678 ± 0.166
AlaLys: 4.747 ± 0.249
AlaLeu: 5.172 ± 0.287
AlaMet: 1.121 ± 0.107
AlaAsn: 2.021 ± 0.129
AlaPro: 2.088 ± 0.178
AlaGln: 1.711 ± 0.143
AlaArg: 2.649 ± 0.172
AlaSer: 4.505 ± 0.239
AlaThr: 3.335 ± 0.238
AlaVal: 3.616 ± 0.195
AlaTrp: 0.561 ± 0.071
AlaTyr: 1.576 ± 0.133
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.305 ± 0.108
CysCys: 0.696 ± 0.074
CysAsp: 1.402 ± 0.139
CysGlu: 1.789 ± 0.176
CysPhe: 1.615 ± 0.153
CysGly: 1.934 ± 0.161
CysHis: 0.522 ± 0.07
CysIle: 1.092 ± 0.096
CysLys: 1.75 ± 0.168
CysLeu: 2.117 ± 0.157
CysMet: 0.377 ± 0.056
CysAsn: 0.822 ± 0.101
CysPro: 1.431 ± 0.137
CysGln: 0.793 ± 0.096
CysArg: 1.083 ± 0.106
CysSer: 2.108 ± 0.19
CysThr: 1.054 ± 0.103
CysVal: 1.518 ± 0.113
CysTrp: 0.406 ± 0.074
CysTyr: 0.754 ± 0.089
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.668 ± 0.148
AspCys: 1.257 ± 0.138
AspAsp: 2.61 ± 0.18
AspGlu: 4.148 ± 0.238
AspPhe: 3.074 ± 0.167
AspGly: 4.254 ± 0.205
AspHis: 0.657 ± 0.099
AspIle: 3.567 ± 0.221
AspLys: 3.577 ± 0.196
AspLeu: 3.5 ± 0.219
AspMet: 0.947 ± 0.088
AspAsn: 1.866 ± 0.138
AspPro: 2.021 ± 0.152
AspGln: 1.073 ± 0.095
AspArg: 2.156 ± 0.135
AspSer: 3.393 ± 0.192
AspThr: 2.311 ± 0.154
AspVal: 3.838 ± 0.209
AspTrp: 0.764 ± 0.098
AspTyr: 1.576 ± 0.117
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.061 ± 0.207
GluCys: 1.537 ± 0.126
GluAsp: 4.322 ± 0.252
GluGlu: 9.011 ± 0.609
GluPhe: 4.708 ± 0.207
GluGly: 4.235 ± 0.206
GluHis: 1.615 ± 0.127
GluIle: 4.177 ± 0.236
GluLys: 9.214 ± 0.371
GluLeu: 6.255 ± 0.262
GluMet: 2.03 ± 0.154
GluAsn: 4.012 ± 0.204
GluPro: 1.943 ± 0.162
GluGln: 2.688 ± 0.189
GluArg: 5.25 ± 0.272
GluSer: 3.916 ± 0.218
GluThr: 5.018 ± 0.229
GluVal: 4.157 ± 0.214
GluTrp: 1.218 ± 0.11
GluTyr: 2.765 ± 0.145
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.538 ± 0.209
PheCys: 1.992 ± 0.173
PheAsp: 3.007 ± 0.18
PheGlu: 4.641 ± 0.223
PhePhe: 2.929 ± 0.183
PheGly: 4.09 ± 0.218
PheHis: 0.918 ± 0.092
PheIle: 2.011 ± 0.145
PheLys: 2.059 ± 0.131
PheLeu: 6.294 ± 0.261
PheMet: 1.238 ± 0.112
PheAsn: 1.131 ± 0.125
PhePro: 2.359 ± 0.147
PheGln: 1.663 ± 0.143
PheArg: 2.784 ± 0.194
PheSer: 4.998 ± 0.255
PheThr: 2.001 ± 0.14
PheVal: 4.844 ± 0.238
PheTrp: 1.17 ± 0.119
PheTyr: 1.721 ± 0.12
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.268 ± 0.227
GlyCys: 1.431 ± 0.149
GlyAsp: 2.775 ± 0.197
GlyGlu: 4.602 ± 0.23
GlyPhe: 2.688 ± 0.169
GlyGly: 3.268 ± 0.223
GlyHis: 0.976 ± 0.102
GlyIle: 3.877 ± 0.214
GlyLys: 6.797 ± 0.283
GlyLeu: 4.148 ± 0.189
GlyMet: 1.344 ± 0.121
GlyAsn: 2.852 ± 0.189
GlyPro: 1.595 ± 0.133
GlyGln: 2.291 ± 0.251
GlyArg: 3.374 ± 0.176
GlySer: 4.418 ± 0.245
GlyThr: 4.834 ± 0.303
GlyVal: 4.099 ± 0.218
GlyTrp: 0.851 ± 0.096
GlyTyr: 2.514 ± 0.169
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.657 ± 0.076
HisCys: 0.464 ± 0.075
HisAsp: 0.561 ± 0.073
HisGlu: 1.296 ± 0.115
HisPhe: 0.918 ± 0.1
HisGly: 1.953 ± 0.182
HisHis: 0.367 ± 0.056
HisIle: 1.16 ± 0.106
HisLys: 1.499 ± 0.141
HisLeu: 1.479 ± 0.122
HisMet: 0.309 ± 0.05
HisAsn: 0.677 ± 0.071
HisPro: 0.793 ± 0.085
HisGln: 0.686 ± 0.088
HisArg: 1.063 ± 0.108
HisSer: 1.412 ± 0.123
HisThr: 0.831 ± 0.087
HisVal: 0.909 ± 0.097
HisTrp: 0.251 ± 0.048
HisTyr: 0.512 ± 0.071
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.016 ± 0.212
IleCys: 1.315 ± 0.116
IleAsp: 2.388 ± 0.143
IleGlu: 2.823 ± 0.177
IlePhe: 3.19 ± 0.197
IleGly: 3.132 ± 0.245
IleHis: 0.976 ± 0.091
IleIle: 2.494 ± 0.207
IleLys: 3.461 ± 0.181
IleLeu: 5.211 ± 0.262
IleMet: 0.976 ± 0.1
IleAsn: 1.528 ± 0.125
IlePro: 2.804 ± 0.178
IleGln: 2.001 ± 0.153
IleArg: 2.755 ± 0.179
IleSer: 4.766 ± 0.244
IleThr: 2.146 ± 0.16
IleVal: 3.567 ± 0.213
IleTrp: 0.889 ± 0.095
IleTyr: 1.402 ± 0.121
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.283 ± 0.251
LysCys: 1.557 ± 0.155
LysAsp: 4.631 ± 0.226
LysGlu: 8.866 ± 0.409
LysPhe: 3.596 ± 0.203
LysGly: 4.476 ± 0.218
LysHis: 1.905 ± 0.159
LysIle: 4.409 ± 0.227
LysLys: 11.389 ± 0.733
LysLeu: 6.323 ± 0.261
LysMet: 1.75 ± 0.161
LysAsn: 4.902 ± 0.246
LysPro: 2.011 ± 0.192
LysGln: 2.369 ± 0.181
LysArg: 5.511 ± 0.232
LysSer: 4.505 ± 0.255
LysThr: 5.443 ± 0.217
LysVal: 4.921 ± 0.264
LysTrp: 0.88 ± 0.09
LysTyr: 3.364 ± 0.197
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.747 ± 0.226
LeuCys: 2.746 ± 0.186
LeuAsp: 4.409 ± 0.207
LeuGlu: 7.754 ± 0.319
LeuPhe: 5.114 ± 0.24
LeuGly: 5.172 ± 0.226
LeuHis: 1.499 ± 0.114
LeuIle: 2.784 ± 0.19
LeuLys: 5.685 ± 0.262
LeuLeu: 9.04 ± 0.331
LeuMet: 1.624 ± 0.132
LeuAsn: 2.755 ± 0.204
LeuPro: 4.679 ± 0.203
LeuGln: 3.732 ± 0.363
LeuArg: 4.573 ± 0.221
LeuSer: 7.657 ± 0.337
LeuThr: 3.142 ± 0.175
LeuVal: 5.859 ± 0.229
LeuTrp: 1.731 ± 0.127
LeuTyr: 2.581 ± 0.148
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.092 ± 0.109
MetCys: 0.522 ± 0.077
MetAsp: 1.141 ± 0.109
MetGlu: 1.808 ± 0.148
MetPhe: 1.092 ± 0.107
MetGly: 1.276 ± 0.115
MetHis: 0.406 ± 0.06
MetIle: 0.812 ± 0.093
MetLys: 1.508 ± 0.135
MetLeu: 1.45 ± 0.125
MetMet: 0.474 ± 0.066
MetAsn: 0.996 ± 0.086
MetPro: 0.561 ± 0.081
MetGln: 1.083 ± 0.097
MetArg: 1.015 ± 0.102
MetSer: 2.088 ± 0.143
MetThr: 1.383 ± 0.123
MetVal: 0.967 ± 0.115
MetTrp: 0.174 ± 0.044
MetTyr: 0.522 ± 0.067
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.34 ± 0.154
AsnCys: 0.86 ± 0.112
AsnAsp: 1.537 ± 0.118
AsnGlu: 2.262 ± 0.163
AsnPhe: 2.475 ± 0.153
AsnGly: 3.316 ± 0.197
AsnHis: 0.609 ± 0.083
AsnIle: 4.157 ± 0.222
AsnLys: 3.393 ± 0.226
AsnLeu: 3.123 ± 0.171
AsnMet: 0.889 ± 0.095
AsnAsn: 1.673 ± 0.157
AsnPro: 2.137 ± 0.154
AsnGln: 0.822 ± 0.085
AsnArg: 1.624 ± 0.145
AsnSer: 2.91 ± 0.159
AsnThr: 2.359 ± 0.246
AsnVal: 2.91 ± 0.172
AsnTrp: 0.541 ± 0.062
AsnTyr: 1.383 ± 0.117
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.45 ± 0.123
ProCys: 0.793 ± 0.091
ProAsp: 2.04 ± 0.141
ProGlu: 4.148 ± 0.2
ProPhe: 2.098 ± 0.143
ProGly: 2.03 ± 0.131
ProHis: 0.764 ± 0.085
ProIle: 2.03 ± 0.145
ProLys: 3.49 ± 0.241
ProLeu: 3.132 ± 0.173
ProMet: 0.706 ± 0.091
ProAsn: 1.847 ± 0.154
ProPro: 1.566 ± 0.178
ProGln: 1.663 ± 0.138
ProArg: 1.876 ± 0.134
ProSer: 3.277 ± 0.393
ProThr: 2.272 ± 0.175
ProVal: 2.301 ± 0.155
ProTrp: 0.512 ± 0.066
ProTyr: 1.325 ± 0.115
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.547 ± 0.131
GlnCys: 0.551 ± 0.097
GlnAsp: 1.634 ± 0.131
GlnGlu: 3.132 ± 0.195
GlnPhe: 0.957 ± 0.096
GlnGly: 2.088 ± 0.238
GlnHis: 0.57 ± 0.074
GlnIle: 1.721 ± 0.146
GlnLys: 4.399 ± 0.241
GlnLeu: 2.233 ± 0.144
GlnMet: 1.054 ± 0.102
GlnAsn: 2.001 ± 0.121
GlnPro: 0.918 ± 0.109
GlnGln: 1.45 ± 0.145
GlnArg: 2.726 ± 0.171
GlnSer: 1.605 ± 0.123
GlnThr: 2.34 ± 0.209
GlnVal: 2.359 ± 0.208
GlnTrp: 0.541 ± 0.072
GlnTyr: 0.996 ± 0.095
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.523 ± 0.163
ArgCys: 1.005 ± 0.095
ArgAsp: 2.63 ± 0.173
ArgGlu: 5.327 ± 0.251
ArgPhe: 2.359 ± 0.134
ArgGly: 2.871 ± 0.178
ArgHis: 1.218 ± 0.112
ArgIle: 3.2 ± 0.163
ArgLys: 5.472 ± 0.261
ArgLeu: 4.689 ± 0.223
ArgMet: 1.296 ± 0.127
ArgAsn: 2.697 ± 0.192
ArgPro: 1.682 ± 0.128
ArgGln: 2.059 ± 0.165
ArgArg: 2.939 ± 0.191
ArgSer: 2.92 ± 0.164
ArgThr: 2.9 ± 0.214
ArgVal: 3.771 ± 0.209
ArgTrp: 0.677 ± 0.073
ArgTyr: 1.76 ± 0.124
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.8 ± 0.245
SerCys: 2.079 ± 0.141
SerAsp: 3.548 ± 0.215
SerGlu: 5.395 ± 0.244
SerPhe: 5.288 ± 0.251
SerGly: 5.375 ± 0.223
SerHis: 1.15 ± 0.125
SerIle: 2.949 ± 0.182
SerLys: 5.733 ± 0.278
SerLeu: 7.464 ± 0.329
SerMet: 1.296 ± 0.113
SerAsn: 2.823 ± 0.196
SerPro: 3.132 ± 0.302
SerGln: 3.094 ± 0.174
SerArg: 3.925 ± 0.22
SerSer: 6.729 ± 0.347
SerThr: 3.306 ± 0.225
SerVal: 4.815 ± 0.241
SerTrp: 1.16 ± 0.112
SerTyr: 2.04 ± 0.137
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.688 ± 0.187
ThrCys: 1.296 ± 0.123
ThrAsp: 2.146 ± 0.183
ThrGlu: 3.442 ± 0.195
ThrPhe: 2.997 ± 0.187
ThrGly: 3.471 ± 0.28
ThrHis: 0.938 ± 0.101
ThrIle: 2.494 ± 0.146
ThrLys: 5.124 ± 0.277
ThrLeu: 5.317 ± 0.37
ThrMet: 0.918 ± 0.095
ThrAsn: 2.543 ± 0.269
ThrPro: 2.717 ± 0.251
ThrGln: 1.885 ± 0.157
ThrArg: 2.997 ± 0.196
ThrSer: 3.703 ± 0.193
ThrThr: 3.519 ± 0.304
ThrVal: 3.123 ± 0.16
ThrTrp: 0.735 ± 0.1
ThrTyr: 1.537 ± 0.117
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.674 ± 0.247
ValCys: 1.914 ± 0.145
ValAsp: 3.297 ± 0.166
ValGlu: 4.341 ± 0.276
ValPhe: 4.418 ± 0.24
ValGly: 3.509 ± 0.22
ValHis: 1.092 ± 0.105
ValIle: 2.997 ± 0.196
ValLys: 3.993 ± 0.237
ValLeu: 6.623 ± 0.27
ValMet: 1.073 ± 0.118
ValAsn: 1.982 ± 0.135
ValPro: 3.403 ± 0.188
ValGln: 2.224 ± 0.152
ValArg: 3.142 ± 0.175
ValSer: 6.42 ± 0.275
ValThr: 2.929 ± 0.184
ValVal: 4.795 ± 0.219
ValTrp: 1.17 ± 0.114
ValTyr: 2.253 ± 0.135
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.57 ± 0.066
TrpCys: 0.522 ± 0.095
TrpAsp: 0.928 ± 0.092
TrpGlu: 1.025 ± 0.103
TrpPhe: 1.17 ± 0.112
TrpGly: 0.57 ± 0.095
TrpHis: 0.232 ± 0.051
TrpIle: 0.696 ± 0.089
TrpLys: 1.518 ± 0.137
TrpLeu: 1.218 ± 0.116
TrpMet: 0.29 ± 0.049
TrpAsn: 0.812 ± 0.086
TrpPro: 0.271 ± 0.047
TrpGln: 0.445 ± 0.063
TrpArg: 1.025 ± 0.099
TrpSer: 1.354 ± 0.138
TrpThr: 0.812 ± 0.101
TrpVal: 0.725 ± 0.075
TrpTrp: 0.251 ± 0.048
TrpTyr: 0.619 ± 0.082
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.798 ± 0.145
TyrCys: 0.88 ± 0.101
TyrAsp: 2.05 ± 0.149
TyrGlu: 2.456 ± 0.16
TyrPhe: 1.943 ± 0.146
TyrGly: 2.581 ± 0.182
TyrHis: 0.512 ± 0.073
TyrIle: 1.537 ± 0.122
TyrLys: 2.282 ± 0.144
TyrLeu: 2.504 ± 0.15
TyrMet: 0.648 ± 0.085
TyrAsn: 1.383 ± 0.113
TyrPro: 1.209 ± 0.11
TyrGln: 1.092 ± 0.111
TyrArg: 1.528 ± 0.127
TyrSer: 2.417 ± 0.164
TyrThr: 1.653 ± 0.114
TyrVal: 2.156 ± 0.131
TyrTrp: 0.57 ± 0.081
TyrTyr: 0.957 ± 0.103
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 410 proteins (103435 amino acids)

Note: The error has been estimated by bootstrapping (100 iterations) at the protein level.
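A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement and the frequency is recomputed for each replicate. The function names and toy sequences are hypothetical, and reporting the standard deviation of the replicates as the error is an assumption; the page does not state which statistic is used.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all overlapping pairs."""
    total = sum(max(len(s) - 1, 0) for s in proteins)
    hits = sum(1 for s in proteins
               for i in range(len(s) - 1) if s[i:i + 2] == pair)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

toy = ["KKKK", "AAAA", "KAKA", "MKKLE"]  # hypothetical one-letter sequences
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster within particular proteins contribute to the estimated spread.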

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski