Amino acid dipeptide frequency for Pacmanvirus A23

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
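The counting described above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: the function names, the one-letter alphabet, the mapping of any non-standard letter to X (Xaa), and the normalization by the number of counted overlapping pairs are all assumptions, since the page does not state how its values were normalized.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; X stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation stated in the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping adjacent pairs are counted within each protein
    and normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if per_mille > UNIFORM_EXPECTATION:
        return "over"
    if per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "expected"
```

With the table values, classify(3.969) (AlaAla) gives "over" and classify(0.869) (AlaCys) gives "under", which corresponds to the red and blue marking described above.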

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to convert them to fractions.
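As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical and only serve this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation 1/n."""
    return per_mille_to_fraction(value) * n_dipeptides


# LysLeu is listed at 8.663 per mille, TrpPro at 0.111 per mille.
lys_leu_fraction = per_mille_to_fraction(8.663)  # about 0.008663
lys_leu_fold = fold_vs_uniform(8.663)            # about 3.47 times the uniform value
trp_pro_fold = fold_vs_uniform(0.111)            # about 0.044 times the uniform value
```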

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.969 ± 0.294
AlaCys: 0.869 ± 0.094
AlaAsp: 3.45 ± 0.192
AlaGlu: 3.757 ± 0.189
AlaPhe: 2.061 ± 0.13
AlaGly: 2.879 ± 0.209
AlaHis: 1.022 ± 0.092
AlaIle: 4.063 ± 0.232
AlaLys: 4.6 ± 0.188
AlaLeu: 3.995 ± 0.203
AlaMet: 1.065 ± 0.086
AlaAsn: 2.82 ± 0.135
AlaPro: 1.797 ± 0.112
AlaGln: 1.721 ± 0.146
AlaArg: 2.581 ± 0.144
AlaSer: 2.998 ± 0.193
AlaThr: 3.305 ± 0.228
AlaVal: 3.365 ± 0.196
AlaTrp: 0.477 ± 0.06
AlaTyr: 1.934 ± 0.137
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.218 ± 0.112
CysCys: 0.443 ± 0.067
CysAsp: 1.32 ± 0.13
CysGlu: 1.354 ± 0.126
CysPhe: 0.826 ± 0.087
CysGly: 1.67 ± 0.151
CysHis: 0.298 ± 0.057
CysIle: 1.704 ± 0.131
CysLys: 2.078 ± 0.161
CysLeu: 1.329 ± 0.123
CysMet: 0.571 ± 0.073
CysAsn: 1.218 ± 0.116
CysPro: 0.792 ± 0.1
CysGln: 0.528 ± 0.072
CysArg: 0.92 ± 0.095
CysSer: 1.278 ± 0.109
CysThr: 1.082 ± 0.103
CysVal: 1.278 ± 0.116
CysTrp: 0.247 ± 0.048
CysTyr: 0.809 ± 0.093
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.922 ± 0.176
AspCys: 1.218 ± 0.109
AspAsp: 4.234 ± 0.19
AspGlu: 5.29 ± 0.243
AspPhe: 2.862 ± 0.15
AspGly: 2.734 ± 0.135
AspHis: 0.792 ± 0.082
AspIle: 6.082 ± 0.242
AspLys: 4.745 ± 0.227
AspLeu: 4.498 ± 0.204
AspMet: 1.729 ± 0.128
AspAsn: 4.089 ± 0.183
AspPro: 1.959 ± 0.156
AspGln: 1.116 ± 0.09
AspArg: 2.121 ± 0.15
AspSer: 3.927 ± 0.196
AspThr: 3.365 ± 0.153
AspVal: 2.956 ± 0.161
AspTrp: 0.673 ± 0.074
AspTyr: 3.186 ± 0.177
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.768 ± 0.153
GluCys: 1.184 ± 0.107
GluAsp: 2.998 ± 0.185
GluGlu: 4.293 ± 0.25
GluPhe: 3.791 ± 0.174
GluGly: 1.942 ± 0.146
GluHis: 1.278 ± 0.113
GluIle: 6.993 ± 0.259
GluLys: 5.665 ± 0.225
GluLeu: 7.385 ± 0.275
GluMet: 1.908 ± 0.142
GluAsn: 4.361 ± 0.229
GluPro: 2.164 ± 0.151
GluGln: 3.126 ± 0.212
GluArg: 2.939 ± 0.182
GluSer: 3.961 ± 0.195
GluThr: 3.484 ± 0.194
GluVal: 3.314 ± 0.185
GluTrp: 0.673 ± 0.069
GluTyr: 3.791 ± 0.177
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.36 ± 0.164
PheCys: 0.971 ± 0.087
PheAsp: 3.492 ± 0.178
PheGlu: 2.862 ± 0.151
PhePhe: 1.082 ± 0.101
PheGly: 2.266 ± 0.168
PheHis: 0.877 ± 0.077
PheIle: 3.876 ± 0.204
PheLys: 3.603 ± 0.194
PheLeu: 2.411 ± 0.179
PheMet: 1.048 ± 0.092
PheAsn: 3.348 ± 0.162
PhePro: 1.423 ± 0.124
PheGln: 1.15 ± 0.101
PheArg: 1.593 ± 0.114
PheSer: 2.683 ± 0.144
PheThr: 3.092 ± 0.159
PheVal: 2.249 ± 0.133
PheTrp: 0.392 ± 0.055
PheTyr: 1.831 ± 0.135
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.871 ± 0.257
GlyCys: 0.946 ± 0.087
GlyAsp: 2.802 ± 0.19
GlyGlu: 3.067 ± 0.186
GlyPhe: 2.24 ± 0.127
GlyGly: 3.075 ± 0.275
GlyHis: 0.775 ± 0.089
GlyIle: 3.492 ± 0.192
GlyLys: 4.566 ± 0.226
GlyLeu: 3.484 ± 0.208
GlyMet: 1.073 ± 0.099
GlyAsn: 2.939 ± 0.202
GlyPro: 1.218 ± 0.104
GlyGln: 1.244 ± 0.121
GlyArg: 2.198 ± 0.139
GlySer: 3.467 ± 0.29
GlyThr: 2.93 ± 0.236
GlyVal: 3.092 ± 0.185
GlyTrp: 0.511 ± 0.06
GlyTyr: 2.325 ± 0.149
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.954 ± 0.089
HisCys: 0.622 ± 0.079
HisAsp: 0.784 ± 0.089
HisGlu: 1.244 ± 0.101
HisPhe: 0.733 ± 0.086
HisGly: 1.21 ± 0.126
HisHis: 0.451 ± 0.072
HisIle: 1.576 ± 0.13
HisLys: 1.712 ± 0.135
HisLeu: 1.848 ± 0.139
HisMet: 0.511 ± 0.062
HisAsn: 1.133 ± 0.113
HisPro: 0.681 ± 0.078
HisGln: 0.579 ± 0.076
HisArg: 0.852 ± 0.084
HisSer: 1.116 ± 0.101
HisThr: 0.843 ± 0.089
HisVal: 1.21 ± 0.099
HisTrp: 0.264 ± 0.045
HisTyr: 0.886 ± 0.082
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.136 ± 0.257
IleCys: 1.934 ± 0.138
IleAsp: 6.312 ± 0.277
IleGlu: 5.895 ± 0.274
IlePhe: 3.135 ± 0.169
IleGly: 3.85 ± 0.24
IleHis: 1.738 ± 0.134
IleIle: 7.675 ± 0.31
IleLys: 7.445 ± 0.239
IleLeu: 5.98 ± 0.245
IleMet: 1.883 ± 0.111
IleAsn: 6.431 ± 0.262
IlePro: 2.845 ± 0.145
IleGln: 2.453 ± 0.15
IleArg: 3.382 ± 0.175
IleSer: 5.878 ± 0.251
IleThr: 5.085 ± 0.212
IleVal: 4.685 ± 0.2
IleTrp: 0.647 ± 0.083
IleTyr: 4.191 ± 0.219
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.373 ± 0.189
LysCys: 2.044 ± 0.139
LysAsp: 3.85 ± 0.202
LysGlu: 4.889 ± 0.279
LysPhe: 4.038 ± 0.185
LysGly: 2.572 ± 0.15
LysHis: 1.755 ± 0.12
LysIle: 7.879 ± 0.311
LysLys: 7.385 ± 0.346
LysLeu: 8.663 ± 0.29
LysMet: 2.07 ± 0.14
LysAsn: 5.818 ± 0.232
LysPro: 2.837 ± 0.187
LysGln: 3.578 ± 0.174
LysArg: 3.186 ± 0.189
LysSer: 5.034 ± 0.277
LysThr: 4.302 ± 0.183
LysVal: 4.302 ± 0.208
LysTrp: 1.167 ± 0.103
LysTyr: 5.818 ± 0.242
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.276 ± 0.167
LeuCys: 1.38 ± 0.118
LeuAsp: 5.034 ± 0.216
LeuGlu: 4.583 ± 0.213
LeuPhe: 3.458 ± 0.182
LeuGly: 3.603 ± 0.201
LeuHis: 1.695 ± 0.111
LeuIle: 6.508 ± 0.266
LeuLys: 6.099 ± 0.257
LeuLeu: 6.431 ± 0.25
LeuMet: 1.848 ± 0.134
LeuAsn: 5.102 ± 0.197
LeuPro: 3.22 ± 0.218
LeuGln: 2.624 ± 0.148
LeuArg: 3.731 ± 0.209
LeuSer: 5.801 ± 0.216
LeuThr: 4.941 ± 0.194
LeuVal: 4.404 ± 0.207
LeuTrp: 0.528 ± 0.068
LeuTyr: 3.765 ± 0.169
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.201 ± 0.106
MetCys: 0.443 ± 0.059
MetAsp: 1.704 ± 0.133
MetGlu: 2.172 ± 0.136
MetPhe: 1.09 ± 0.088
MetGly: 1.056 ± 0.093
MetHis: 0.417 ± 0.06
MetIle: 1.797 ± 0.129
MetLys: 1.653 ± 0.128
MetLeu: 1.908 ± 0.121
MetMet: 0.664 ± 0.07
MetAsn: 1.738 ± 0.127
MetPro: 0.843 ± 0.082
MetGln: 1.022 ± 0.1
MetArg: 1.107 ± 0.091
MetSer: 2.113 ± 0.117
MetThr: 1.227 ± 0.076
MetVal: 1.21 ± 0.095
MetTrp: 0.17 ± 0.039
MetTyr: 1.039 ± 0.095
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.99 ± 0.188
AsnCys: 1.525 ± 0.143
AsnAsp: 3.399 ± 0.178
AsnGlu: 4.429 ± 0.224
AsnPhe: 2.564 ± 0.157
AsnGly: 3.441 ± 0.199
AsnHis: 1.32 ± 0.109
AsnIle: 6.516 ± 0.261
AsnLys: 5.886 ± 0.297
AsnLeu: 5.528 ± 0.231
AsnMet: 1.678 ± 0.125
AsnAsn: 4.966 ± 0.293
AsnPro: 2.956 ± 0.169
AsnGln: 1.755 ± 0.132
AsnArg: 2.939 ± 0.153
AsnSer: 4.021 ± 0.21
AsnThr: 3.697 ± 0.222
AsnVal: 3.739 ± 0.165
AsnTrp: 0.741 ± 0.076
AsnTyr: 3.331 ± 0.189
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.24 ± 0.158
ProCys: 0.554 ± 0.075
ProAsp: 2.206 ± 0.148
ProGlu: 3.322 ± 0.177
ProPhe: 1.124 ± 0.097
ProGly: 1.78 ± 0.149
ProHis: 0.537 ± 0.074
ProIle: 2.641 ± 0.166
ProLys: 2.76 ± 0.181
ProLeu: 2.257 ± 0.151
ProMet: 0.639 ± 0.07
ProAsn: 2.445 ± 0.155
ProPro: 1.056 ± 0.117
ProGln: 1.158 ± 0.105
ProArg: 1.354 ± 0.122
ProSer: 1.874 ± 0.148
ProThr: 2.547 ± 0.144
ProVal: 2.419 ± 0.136
ProTrp: 0.273 ± 0.046
ProTyr: 1.371 ± 0.091
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.457 ± 0.122
GlnCys: 0.588 ± 0.079
GlnAsp: 1.022 ± 0.094
GlnGlu: 1.712 ± 0.116
GlnPhe: 1.721 ± 0.129
GlnGly: 1.107 ± 0.118
GlnHis: 0.767 ± 0.073
GlnIle: 3.024 ± 0.166
GlnLys: 2.862 ± 0.172
GlnLeu: 2.99 ± 0.184
GlnMet: 0.971 ± 0.097
GlnAsn: 2.47 ± 0.151
GlnPro: 1.261 ± 0.115
GlnGln: 1.567 ± 0.156
GlnArg: 1.704 ± 0.111
GlnSer: 1.925 ± 0.133
GlnThr: 1.942 ± 0.148
GlnVal: 1.695 ± 0.115
GlnTrp: 0.443 ± 0.057
GlnTyr: 1.653 ± 0.109
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.342 ± 0.135
ArgCys: 1.005 ± 0.088
ArgAsp: 2.683 ± 0.174
ArgGlu: 3.024 ± 0.185
ArgPhe: 1.891 ± 0.14
ArgGly: 1.951 ± 0.168
ArgHis: 0.92 ± 0.093
ArgIle: 3.441 ± 0.146
ArgLys: 4.148 ± 0.213
ArgLeu: 3.067 ± 0.183
ArgMet: 1.176 ± 0.098
ArgAsn: 2.913 ± 0.174
ArgPro: 1.329 ± 0.117
ArgGln: 1.516 ± 0.114
ArgArg: 1.831 ± 0.135
ArgSer: 1.925 ± 0.127
ArgThr: 1.976 ± 0.125
ArgVal: 2.624 ± 0.146
ArgTrp: 0.468 ± 0.059
ArgTyr: 2.019 ± 0.139
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.492 ± 0.202
SerCys: 1.201 ± 0.134
SerAsp: 4.617 ± 0.243
SerGlu: 4.31 ± 0.173
SerPhe: 2.479 ± 0.162
SerGly: 4.191 ± 0.271
SerHis: 1.21 ± 0.096
SerIle: 4.864 ± 0.218
SerLys: 5.273 ± 0.186
SerLeu: 4.608 ± 0.207
SerMet: 1.414 ± 0.115
SerAsn: 3.782 ± 0.219
SerPro: 1.917 ± 0.14
SerGln: 2.13 ± 0.141
SerArg: 2.675 ± 0.16
SerSer: 4.498 ± 0.259
SerThr: 4.148 ± 0.205
SerVal: 3.748 ± 0.174
SerTrp: 0.554 ± 0.072
SerTyr: 2.377 ± 0.14
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.373 ± 0.204
ThrCys: 0.911 ± 0.096
ThrAsp: 3.586 ± 0.183
ThrGlu: 4.14 ± 0.208
ThrPhe: 2.283 ± 0.132
ThrGly: 3.535 ± 0.29
ThrHis: 0.937 ± 0.086
ThrIle: 4.796 ± 0.245
ThrLys: 4.242 ± 0.173
ThrLeu: 4.14 ± 0.176
ThrMet: 1.423 ± 0.127
ThrAsn: 3.595 ± 0.21
ThrPro: 2.462 ± 0.154
ThrGln: 2.027 ± 0.172
ThrArg: 2.164 ± 0.13
ThrSer: 3.535 ± 0.206
ThrThr: 3.867 ± 0.295
ThrVal: 3.918 ± 0.22
ThrTrp: 0.571 ± 0.079
ThrTyr: 2.402 ± 0.167
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.93 ± 0.178
ValCys: 1.337 ± 0.12
ValAsp: 3.458 ± 0.201
ValGlu: 3.833 ± 0.219
ValPhe: 2.547 ± 0.16
ValGly: 2.683 ± 0.169
ValHis: 1.107 ± 0.109
ValIle: 4.77 ± 0.228
ValLys: 4.915 ± 0.206
ValLeu: 4.012 ± 0.173
ValMet: 1.235 ± 0.098
ValAsn: 4.055 ± 0.167
ValPro: 2.172 ± 0.144
ValGln: 1.644 ± 0.117
ValArg: 2.334 ± 0.128
ValSer: 3.688 ± 0.183
ValThr: 3.288 ± 0.185
ValVal: 3.177 ± 0.172
ValTrp: 0.477 ± 0.062
ValTyr: 2.641 ± 0.141
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.554 ± 0.064
TrpCys: 0.451 ± 0.092
TrpAsp: 0.63 ± 0.071
TrpGlu: 0.639 ± 0.058
TrpPhe: 0.698 ± 0.09
TrpGly: 0.4 ± 0.057
TrpHis: 0.332 ± 0.05
TrpIle: 0.716 ± 0.077
TrpLys: 0.63 ± 0.074
TrpLeu: 0.818 ± 0.085
TrpMet: 0.324 ± 0.056
TrpAsn: 0.571 ± 0.072
TrpPro: 0.111 ± 0.026
TrpGln: 0.298 ± 0.05
TrpArg: 0.562 ± 0.074
TrpSer: 0.767 ± 0.097
TrpThr: 0.417 ± 0.067
TrpVal: 0.511 ± 0.063
TrpTrp: 0.17 ± 0.042
TrpTyr: 0.426 ± 0.062
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.3 ± 0.121
TyrCys: 1.337 ± 0.14
TyrAsp: 2.913 ± 0.162
TyrGlu: 3.203 ± 0.151
TyrPhe: 1.985 ± 0.132
TyrGly: 2.368 ± 0.15
TyrHis: 0.98 ± 0.091
TyrIle: 4.165 ± 0.185
TyrLys: 4.293 ± 0.225
TyrLeu: 3.68 ± 0.167
TyrMet: 1.337 ± 0.112
TyrAsn: 3.637 ± 0.18
TyrPro: 1.516 ± 0.119
TyrGln: 1.644 ± 0.106
TyrArg: 2.078 ± 0.143
TyrSer: 2.998 ± 0.174
TyrThr: 2.368 ± 0.148
TyrVal: 2.411 ± 0.136
TyrTrp: 0.579 ± 0.084
TyrTyr: 2.291 ± 0.151
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 465 proteins (117397 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
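A protein-level bootstrap of this kind can be sketched as follows. The page does not describe the exact procedure, so several points here are assumptions: whole proteins are resampled with replacement, "x100" is read as 100 resamples, the error is taken as the standard deviation across resamples, and the function names and the normalization by counted pairs are hypothetical.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of one dipeptide over overlapping adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Return (mean, standard deviation) of the frequency across resamples.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, so residues within one protein always stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues in the same sequence intact, which is presumably why the note specifies the protein level.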

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski