Amino acid dipeptide frequency for Paramecium bursaria Chlorella virus NY2A (PBCV-NY2A)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
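As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the use of one-letter codes, the counting of overlapping pairs within each protein, and the normalisation by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
freqs = dipeptide_frequencies(["AAC", "ACA"])
print(freqs)  # {'AA': 250.0, 'AC': 500.0, 'CA': 250.0}
```

With this normalisation the frequencies of all observed dipeptides sum to 1000‰.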

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally common, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.
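The baseline and the over/under comparison can be sketched in a few lines of Python. The names `EXPECTED_PER_MILLE` and `representation` are hypothetical, and comparing directly against the uniform 2.5‰ value is an assumption based on the description above; the exact rule used to colour the table is not stated here.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / 400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD ** 2


def representation(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide frequency (in per mille) against the baseline."""
    if freq_per_mille > expected:
        return "over"      # would be shown in red
    if freq_per_mille < expected:
        return "under"     # would be shown in blue
    return "expected"


# Two values taken from the table: AlaAla and AlaCys.
print(representation(3.08))   # over
print(representation(0.823))  # under
```

Note that this uniform baseline ignores the fact that single amino acids are not equally abundant, so a rare residue such as Trp will tend to appear underrepresented in every pair.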

All values are given in per mille (‰); multiply by 10^-3 to obtain fractions (for example, 3.08‰ corresponds to 0.00308).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.08 ± 0.164
AlaCys: 0.823 ± 0.077
AlaAsp: 2.385 ± 0.161
AlaGlu: 2.124 ± 0.132
AlaPhe: 2.736 ± 0.16
AlaGly: 3.016 ± 0.3
AlaHis: 1.11 ± 0.087
AlaIle: 3.871 ± 0.165
AlaLys: 3.437 ± 0.204
AlaLeu: 3.769 ± 0.191
AlaMet: 1.371 ± 0.109
AlaAsn: 3.055 ± 0.282
AlaPro: 2.691 ± 0.434
AlaGln: 1.397 ± 0.108
AlaArg: 2.978 ± 0.178
AlaSer: 4.215 ± 0.243
AlaThr: 3.048 ± 0.203
AlaVal: 2.965 ± 0.163
AlaTrp: 0.383 ± 0.054
AlaTyr: 1.652 ± 0.102
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.886 ± 0.079
CysCys: 0.472 ± 0.058
CysAsp: 1.295 ± 0.103
CysGlu: 0.733 ± 0.077
CysPhe: 1.371 ± 0.108
CysGly: 1.428 ± 0.137
CysHis: 0.574 ± 0.064
CysIle: 1.569 ± 0.114
CysLys: 1.078 ± 0.128
CysLeu: 1.556 ± 0.107
CysMet: 0.478 ± 0.068
CysAsn: 0.95 ± 0.08
CysPro: 1.071 ± 0.129
CysGln: 0.415 ± 0.049
CysArg: 1.256 ± 0.109
CysSer: 1.792 ± 0.137
CysThr: 0.899 ± 0.079
CysVal: 1.837 ± 0.152
CysTrp: 0.172 ± 0.029
CysTyr: 0.561 ± 0.052
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.322 ± 0.17
AspCys: 0.848 ± 0.075
AspAsp: 4.547 ± 0.239
AspGlu: 3.96 ± 0.212
AspPhe: 2.908 ± 0.159
AspGly: 3.59 ± 0.181
AspHis: 1.186 ± 0.09
AspIle: 5.287 ± 0.201
AspLys: 3.246 ± 0.153
AspLeu: 3.813 ± 0.193
AspMet: 1.588 ± 0.087
AspAsn: 2.64 ± 0.149
AspPro: 2.226 ± 0.137
AspGln: 0.982 ± 0.077
AspArg: 2.685 ± 0.126
AspSer: 3.284 ± 0.161
AspThr: 3.348 ± 0.125
AspVal: 4.33 ± 0.177
AspTrp: 0.517 ± 0.068
AspTyr: 1.83 ± 0.116
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.13 ± 0.131
GluCys: 1.173 ± 0.088
GluAsp: 2.761 ± 0.16
GluGlu: 3.074 ± 0.185
GluPhe: 2.589 ± 0.144
GluGly: 1.473 ± 0.093
GluHis: 1.671 ± 0.1
GluIle: 4.158 ± 0.192
GluLys: 4.177 ± 0.201
GluLeu: 4.107 ± 0.185
GluMet: 1.397 ± 0.122
GluAsn: 3.22 ± 0.149
GluPro: 1.645 ± 0.12
GluGln: 1.607 ± 0.105
GluArg: 3.278 ± 0.145
GluSer: 3.252 ± 0.17
GluThr: 3.361 ± 0.163
GluVal: 2.468 ± 0.172
GluTrp: 0.721 ± 0.073
GluTyr: 2.697 ± 0.165
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.08 ± 0.175
PheCys: 1.186 ± 0.102
PheAsp: 3.705 ± 0.169
PheGlu: 3.316 ± 0.16
PhePhe: 3.622 ± 0.186
PheGly: 3.686 ± 0.236
PheHis: 1.346 ± 0.102
PheIle: 3.775 ± 0.175
PheLys: 2.513 ± 0.131
PheLeu: 4.783 ± 0.246
PheMet: 1.441 ± 0.09
PheAsn: 1.722 ± 0.105
PhePro: 2.589 ± 0.192
PheGln: 1.346 ± 0.106
PheArg: 2.991 ± 0.172
PheSer: 4.426 ± 0.218
PheThr: 3.112 ± 0.161
PheVal: 4.936 ± 0.196
PheTrp: 0.542 ± 0.054
PheTyr: 1.773 ± 0.115
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.908 ± 0.288
GlyCys: 1.052 ± 0.079
GlyAsp: 2.774 ± 0.151
GlyGlu: 2.634 ± 0.137
GlyPhe: 3.303 ± 0.224
GlyGly: 3.265 ± 0.196
GlyHis: 1.052 ± 0.101
GlyIle: 3.947 ± 0.18
GlyLys: 3.954 ± 0.278
GlyLeu: 3.271 ± 0.213
GlyMet: 1.033 ± 0.091
GlyAsn: 4.528 ± 0.826
GlyPro: 1.454 ± 0.137
GlyGln: 1.46 ± 0.139
GlyArg: 3.042 ± 0.175
GlySer: 3.986 ± 0.317
GlyThr: 3.137 ± 0.216
GlyVal: 3.546 ± 0.23
GlyTrp: 0.619 ± 0.071
GlyTyr: 2.245 ± 0.134
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.18 ± 0.1
HisCys: 0.415 ± 0.059
HisAsp: 1.556 ± 0.115
HisGlu: 1.454 ± 0.118
HisPhe: 1.224 ± 0.094
HisGly: 1.295 ± 0.101
HisHis: 1.097 ± 0.097
HisIle: 2.028 ± 0.13
HisLys: 1.352 ± 0.097
HisLeu: 2.117 ± 0.111
HisMet: 0.65 ± 0.061
HisAsn: 1.008 ± 0.076
HisPro: 1.033 ± 0.087
HisGln: 0.631 ± 0.062
HisArg: 2.073 ± 0.145
HisSer: 1.416 ± 0.102
HisThr: 1.409 ± 0.119
HisVal: 2.13 ± 0.131
HisTrp: 0.332 ± 0.046
HisTyr: 0.759 ± 0.08
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.253 ± 0.204
IleCys: 1.677 ± 0.134
IleAsp: 4.987 ± 0.203
IleGlu: 4.037 ± 0.212
IlePhe: 4.591 ± 0.219
IleGly: 3.845 ± 0.231
IleHis: 2.034 ± 0.132
IleIle: 5.688 ± 0.251
IleLys: 3.724 ± 0.194
IleLeu: 6.332 ± 0.247
IleMet: 2.098 ± 0.117
IleAsn: 3.284 ± 0.164
IlePro: 3.622 ± 0.184
IleGln: 2.124 ± 0.128
IleArg: 4.636 ± 0.214
IleSer: 6.823 ± 0.228
IleThr: 4.343 ± 0.222
IleVal: 5.624 ± 0.196
IleTrp: 0.721 ± 0.079
IleTyr: 2.659 ± 0.153
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.672 ± 0.19
LysCys: 1.601 ± 0.195
LysAsp: 3.316 ± 0.187
LysGlu: 3.278 ± 0.167
LysPhe: 3.086 ± 0.143
LysGly: 2.423 ± 0.148
LysHis: 1.83 ± 0.128
LysIle: 5.478 ± 0.231
LysLys: 6.709 ± 0.374
LysLeu: 5.312 ± 0.225
LysMet: 2.334 ± 0.13
LysAsn: 4.974 ± 0.234
LysPro: 3.66 ± 0.565
LysGln: 2.149 ± 0.164
LysArg: 3.514 ± 0.174
LysSer: 4.789 ± 0.208
LysThr: 4.655 ± 0.25
LysVal: 2.851 ± 0.217
LysTrp: 0.65 ± 0.068
LysTyr: 3.265 ± 0.148
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.469 ± 0.193
LeuCys: 1.607 ± 0.119
LeuAsp: 4.1 ± 0.156
LeuGlu: 3.775 ± 0.169
LeuPhe: 4.355 ± 0.24
LeuGly: 3.699 ± 0.227
LeuHis: 2.111 ± 0.153
LeuIle: 4.872 ± 0.22
LeuLys: 5.0 ± 0.207
LeuLeu: 6.192 ± 0.292
LeuMet: 2.34 ± 0.127
LeuAsn: 3.597 ± 0.206
LeuPro: 3.558 ± 0.202
LeuGln: 2.366 ± 0.149
LeuArg: 4.77 ± 0.189
LeuSer: 6.549 ± 0.243
LeuThr: 4.368 ± 0.224
LeuVal: 4.642 ± 0.177
LeuTrp: 0.842 ± 0.077
LeuTyr: 2.831 ± 0.151
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.039 ± 0.08
MetCys: 0.631 ± 0.059
MetAsp: 1.078 ± 0.088
MetGlu: 1.199 ± 0.099
MetPhe: 2.041 ± 0.132
MetGly: 1.18 ± 0.122
MetHis: 0.421 ± 0.054
MetIle: 2.359 ± 0.128
MetLys: 2.436 ± 0.127
MetLeu: 2.2 ± 0.131
MetMet: 1.141 ± 0.112
MetAsn: 2.092 ± 0.138
MetPro: 1.014 ± 0.092
MetGln: 0.619 ± 0.067
MetArg: 1.46 ± 0.087
MetSer: 2.997 ± 0.152
MetThr: 2.232 ± 0.115
MetVal: 1.307 ± 0.098
MetTrp: 0.395 ± 0.066
MetTyr: 1.148 ± 0.086
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.08 ± 0.221
AsnCys: 0.772 ± 0.09
AsnAsp: 3.093 ± 0.154
AsnGlu: 2.551 ± 0.129
AsnPhe: 2.608 ± 0.147
AsnGly: 3.526 ± 0.28
AsnHis: 1.301 ± 0.093
AsnIle: 5.376 ± 0.367
AsnLys: 3.501 ± 0.225
AsnLeu: 3.641 ± 0.169
AsnMet: 1.645 ± 0.087
AsnAsn: 2.736 ± 0.156
AsnPro: 2.487 ± 0.168
AsnGln: 1.052 ± 0.085
AsnArg: 2.544 ± 0.167
AsnSer: 3.444 ± 0.183
AsnThr: 3.724 ± 0.251
AsnVal: 5.446 ± 0.597
AsnTrp: 0.485 ± 0.075
AsnTyr: 1.569 ± 0.133
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.723 ± 0.445
ProCys: 0.772 ± 0.071
ProAsp: 2.723 ± 0.137
ProGlu: 2.71 ± 0.179
ProPhe: 1.983 ± 0.107
ProGly: 2.334 ± 0.173
ProHis: 0.823 ± 0.094
ProIle: 2.787 ± 0.166
ProLys: 3.737 ± 0.477
ProLeu: 2.863 ± 0.166
ProMet: 1.269 ± 0.127
ProAsn: 2.092 ± 0.137
ProPro: 1.996 ± 0.144
ProGln: 1.122 ± 0.114
ProArg: 2.257 ± 0.146
ProSer: 3.845 ± 0.234
ProThr: 2.857 ± 0.199
ProVal: 2.927 ± 0.236
ProTrp: 0.37 ± 0.055
ProTyr: 1.295 ± 0.088
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.059 ± 0.083
GlnCys: 0.561 ± 0.078
GlnAsp: 1.192 ± 0.096
GlnGlu: 0.969 ± 0.077
GlnPhe: 1.377 ± 0.103
GlnGly: 0.995 ± 0.098
GlnHis: 0.797 ± 0.075
GlnIle: 1.939 ± 0.11
GlnLys: 2.334 ± 0.155
GlnLeu: 1.97 ± 0.141
GlnMet: 0.855 ± 0.083
GlnAsn: 1.735 ± 0.127
GlnPro: 1.078 ± 0.107
GlnGln: 1.199 ± 0.137
GlnArg: 1.894 ± 0.129
GlnSer: 1.913 ± 0.121
GlnThr: 1.735 ± 0.104
GlnVal: 1.479 ± 0.143
GlnTrp: 0.357 ± 0.051
GlnTyr: 1.212 ± 0.097
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.226 ± 0.135
ArgCys: 1.473 ± 0.096
ArgAsp: 3.061 ± 0.154
ArgGlu: 3.016 ± 0.152
ArgPhe: 2.806 ± 0.17
ArgGly: 2.615 ± 0.155
ArgHis: 1.633 ± 0.094
ArgIle: 4.458 ± 0.207
ArgLys: 4.26 ± 0.208
ArgLeu: 3.992 ± 0.175
ArgMet: 1.983 ± 0.129
ArgAsn: 2.991 ± 0.128
ArgPro: 2.2 ± 0.149
ArgGln: 1.773 ± 0.143
ArgArg: 4.355 ± 0.219
ArgSer: 4.898 ± 0.204
ArgThr: 3.335 ± 0.188
ArgVal: 3.909 ± 0.175
ArgTrp: 0.823 ± 0.073
ArgTyr: 2.385 ± 0.119
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.056 ± 0.224
SerCys: 1.696 ± 0.111
SerAsp: 3.794 ± 0.189
SerGlu: 3.756 ± 0.153
SerPhe: 4.426 ± 0.229
SerGly: 5.165 ± 0.325
SerHis: 1.951 ± 0.114
SerIle: 5.809 ± 0.252
SerLys: 5.312 ± 0.247
SerLeu: 5.624 ± 0.233
SerMet: 2.251 ± 0.134
SerAsn: 4.088 ± 0.257
SerPro: 3.316 ± 0.213
SerGln: 2.002 ± 0.128
SerArg: 5.044 ± 0.208
SerSer: 7.716 ± 0.358
SerThr: 4.949 ± 0.208
SerVal: 5.452 ± 0.223
SerTrp: 0.893 ± 0.079
SerTyr: 2.729 ± 0.153
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.972 ± 0.216
ThrCys: 1.295 ± 0.102
ThrAsp: 2.519 ± 0.133
ThrGlu: 2.57 ± 0.106
ThrPhe: 3.998 ± 0.171
ThrGly: 3.794 ± 0.317
ThrHis: 1.505 ± 0.11
ThrIle: 5.044 ± 0.207
ThrLys: 4.234 ± 0.198
ThrLeu: 4.598 ± 0.186
ThrMet: 1.709 ± 0.109
ThrAsn: 3.444 ± 0.211
ThrPro: 3.291 ± 0.246
ThrGln: 1.696 ± 0.135
ThrArg: 3.431 ± 0.152
ThrSer: 5.624 ± 0.221
ThrThr: 4.12 ± 0.207
ThrVal: 2.768 ± 0.173
ThrTrp: 0.714 ± 0.063
ThrTyr: 2.085 ± 0.125
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.201 ± 0.176
ValCys: 1.352 ± 0.113
ValAsp: 4.03 ± 0.152
ValGlu: 3.405 ± 0.16
ValPhe: 3.915 ± 0.166
ValGly: 3.418 ± 0.306
ValHis: 1.658 ± 0.11
ValIle: 4.732 ± 0.221
ValLys: 4.12 ± 0.257
ValLeu: 5.414 ± 0.268
ValMet: 2.002 ± 0.117
ValAsn: 3.303 ± 0.221
ValPro: 2.991 ± 0.202
ValGln: 1.633 ± 0.099
ValArg: 3.546 ± 0.155
ValSer: 5.72 ± 0.261
ValThr: 3.635 ± 0.246
ValVal: 4.853 ± 0.232
ValTrp: 0.574 ± 0.068
ValTyr: 2.787 ± 0.155
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.459 ± 0.047
TrpCys: 0.325 ± 0.065
TrpAsp: 0.568 ± 0.063
TrpGlu: 0.427 ± 0.057
TrpPhe: 0.759 ± 0.07
TrpGly: 0.427 ± 0.054
TrpHis: 0.159 ± 0.034
TrpIle: 0.899 ± 0.087
TrpLys: 0.899 ± 0.087
TrpLeu: 0.759 ± 0.07
TrpMet: 0.312 ± 0.043
TrpAsn: 0.925 ± 0.097
TrpPro: 0.261 ± 0.042
TrpGln: 0.21 ± 0.033
TrpArg: 0.548 ± 0.064
TrpSer: 0.842 ± 0.072
TrpThr: 0.701 ± 0.08
TrpVal: 0.466 ± 0.062
TrpTrp: 0.179 ± 0.038
TrpTyr: 0.491 ± 0.057
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.308 ± 0.138
TyrCys: 0.663 ± 0.075
TyrAsp: 2.653 ± 0.172
TyrGlu: 1.97 ± 0.132
TyrPhe: 2.187 ± 0.143
TyrGly: 2.034 ± 0.119
TyrHis: 0.899 ± 0.081
TyrIle: 3.029 ± 0.18
TyrLys: 2.525 ± 0.128
TyrLeu: 2.736 ± 0.137
TyrMet: 1.084 ± 0.089
TyrAsn: 2.2 ± 0.133
TyrPro: 1.301 ± 0.099
TyrGln: 0.867 ± 0.074
TyrArg: 1.888 ± 0.122
TyrSer: 2.5 ± 0.133
TyrThr: 2.442 ± 0.121
TyrVal: 2.353 ± 0.145
TyrTrp: 0.332 ± 0.038
TyrTyr: 1.244 ± 0.1
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 873 proteins (156815 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
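A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the code behind the table: the function name `bootstrap_error`, the fixed random seed, the pair-count normalisation, and the use of the standard deviation across replicates as the reported error are all choices made for the example.

```python
import random
import statistics


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Whole proteins are resampled with replacement, so residues from one
    protein always stay together. Requires n_boot >= 2.
    """
    rng = random.Random(seed)
    # Pre-compute (hits, number of pairs) once per protein.
    per_protein = []
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        hits = sum(1 for i in range(n_pairs) if seq[i:i + 2] == dipeptide)
        per_protein.append((hits, n_pairs))

    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the spread (standard deviation) of the replicates.
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not real proteome data.
toy = ["ACACGT", "GGGACA", "TTACAC", "ACGTAC"]
print(bootstrap_error(toy, "AC"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.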

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski