Amino acid dipepetide frequency for Yaba-like disease virus (YLDV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.289AlaAla: 1.289 ± 0.191
0.655AlaCys: 0.655 ± 0.101
1.201AlaAsp: 1.201 ± 0.134
1.332AlaGlu: 1.332 ± 0.179
1.441AlaPhe: 1.441 ± 0.185
0.961AlaGly: 0.961 ± 0.184
0.393AlaHis: 0.393 ± 0.104
3.516AlaIle: 3.516 ± 0.26
2.359AlaLys: 2.359 ± 0.187
2.708AlaLeu: 2.708 ± 0.234
0.633AlaMet: 0.633 ± 0.13
2.38AlaAsn: 2.38 ± 0.229
0.546AlaPro: 0.546 ± 0.106
0.415AlaGln: 0.415 ± 0.095
1.157AlaArg: 1.157 ± 0.138
2.38AlaSer: 2.38 ± 0.244
1.485AlaThr: 1.485 ± 0.158
1.813AlaVal: 1.813 ± 0.24
0.218AlaTrp: 0.218 ± 0.071
1.245AlaTyr: 1.245 ± 0.166
0.0AlaXaa: 0.0 ± 0.0
Cys
0.655CysAla: 0.655 ± 0.11
0.612CysCys: 0.612 ± 0.12
1.354CysAsp: 1.354 ± 0.19
1.048CysGlu: 1.048 ± 0.159
1.136CysPhe: 1.136 ± 0.148
1.005CysGly: 1.005 ± 0.149
0.197CysHis: 0.197 ± 0.06
1.966CysIle: 1.966 ± 0.195
1.987CysLys: 1.987 ± 0.269
1.944CysLeu: 1.944 ± 0.219
0.524CysMet: 0.524 ± 0.102
1.682CysAsn: 1.682 ± 0.203
0.612CysPro: 0.612 ± 0.12
0.284CysGln: 0.284 ± 0.089
0.546CysArg: 0.546 ± 0.09
1.835CysSer: 1.835 ± 0.213
0.983CysThr: 0.983 ± 0.155
1.594CysVal: 1.594 ± 0.203
0.197CysTrp: 0.197 ± 0.072
1.441CysTyr: 1.441 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
1.463AspAla: 1.463 ± 0.18
0.895AspCys: 0.895 ± 0.168
3.451AspAsp: 3.451 ± 0.317
3.582AspGlu: 3.582 ± 0.287
3.189AspPhe: 3.189 ± 0.228
1.835AspGly: 1.835 ± 0.188
0.677AspHis: 0.677 ± 0.126
6.639AspIle: 6.639 ± 0.4
4.084AspLys: 4.084 ± 0.297
4.433AspLeu: 4.433 ± 0.296
1.42AspMet: 1.42 ± 0.184
4.586AspAsn: 4.586 ± 0.297
1.157AspPro: 1.157 ± 0.141
0.895AspGln: 0.895 ± 0.145
1.223AspArg: 1.223 ± 0.159
3.735AspSer: 3.735 ± 0.316
2.664AspThr: 2.664 ± 0.237
3.953AspVal: 3.953 ± 0.344
0.415AspTrp: 0.415 ± 0.094
3.079AspTyr: 3.079 ± 0.243
0.0AspXaa: 0.0 ± 0.0
Glu
1.398GluAla: 1.398 ± 0.19
1.092GluCys: 1.092 ± 0.136
3.32GluAsp: 3.32 ± 0.282
3.735GluGlu: 3.735 ± 0.313
2.293GluPhe: 2.293 ± 0.23
1.332GluGly: 1.332 ± 0.18
0.743GluHis: 0.743 ± 0.112
6.202GluIle: 6.202 ± 0.416
5.984GluLys: 5.984 ± 0.296
4.936GluLeu: 4.936 ± 0.347
1.551GluMet: 1.551 ± 0.167
4.543GluAsn: 4.543 ± 0.341
1.485GluPro: 1.485 ± 0.172
1.332GluGln: 1.332 ± 0.176
1.289GluArg: 1.289 ± 0.238
3.56GluSer: 3.56 ± 0.311
2.992GluThr: 2.992 ± 0.287
2.533GluVal: 2.533 ± 0.205
0.349GluTrp: 0.349 ± 0.105
2.97GluTyr: 2.97 ± 0.232
0.0GluXaa: 0.0 ± 0.0
Phe
1.398PheAla: 1.398 ± 0.182
1.332PheCys: 1.332 ± 0.158
2.643PheAsp: 2.643 ± 0.238
2.664PheGlu: 2.664 ± 0.215
3.298PhePhe: 3.298 ± 0.355
2.249PheGly: 2.249 ± 0.236
0.743PheHis: 0.743 ± 0.121
6.115PheIle: 6.115 ± 0.406
4.783PheLys: 4.783 ± 0.287
6.421PheLeu: 6.421 ± 0.453
1.354PheMet: 1.354 ± 0.174
5.7PheAsn: 5.7 ± 0.394
1.813PhePro: 1.813 ± 0.196
1.005PheGln: 1.005 ± 0.123
1.529PheArg: 1.529 ± 0.175
5.241PheSer: 5.241 ± 0.311
2.643PheThr: 2.643 ± 0.265
3.953PheVal: 3.953 ± 0.348
0.48PheTrp: 0.48 ± 0.102
3.079PheTyr: 3.079 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
1.289GlyAla: 1.289 ± 0.221
0.786GlyCys: 0.786 ± 0.143
1.987GlyAsp: 1.987 ± 0.184
1.856GlyGlu: 1.856 ± 0.216
2.053GlyPhe: 2.053 ± 0.219
1.9GlyGly: 1.9 ± 0.27
0.393GlyHis: 0.393 ± 0.098
3.582GlyIle: 3.582 ± 0.26
4.215GlyLys: 4.215 ± 0.369
2.424GlyLeu: 2.424 ± 0.185
0.677GlyMet: 0.677 ± 0.104
3.167GlyAsn: 3.167 ± 0.242
0.59GlyPro: 0.59 ± 0.112
0.524GlyGln: 0.524 ± 0.09
1.157GlyArg: 1.157 ± 0.193
2.206GlySer: 2.206 ± 0.219
1.507GlyThr: 1.507 ± 0.248
2.621GlyVal: 2.621 ± 0.216
0.218GlyTrp: 0.218 ± 0.073
2.271GlyTyr: 2.271 ± 0.225
0.0GlyXaa: 0.0 ± 0.0
His
0.371HisAla: 0.371 ± 0.073
0.437HisCys: 0.437 ± 0.087
0.764HisAsp: 0.764 ± 0.127
0.633HisGlu: 0.633 ± 0.114
1.048HisPhe: 1.048 ± 0.158
0.677HisGly: 0.677 ± 0.135
0.262HisHis: 0.262 ± 0.081
1.594HisIle: 1.594 ± 0.183
1.201HisLys: 1.201 ± 0.139
1.551HisLeu: 1.551 ± 0.186
0.633HisMet: 0.633 ± 0.137
1.223HisAsn: 1.223 ± 0.169
0.415HisPro: 0.415 ± 0.093
0.349HisGln: 0.349 ± 0.069
0.415HisArg: 0.415 ± 0.098
1.157HisSer: 1.157 ± 0.165
0.699HisThr: 0.699 ± 0.146
1.376HisVal: 1.376 ± 0.185
0.197HisTrp: 0.197 ± 0.058
0.808HisTyr: 0.808 ± 0.134
0.0HisXaa: 0.0 ± 0.0
Ile
3.079IleAla: 3.079 ± 0.25
2.271IleCys: 2.271 ± 0.218
6.049IleAsp: 6.049 ± 0.337
5.154IleGlu: 5.154 ± 0.366
6.049IlePhe: 6.049 ± 0.39
3.407IleGly: 3.407 ± 0.26
1.856IleHis: 1.856 ± 0.184
10.177IleIle: 10.177 ± 0.46
10.133IleLys: 10.133 ± 0.605
9.5IleLeu: 9.5 ± 0.5
2.097IleMet: 2.097 ± 0.244
10.417IleAsn: 10.417 ± 0.501
3.167IlePro: 3.167 ± 0.244
2.271IleGln: 2.271 ± 0.227
2.533IleArg: 2.533 ± 0.226
8.102IleSer: 8.102 ± 0.398
5.547IleThr: 5.547 ± 0.322
6.006IleVal: 6.006 ± 0.356
0.546IleTrp: 0.546 ± 0.11
4.543IleTyr: 4.543 ± 0.319
0.0IleXaa: 0.0 ± 0.0
Lys
2.162LysAla: 2.162 ± 0.225
1.878LysCys: 1.878 ± 0.233
4.477LysAsp: 4.477 ± 0.289
5.635LysGlu: 5.635 ± 0.328
4.455LysPhe: 4.455 ± 0.329
2.795LysGly: 2.795 ± 0.221
1.922LysHis: 1.922 ± 0.22
10.505LysIle: 10.505 ± 0.468
11.553LysLys: 11.553 ± 0.602
8.736LysLeu: 8.736 ± 0.443
2.118LysMet: 2.118 ± 0.205
9.675LysAsn: 9.675 ± 0.504
2.184LysPro: 2.184 ± 0.177
2.468LysGln: 2.468 ± 0.204
2.926LysArg: 2.926 ± 0.25
6.814LysSer: 6.814 ± 0.378
5.722LysThr: 5.722 ± 0.342
5.22LysVal: 5.22 ± 0.309
0.677LysTrp: 0.677 ± 0.131
5.853LysTyr: 5.853 ± 0.426
0.0LysXaa: 0.0 ± 0.0
Leu
2.926LeuAla: 2.926 ± 0.228
1.485LeuCys: 1.485 ± 0.209
4.39LeuAsp: 4.39 ± 0.288
5.394LeuGlu: 5.394 ± 0.412
5.984LeuPhe: 5.984 ± 0.433
2.97LeuGly: 2.97 ± 0.3
1.463LeuHis: 1.463 ± 0.239
8.495LeuIle: 8.495 ± 0.513
9.435LeuLys: 9.435 ± 0.483
9.435LeuLeu: 9.435 ± 0.5
2.075LeuMet: 2.075 ± 0.219
7.797LeuAsn: 7.797 ± 0.477
2.708LeuPro: 2.708 ± 0.268
2.031LeuGln: 2.031 ± 0.203
2.577LeuArg: 2.577 ± 0.227
8.561LeuSer: 8.561 ± 0.378
5.394LeuThr: 5.394 ± 0.34
5.11LeuVal: 5.11 ± 0.344
0.48LeuTrp: 0.48 ± 0.095
4.215LeuTyr: 4.215 ± 0.326
0.0LeuXaa: 0.0 ± 0.0
Met
0.983MetAla: 0.983 ± 0.165
0.59MetCys: 0.59 ± 0.096
1.376MetAsp: 1.376 ± 0.186
1.289MetGlu: 1.289 ± 0.153
1.966MetPhe: 1.966 ± 0.218
0.874MetGly: 0.874 ± 0.136
0.459MetHis: 0.459 ± 0.094
2.315MetIle: 2.315 ± 0.21
1.594MetLys: 1.594 ± 0.199
2.424MetLeu: 2.424 ± 0.218
0.459MetMet: 0.459 ± 0.115
1.485MetAsn: 1.485 ± 0.167
0.743MetPro: 0.743 ± 0.123
0.437MetGln: 0.437 ± 0.095
0.459MetArg: 0.459 ± 0.111
1.966MetSer: 1.966 ± 0.241
1.289MetThr: 1.289 ± 0.15
1.048MetVal: 1.048 ± 0.165
0.153MetTrp: 0.153 ± 0.052
1.354MetTyr: 1.354 ± 0.19
0.0MetXaa: 0.0 ± 0.0
Asn
2.031AsnAla: 2.031 ± 0.257
1.703AsnCys: 1.703 ± 0.197
5.132AsnAsp: 5.132 ± 0.38
4.543AsnGlu: 4.543 ± 0.304
5.241AsnPhe: 5.241 ± 0.297
3.276AsnGly: 3.276 ± 0.315
1.092AsnHis: 1.092 ± 0.158
9.871AsnIle: 9.871 ± 0.575
9.216AsnLys: 9.216 ± 0.442
7.076AsnLeu: 7.076 ± 0.421
2.162AsnMet: 2.162 ± 0.206
9.522AsnAsn: 9.522 ± 0.6
2.49AsnPro: 2.49 ± 0.33
1.398AsnGln: 1.398 ± 0.163
2.075AsnArg: 2.075 ± 0.174
6.29AsnSer: 6.29 ± 0.34
4.171AsnThr: 4.171 ± 0.316
6.006AsnVal: 6.006 ± 0.376
0.612AsnTrp: 0.612 ± 0.133
4.215AsnTyr: 4.215 ± 0.261
0.0AsnXaa: 0.0 ± 0.0
Pro
0.633ProAla: 0.633 ± 0.117
0.59ProCys: 0.59 ± 0.116
1.223ProAsp: 1.223 ± 0.198
1.703ProGlu: 1.703 ± 0.21
1.922ProPhe: 1.922 ± 0.242
1.07ProGly: 1.07 ± 0.134
0.437ProHis: 0.437 ± 0.119
2.686ProIle: 2.686 ± 0.278
2.512ProLys: 2.512 ± 0.223
2.905ProLeu: 2.905 ± 0.3
0.568ProMet: 0.568 ± 0.105
2.446ProAsn: 2.446 ± 0.22
1.201ProPro: 1.201 ± 0.185
0.459ProGln: 0.459 ± 0.089
0.721ProArg: 0.721 ± 0.135
2.337ProSer: 2.337 ± 0.255
1.463ProThr: 1.463 ± 0.183
1.682ProVal: 1.682 ± 0.182
0.218ProTrp: 0.218 ± 0.061
1.42ProTyr: 1.42 ± 0.169
0.0ProXaa: 0.0 ± 0.0
Gln
0.524GlnAla: 0.524 ± 0.112
0.415GlnCys: 0.415 ± 0.089
0.961GlnAsp: 0.961 ± 0.129
1.245GlnGlu: 1.245 ± 0.13
0.961GlnPhe: 0.961 ± 0.127
0.524GlnGly: 0.524 ± 0.11
0.306GlnHis: 0.306 ± 0.088
1.922GlnIle: 1.922 ± 0.211
2.075GlnLys: 2.075 ± 0.265
2.14GlnLeu: 2.14 ± 0.202
0.502GlnMet: 0.502 ± 0.1
1.551GlnAsn: 1.551 ± 0.15
0.59GlnPro: 0.59 ± 0.119
0.852GlnGln: 0.852 ± 0.146
0.786GlnArg: 0.786 ± 0.125
1.398GlnSer: 1.398 ± 0.144
1.201GlnThr: 1.201 ± 0.18
1.092GlnVal: 1.092 ± 0.149
0.175GlnTrp: 0.175 ± 0.063
0.961GlnTyr: 0.961 ± 0.138
0.0GlnXaa: 0.0 ± 0.0
Arg
0.699ArgAla: 0.699 ± 0.121
0.677ArgCys: 0.677 ± 0.141
1.507ArgAsp: 1.507 ± 0.196
1.551ArgGlu: 1.551 ± 0.156
1.725ArgPhe: 1.725 ± 0.199
1.267ArgGly: 1.267 ± 0.172
0.764ArgHis: 0.764 ± 0.122
2.271ArgIle: 2.271 ± 0.222
2.359ArgLys: 2.359 ± 0.267
2.446ArgLeu: 2.446 ± 0.24
0.677ArgMet: 0.677 ± 0.127
1.638ArgAsn: 1.638 ± 0.17
0.743ArgPro: 0.743 ± 0.146
0.852ArgGln: 0.852 ± 0.116
0.917ArgArg: 0.917 ± 0.154
1.966ArgSer: 1.966 ± 0.196
1.332ArgThr: 1.332 ± 0.174
1.594ArgVal: 1.594 ± 0.223
0.197ArgTrp: 0.197 ± 0.075
1.507ArgTyr: 1.507 ± 0.174
0.0ArgXaa: 0.0 ± 0.0
Ser
2.009SerAla: 2.009 ± 0.234
1.856SerCys: 1.856 ± 0.203
4.149SerAsp: 4.149 ± 0.324
4.281SerGlu: 4.281 ± 0.35
4.805SerPhe: 4.805 ± 0.386
2.795SerGly: 2.795 ± 0.242
1.201SerHis: 1.201 ± 0.164
7.447SerIle: 7.447 ± 0.417
7.95SerLys: 7.95 ± 0.384
7.578SerLeu: 7.578 ± 0.323
2.053SerMet: 2.053 ± 0.186
6.071SerAsn: 6.071 ± 0.421
1.966SerPro: 1.966 ± 0.2
1.725SerGln: 1.725 ± 0.225
1.944SerArg: 1.944 ± 0.278
6.661SerSer: 6.661 ± 0.556
4.63SerThr: 4.63 ± 0.36
5.831SerVal: 5.831 ± 0.299
0.502SerTrp: 0.502 ± 0.135
3.931SerTyr: 3.931 ± 0.333
0.0SerXaa: 0.0 ± 0.0
Thr
1.551ThrAla: 1.551 ± 0.213
1.441ThrCys: 1.441 ± 0.191
2.577ThrAsp: 2.577 ± 0.244
2.555ThrGlu: 2.555 ± 0.258
3.036ThrPhe: 3.036 ± 0.241
1.987ThrGly: 1.987 ± 0.242
1.223ThrHis: 1.223 ± 0.178
5.547ThrIle: 5.547 ± 0.33
4.783ThrLys: 4.783 ± 0.286
5.089ThrLeu: 5.089 ± 0.309
1.005ThrMet: 1.005 ± 0.14
3.494ThrAsn: 3.494 ± 0.252
1.813ThrPro: 1.813 ± 0.241
0.874ThrGln: 0.874 ± 0.131
1.354ThrArg: 1.354 ± 0.174
4.717ThrSer: 4.717 ± 0.349
2.948ThrThr: 2.948 ± 0.269
3.516ThrVal: 3.516 ± 0.284
0.393ThrTrp: 0.393 ± 0.092
2.686ThrTyr: 2.686 ± 0.286
0.0ThrXaa: 0.0 ± 0.0
Val
1.966ValAla: 1.966 ± 0.201
1.398ValCys: 1.398 ± 0.191
3.341ValAsp: 3.341 ± 0.291
2.664ValGlu: 2.664 ± 0.29
4.215ValPhe: 4.215 ± 0.321
1.638ValGly: 1.638 ± 0.203
0.808ValHis: 0.808 ± 0.115
5.875ValIle: 5.875 ± 0.426
6.486ValLys: 6.486 ± 0.364
5.744ValLeu: 5.744 ± 0.315
1.07ValMet: 1.07 ± 0.154
6.29ValAsn: 6.29 ± 0.44
2.031ValPro: 2.031 ± 0.19
0.983ValGln: 0.983 ± 0.161
1.66ValArg: 1.66 ± 0.197
5.809ValSer: 5.809 ± 0.439
3.32ValThr: 3.32 ± 0.309
3.538ValVal: 3.538 ± 0.361
0.175ValTrp: 0.175 ± 0.058
3.8ValTyr: 3.8 ± 0.326
0.0ValXaa: 0.0 ± 0.0
Trp
0.197TrpAla: 0.197 ± 0.08
0.175TrpCys: 0.175 ± 0.064
0.197TrpAsp: 0.197 ± 0.059
0.284TrpGlu: 0.284 ± 0.075
0.502TrpPhe: 0.502 ± 0.096
0.218TrpGly: 0.218 ± 0.064
0.066TrpHis: 0.066 ± 0.037
0.786TrpIle: 0.786 ± 0.147
1.048TrpLys: 1.048 ± 0.19
0.502TrpLeu: 0.502 ± 0.11
0.415TrpMet: 0.415 ± 0.106
0.262TrpAsn: 0.262 ± 0.09
0.218TrpPro: 0.218 ± 0.063
0.044TrpGln: 0.044 ± 0.029
0.153TrpArg: 0.153 ± 0.057
0.59TrpSer: 0.59 ± 0.103
0.24TrpThr: 0.24 ± 0.06
0.371TrpVal: 0.371 ± 0.088
0.044TrpTrp: 0.044 ± 0.027
0.284TrpTyr: 0.284 ± 0.071
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.507TyrAla: 1.507 ± 0.196
1.31TyrCys: 1.31 ± 0.185
3.276TyrAsp: 3.276 ± 0.354
2.49TyrGlu: 2.49 ± 0.225
3.123TyrPhe: 3.123 ± 0.235
2.643TyrGly: 2.643 ± 0.265
0.808TyrHis: 0.808 ± 0.117
5.394TyrIle: 5.394 ± 0.322
4.018TyrLys: 4.018 ± 0.269
4.936TyrLeu: 4.936 ± 0.291
1.245TyrMet: 1.245 ± 0.164
4.193TyrAsn: 4.193 ± 0.319
1.725TyrPro: 1.725 ± 0.189
0.983TyrGln: 0.983 ± 0.16
1.267TyrArg: 1.267 ± 0.155
4.062TyrSer: 4.062 ± 0.285
2.359TyrThr: 2.359 ± 0.197
3.975TyrVal: 3.975 ± 0.255
0.349TyrTrp: 0.349 ± 0.103
2.49TyrTyr: 2.49 ± 0.253
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 150 proteins (45790 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski