Amino acid dipepetide frequency for Yichang virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.462AlaAla: 3.462 ± 1.143
0.907AlaCys: 0.907 ± 0.234
4.616AlaAsp: 4.616 ± 0.733
2.143AlaGlu: 2.143 ± 0.471
1.896AlaPhe: 1.896 ± 0.246
1.978AlaGly: 1.978 ± 0.431
1.896AlaHis: 1.896 ± 0.352
5.193AlaIle: 5.193 ± 0.401
2.72AlaLys: 2.72 ± 0.329
5.358AlaLeu: 5.358 ± 0.74
0.742AlaMet: 0.742 ± 0.451
2.803AlaAsn: 2.803 ± 0.366
1.649AlaPro: 1.649 ± 0.226
2.968AlaGln: 2.968 ± 0.354
1.072AlaArg: 1.072 ± 0.175
2.391AlaSer: 2.391 ± 0.583
3.627AlaThr: 3.627 ± 0.426
3.215AlaVal: 3.215 ± 0.499
0.165AlaTrp: 0.165 ± 0.075
3.38AlaTyr: 3.38 ± 0.483
0.0AlaXaa: 0.0 ± 0.0
Cys
1.154CysAla: 1.154 ± 0.514
0.165CysCys: 0.165 ± 0.079
0.989CysAsp: 0.989 ± 0.161
1.154CysGlu: 1.154 ± 0.252
0.989CysPhe: 0.989 ± 0.331
1.319CysGly: 1.319 ± 0.286
0.742CysHis: 0.742 ± 0.143
1.072CysIle: 1.072 ± 0.322
0.824CysLys: 0.824 ± 0.177
1.814CysLeu: 1.814 ± 0.319
0.659CysMet: 0.659 ± 0.186
1.566CysAsn: 1.566 ± 0.234
1.319CysPro: 1.319 ± 0.22
0.659CysGln: 0.659 ± 0.299
0.989CysArg: 0.989 ± 0.237
1.814CysSer: 1.814 ± 0.289
2.226CysThr: 2.226 ± 0.314
0.989CysVal: 0.989 ± 0.331
0.0CysTrp: 0.0 ± 0.0
1.731CysTyr: 1.731 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
2.226AspAla: 2.226 ± 0.346
1.237AspCys: 1.237 ± 0.111
2.061AspAsp: 2.061 ± 0.253
3.462AspGlu: 3.462 ± 0.373
4.122AspPhe: 4.122 ± 0.51
2.72AspGly: 2.72 ± 0.258
0.659AspHis: 0.659 ± 0.201
4.616AspIle: 4.616 ± 0.655
1.237AspLys: 1.237 ± 0.111
4.864AspLeu: 4.864 ± 0.572
1.978AspMet: 1.978 ± 0.229
3.38AspAsn: 3.38 ± 0.473
4.699AspPro: 4.699 ± 0.745
2.885AspGln: 2.885 ± 0.257
1.896AspArg: 1.896 ± 0.342
3.71AspSer: 3.71 ± 0.128
3.545AspThr: 3.545 ± 0.539
2.968AspVal: 2.968 ± 0.279
0.659AspTrp: 0.659 ± 0.186
3.792AspTyr: 3.792 ± 0.478
0.0AspXaa: 0.0 ± 0.0
Glu
1.566GluAla: 1.566 ± 0.164
0.907GluCys: 0.907 ± 0.212
1.649GluAsp: 1.649 ± 0.106
2.226GluGlu: 2.226 ± 0.434
2.72GluPhe: 2.72 ± 0.492
1.401GluGly: 1.401 ± 0.178
1.978GluHis: 1.978 ± 0.415
3.05GluIle: 3.05 ± 0.515
2.143GluLys: 2.143 ± 0.219
5.358GluLeu: 5.358 ± 0.32
0.742GluMet: 0.742 ± 0.148
2.72GluAsn: 2.72 ± 0.348
1.401GluPro: 1.401 ± 0.278
2.061GluGln: 2.061 ± 0.39
2.803GluArg: 2.803 ± 0.424
3.462GluSer: 3.462 ± 0.479
3.132GluThr: 3.132 ± 0.555
2.72GluVal: 2.72 ± 0.385
0.495GluTrp: 0.495 ± 0.119
2.885GluTyr: 2.885 ± 0.848
0.0GluXaa: 0.0 ± 0.0
Phe
2.061PheAla: 2.061 ± 0.131
1.484PheCys: 1.484 ± 0.232
3.792PheAsp: 3.792 ± 0.393
2.226PheGlu: 2.226 ± 0.342
1.319PhePhe: 1.319 ± 0.295
2.968PheGly: 2.968 ± 0.279
0.495PheHis: 0.495 ± 0.238
2.308PheIle: 2.308 ± 0.35
2.143PheLys: 2.143 ± 0.58
3.792PheLeu: 3.792 ± 0.419
1.566PheMet: 1.566 ± 0.204
2.885PheAsn: 2.885 ± 0.293
1.814PhePro: 1.814 ± 0.086
2.226PheGln: 2.226 ± 0.218
2.061PheArg: 2.061 ± 0.176
2.638PheSer: 2.638 ± 0.423
2.803PheThr: 2.803 ± 0.391
3.38PheVal: 3.38 ± 0.395
0.165PheTrp: 0.165 ± 0.079
2.226PheTyr: 2.226 ± 0.29
0.0PheXaa: 0.0 ± 0.0
Gly
1.072GlyAla: 1.072 ± 0.175
0.33GlyCys: 0.33 ± 0.068
2.885GlyAsp: 2.885 ± 0.526
1.154GlyGlu: 1.154 ± 0.318
2.226GlyPhe: 2.226 ± 0.137
1.484GlyGly: 1.484 ± 0.259
1.731GlyHis: 1.731 ± 0.225
2.555GlyIle: 2.555 ± 0.277
2.061GlyLys: 2.061 ± 0.345
2.72GlyLeu: 2.72 ± 0.46
0.824GlyMet: 0.824 ± 0.188
2.143GlyAsn: 2.143 ± 0.58
1.401GlyPro: 1.401 ± 0.386
0.577GlyGln: 0.577 ± 0.317
1.484GlyArg: 1.484 ± 0.301
2.803GlySer: 2.803 ± 0.238
2.72GlyThr: 2.72 ± 0.301
1.484GlyVal: 1.484 ± 0.286
0.495GlyTrp: 0.495 ± 0.251
2.555GlyTyr: 2.555 ± 0.565
0.0GlyXaa: 0.0 ± 0.0
His
1.649HisAla: 1.649 ± 0.743
0.989HisCys: 0.989 ± 0.161
1.731HisAsp: 1.731 ± 0.144
1.319HisGlu: 1.319 ± 0.356
2.308HisPhe: 2.308 ± 0.275
0.989HisGly: 0.989 ± 0.237
1.401HisHis: 1.401 ± 0.197
2.391HisIle: 2.391 ± 0.177
2.226HisLys: 2.226 ± 0.338
2.555HisLeu: 2.555 ± 0.299
1.237HisMet: 1.237 ± 0.247
1.731HisAsn: 1.731 ± 0.537
1.896HisPro: 1.896 ± 0.292
1.072HisGln: 1.072 ± 0.327
1.484HisArg: 1.484 ± 0.162
1.649HisSer: 1.649 ± 0.433
2.226HisThr: 2.226 ± 0.424
2.638HisVal: 2.638 ± 0.304
0.495HisTrp: 0.495 ± 0.238
2.143HisTyr: 2.143 ± 0.461
0.0HisXaa: 0.0 ± 0.0
Ile
4.122IleAla: 4.122 ± 0.506
1.319IleCys: 1.319 ± 0.147
3.215IleAsp: 3.215 ± 0.194
3.132IleGlu: 3.132 ± 0.188
2.391IlePhe: 2.391 ± 0.766
1.731IleGly: 1.731 ± 0.231
1.566IleHis: 1.566 ± 0.517
3.627IleIle: 3.627 ± 0.625
3.297IleLys: 3.297 ± 0.41
6.018IleLeu: 6.018 ± 0.544
1.484IleMet: 1.484 ± 0.297
4.534IleAsn: 4.534 ± 0.458
3.874IlePro: 3.874 ± 0.902
3.297IleGln: 3.297 ± 0.211
2.473IleArg: 2.473 ± 0.233
3.38IleSer: 3.38 ± 0.473
6.512IleThr: 6.512 ± 0.876
3.957IleVal: 3.957 ± 0.234
0.577IleTrp: 0.577 ± 0.101
4.122IleTyr: 4.122 ± 0.483
0.0IleXaa: 0.0 ± 0.0
Lys
2.555LysAla: 2.555 ± 0.283
0.659LysCys: 0.659 ± 0.201
2.803LysAsp: 2.803 ± 0.272
2.143LysGlu: 2.143 ± 0.477
1.484LysPhe: 1.484 ± 0.342
1.484LysGly: 1.484 ± 0.17
2.803LysHis: 2.803 ± 0.414
3.05LysIle: 3.05 ± 0.598
1.319LysLys: 1.319 ± 0.136
6.347LysLeu: 6.347 ± 0.619
0.659LysMet: 0.659 ± 0.184
3.462LysAsn: 3.462 ± 0.566
4.039LysPro: 4.039 ± 0.514
2.226LysGln: 2.226 ± 0.568
2.143LysArg: 2.143 ± 0.236
2.061LysSer: 2.061 ± 0.705
2.885LysThr: 2.885 ± 0.553
2.885LysVal: 2.885 ± 0.282
0.33LysTrp: 0.33 ± 0.068
4.287LysTyr: 4.287 ± 0.738
0.0LysXaa: 0.0 ± 0.0
Leu
5.193LeuAla: 5.193 ± 0.323
2.803LeuCys: 2.803 ± 0.395
5.276LeuAsp: 5.276 ± 0.215
3.957LeuGlu: 3.957 ± 0.228
3.71LeuPhe: 3.71 ± 0.201
3.132LeuGly: 3.132 ± 0.276
3.462LeuHis: 3.462 ± 0.643
5.193LeuIle: 5.193 ± 0.747
4.369LeuLys: 4.369 ± 0.583
8.573LeuLeu: 8.573 ± 0.687
1.154LeuMet: 1.154 ± 0.299
5.028LeuAsn: 5.028 ± 0.263
3.462LeuPro: 3.462 ± 0.43
5.605LeuGln: 5.605 ± 0.879
5.441LeuArg: 5.441 ± 0.365
6.512LeuSer: 6.512 ± 0.715
8.985LeuThr: 8.985 ± 0.335
6.018LeuVal: 6.018 ± 0.391
1.154LeuTrp: 1.154 ± 0.242
5.853LeuTyr: 5.853 ± 0.871
0.0LeuXaa: 0.0 ± 0.0
Met
1.566MetAla: 1.566 ± 0.172
0.33MetCys: 0.33 ± 0.149
0.495MetAsp: 0.495 ± 0.119
0.659MetGlu: 0.659 ± 0.317
1.566MetPhe: 1.566 ± 0.303
0.412MetGly: 0.412 ± 0.13
0.659MetHis: 0.659 ± 0.305
0.659MetIle: 0.659 ± 0.186
1.154MetLys: 1.154 ± 0.326
2.473MetLeu: 2.473 ± 0.562
0.0MetMet: 0.0 ± 0.0
0.742MetAsn: 0.742 ± 0.143
1.484MetPro: 1.484 ± 0.157
1.154MetGln: 1.154 ± 0.318
0.742MetArg: 0.742 ± 0.176
1.154MetSer: 1.154 ± 0.187
1.319MetThr: 1.319 ± 0.355
0.907MetVal: 0.907 ± 0.247
0.0MetTrp: 0.0 ± 0.0
1.814MetTyr: 1.814 ± 0.243
0.0MetXaa: 0.0 ± 0.0
Asn
2.885AsnAla: 2.885 ± 0.257
1.566AsnCys: 1.566 ± 0.261
2.143AsnAsp: 2.143 ± 0.627
1.978AsnGlu: 1.978 ± 0.358
2.638AsnPhe: 2.638 ± 0.343
2.226AsnGly: 2.226 ± 0.385
2.061AsnHis: 2.061 ± 0.475
4.369AsnIle: 4.369 ± 0.744
3.792AsnLys: 3.792 ± 0.463
6.183AsnLeu: 6.183 ± 0.46
0.907AsnMet: 0.907 ± 0.211
4.534AsnAsn: 4.534 ± 1.256
4.204AsnPro: 4.204 ± 0.658
3.38AsnGln: 3.38 ± 0.423
2.143AsnArg: 2.143 ± 0.473
3.957AsnSer: 3.957 ± 0.382
5.193AsnThr: 5.193 ± 0.879
3.71AsnVal: 3.71 ± 0.112
0.412AsnTrp: 0.412 ± 0.182
2.803AsnTyr: 2.803 ± 0.616
0.0AsnXaa: 0.0 ± 0.0
Pro
2.968ProAla: 2.968 ± 0.459
0.577ProCys: 0.577 ± 0.223
1.978ProAsp: 1.978 ± 0.211
3.215ProGlu: 3.215 ± 0.544
1.566ProPhe: 1.566 ± 0.25
2.226ProGly: 2.226 ± 0.33
1.401ProHis: 1.401 ± 0.458
3.545ProIle: 3.545 ± 0.449
3.627ProLys: 3.627 ± 0.39
4.204ProLeu: 4.204 ± 0.613
0.824ProMet: 0.824 ± 0.146
3.215ProAsn: 3.215 ± 0.367
2.555ProPro: 2.555 ± 0.182
3.71ProGln: 3.71 ± 0.489
2.308ProArg: 2.308 ± 0.605
4.369ProSer: 4.369 ± 0.543
5.111ProThr: 5.111 ± 0.36
3.462ProVal: 3.462 ± 0.262
0.412ProTrp: 0.412 ± 0.098
2.061ProTyr: 2.061 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
2.555GlnAla: 2.555 ± 0.407
1.154GlnCys: 1.154 ± 0.163
2.061GlnAsp: 2.061 ± 0.415
1.566GlnGlu: 1.566 ± 0.309
2.143GlnPhe: 2.143 ± 0.31
1.401GlnGly: 1.401 ± 0.284
2.968GlnHis: 2.968 ± 0.463
2.061GlnIle: 2.061 ± 0.176
2.308GlnLys: 2.308 ± 0.233
5.523GlnLeu: 5.523 ± 0.426
0.495GlnMet: 0.495 ± 0.127
4.534GlnAsn: 4.534 ± 0.505
1.814GlnPro: 1.814 ± 0.575
2.803GlnGln: 2.803 ± 1.575
3.462GlnArg: 3.462 ± 0.774
2.226GlnSer: 2.226 ± 0.141
2.968GlnThr: 2.968 ± 0.811
2.226GlnVal: 2.226 ± 0.21
0.0GlnTrp: 0.0 ± 0.0
3.462GlnTyr: 3.462 ± 0.576
0.0GlnXaa: 0.0 ± 0.0
Arg
2.72ArgAla: 2.72 ± 0.547
0.907ArgCys: 0.907 ± 0.202
4.451ArgAsp: 4.451 ± 0.706
2.308ArgGlu: 2.308 ± 0.181
2.473ArgPhe: 2.473 ± 0.308
0.824ArgGly: 0.824 ± 0.178
1.072ArgHis: 1.072 ± 0.153
3.297ArgIle: 3.297 ± 0.784
1.978ArgLys: 1.978 ± 0.343
3.132ArgLeu: 3.132 ± 0.307
0.742ArgMet: 0.742 ± 0.281
2.555ArgAsn: 2.555 ± 0.678
2.143ArgPro: 2.143 ± 0.399
2.391ArgGln: 2.391 ± 0.277
2.061ArgArg: 2.061 ± 0.541
2.803ArgSer: 2.803 ± 0.272
3.957ArgThr: 3.957 ± 0.537
2.638ArgVal: 2.638 ± 0.708
0.495ArgTrp: 0.495 ± 0.259
2.473ArgTyr: 2.473 ± 0.256
0.0ArgXaa: 0.0 ± 0.0
Ser
3.05SerAla: 3.05 ± 0.313
2.143SerCys: 2.143 ± 0.316
4.204SerAsp: 4.204 ± 0.362
3.132SerGlu: 3.132 ± 0.526
2.226SerPhe: 2.226 ± 0.459
1.649SerGly: 1.649 ± 0.22
2.061SerHis: 2.061 ± 0.652
3.462SerIle: 3.462 ± 0.214
3.215SerLys: 3.215 ± 0.885
5.523SerLeu: 5.523 ± 0.615
1.319SerMet: 1.319 ± 0.431
2.803SerAsn: 2.803 ± 0.222
2.638SerPro: 2.638 ± 0.381
2.885SerGln: 2.885 ± 0.436
2.555SerArg: 2.555 ± 0.278
4.287SerSer: 4.287 ± 1.268
5.605SerThr: 5.605 ± 0.209
4.204SerVal: 4.204 ± 0.677
0.165SerTrp: 0.165 ± 0.079
3.957SerTyr: 3.957 ± 0.245
0.0SerXaa: 0.0 ± 0.0
Thr
4.369ThrAla: 4.369 ± 0.568
1.484ThrCys: 1.484 ± 0.245
2.308ThrAsp: 2.308 ± 0.413
4.864ThrGlu: 4.864 ± 0.292
3.462ThrPhe: 3.462 ± 0.454
2.555ThrGly: 2.555 ± 0.522
2.885ThrHis: 2.885 ± 0.249
5.853ThrIle: 5.853 ± 0.465
4.039ThrLys: 4.039 ± 0.263
8.326ThrLeu: 8.326 ± 1.291
2.061ThrMet: 2.061 ± 0.388
3.792ThrAsn: 3.792 ± 0.872
5.276ThrPro: 5.276 ± 0.473
3.957ThrGln: 3.957 ± 0.547
4.287ThrArg: 4.287 ± 0.464
5.028ThrSer: 5.028 ± 1.413
7.007ThrThr: 7.007 ± 0.811
5.276ThrVal: 5.276 ± 0.698
1.072ThrTrp: 1.072 ± 0.149
3.874ThrTyr: 3.874 ± 0.231
0.0ThrXaa: 0.0 ± 0.0
Val
2.226ValAla: 2.226 ± 0.612
1.484ValCys: 1.484 ± 0.142
4.864ValAsp: 4.864 ± 0.348
2.72ValGlu: 2.72 ± 0.497
1.896ValPhe: 1.896 ± 0.312
1.484ValGly: 1.484 ± 0.548
1.896ValHis: 1.896 ± 0.235
3.957ValIle: 3.957 ± 0.216
3.05ValLys: 3.05 ± 0.506
6.018ValLeu: 6.018 ± 0.638
1.072ValMet: 1.072 ± 0.149
5.028ValAsn: 5.028 ± 0.523
4.369ValPro: 4.369 ± 0.251
1.814ValGln: 1.814 ± 0.291
2.061ValArg: 2.061 ± 0.518
3.71ValSer: 3.71 ± 0.903
5.358ValThr: 5.358 ± 0.278
5.111ValVal: 5.111 ± 0.853
0.33ValTrp: 0.33 ± 0.068
3.38ValTyr: 3.38 ± 0.299
0.0ValXaa: 0.0 ± 0.0
Trp
0.165TrpAla: 0.165 ± 0.075
0.165TrpCys: 0.165 ± 0.075
0.824TrpAsp: 0.824 ± 0.231
0.0TrpGlu: 0.0 ± 0.0
0.247TrpPhe: 0.247 ± 0.163
0.33TrpGly: 0.33 ± 0.068
0.33TrpHis: 0.33 ± 0.158
0.495TrpIle: 0.495 ± 0.119
0.165TrpLys: 0.165 ± 0.079
1.237TrpLeu: 1.237 ± 0.222
0.165TrpMet: 0.165 ± 0.075
0.0TrpAsn: 0.0 ± 0.0
0.577TrpPro: 0.577 ± 0.094
0.0TrpGln: 0.0 ± 0.0
0.247TrpArg: 0.247 ± 0.146
0.577TrpSer: 0.577 ± 0.094
0.577TrpThr: 0.577 ± 0.226
0.33TrpVal: 0.33 ± 0.158
0.165TrpTrp: 0.165 ± 0.075
1.154TrpTyr: 1.154 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.616TyrAla: 4.616 ± 0.336
1.566TyrCys: 1.566 ± 0.2
4.369TyrAsp: 4.369 ± 0.481
2.061TyrGlu: 2.061 ± 0.297
2.968TyrPhe: 2.968 ± 0.148
2.391TyrGly: 2.391 ± 0.442
2.061TyrHis: 2.061 ± 0.253
3.874TyrIle: 3.874 ± 0.874
3.957TyrLys: 3.957 ± 0.517
4.864TyrLeu: 4.864 ± 0.807
0.577TyrMet: 0.577 ± 0.101
3.627TyrAsn: 3.627 ± 0.184
2.72TyrPro: 2.72 ± 0.171
2.061TyrGln: 2.061 ± 0.297
3.71TyrArg: 3.71 ± 0.507
2.638TyrSer: 2.638 ± 0.415
6.018TyrThr: 6.018 ± 0.493
3.71TyrVal: 3.71 ± 0.691
0.165TyrTrp: 0.165 ± 0.282
3.957TyrTyr: 3.957 ± 0.995
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (12132 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski