Amino acid dipepetide frequency for Wuhan Insect virus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.721AlaAla: 4.721 ± 2.415
0.262AlaCys: 0.262 ± 0.148
4.458AlaAsp: 4.458 ± 0.699
4.196AlaGlu: 4.196 ± 0.844
1.311AlaPhe: 1.311 ± 0.583
2.885AlaGly: 2.885 ± 1.052
0.787AlaHis: 0.787 ± 0.673
4.458AlaIle: 4.458 ± 1.569
3.147AlaLys: 3.147 ± 1.701
4.196AlaLeu: 4.196 ± 1.193
1.049AlaMet: 1.049 ± 0.323
2.098AlaAsn: 2.098 ± 1.28
1.836AlaPro: 1.836 ± 1.041
1.836AlaGln: 1.836 ± 1.309
2.623AlaArg: 2.623 ± 1.315
6.557AlaSer: 6.557 ± 1.893
3.672AlaThr: 3.672 ± 1.647
5.245AlaVal: 5.245 ± 1.66
0.262AlaTrp: 0.262 ± 0.148
2.623AlaTyr: 2.623 ± 0.52
0.0AlaXaa: 0.0 ± 0.0
Cys
1.049CysAla: 1.049 ± 0.651
0.787CysCys: 0.787 ± 0.445
1.311CysAsp: 1.311 ± 0.57
0.787CysGlu: 0.787 ± 0.312
0.787CysPhe: 0.787 ± 0.312
1.836CysGly: 1.836 ± 0.74
0.262CysHis: 0.262 ± 0.148
0.787CysIle: 0.787 ± 0.445
0.525CysLys: 0.525 ± 0.297
1.574CysLeu: 1.574 ± 0.679
0.525CysMet: 0.525 ± 0.297
1.049CysAsn: 1.049 ± 0.414
1.836CysPro: 1.836 ± 0.83
0.262CysGln: 0.262 ± 0.524
1.311CysArg: 1.311 ± 0.777
2.623CysSer: 2.623 ± 0.806
0.525CysThr: 0.525 ± 0.259
1.574CysVal: 1.574 ± 0.857
0.262CysTrp: 0.262 ± 0.607
1.049CysTyr: 1.049 ± 0.414
0.0CysXaa: 0.0 ± 0.0
Asp
2.623AspAla: 2.623 ± 0.735
1.574AspCys: 1.574 ± 0.857
3.409AspAsp: 3.409 ± 0.832
2.36AspGlu: 2.36 ± 0.759
2.885AspPhe: 2.885 ± 0.844
4.983AspGly: 4.983 ± 0.885
1.049AspHis: 1.049 ± 0.519
4.983AspIle: 4.983 ± 0.664
3.672AspLys: 3.672 ± 0.547
6.557AspLeu: 6.557 ± 0.906
2.098AspMet: 2.098 ± 0.572
3.147AspAsn: 3.147 ± 1.049
2.885AspPro: 2.885 ± 0.963
1.836AspGln: 1.836 ± 0.743
2.098AspArg: 2.098 ± 0.684
4.458AspSer: 4.458 ± 0.909
2.098AspThr: 2.098 ± 0.646
4.196AspVal: 4.196 ± 0.945
1.049AspTrp: 1.049 ± 0.571
1.574AspTyr: 1.574 ± 0.544
0.0AspXaa: 0.0 ± 0.0
Glu
4.458GluAla: 4.458 ± 0.971
1.049GluCys: 1.049 ± 0.803
2.885GluAsp: 2.885 ± 0.812
3.409GluGlu: 3.409 ± 1.322
2.623GluPhe: 2.623 ± 0.664
2.623GluGly: 2.623 ± 0.927
0.525GluHis: 0.525 ± 0.299
4.721GluIle: 4.721 ± 1.064
4.196GluLys: 4.196 ± 0.671
4.458GluLeu: 4.458 ± 1.124
1.574GluMet: 1.574 ± 0.679
1.574GluAsn: 1.574 ± 0.63
2.098GluPro: 2.098 ± 0.717
1.574GluGln: 1.574 ± 0.859
2.36GluArg: 2.36 ± 0.618
5.77GluSer: 5.77 ± 0.959
2.885GluThr: 2.885 ± 0.707
4.721GluVal: 4.721 ± 1.182
1.836GluTrp: 1.836 ± 0.602
1.574GluTyr: 1.574 ± 0.679
0.0GluXaa: 0.0 ± 0.0
Phe
1.836PheAla: 1.836 ± 0.544
1.049PheCys: 1.049 ± 0.468
1.049PheAsp: 1.049 ± 0.514
1.311PheGlu: 1.311 ± 0.633
1.311PhePhe: 1.311 ± 0.57
1.836PheGly: 1.836 ± 0.653
1.049PheHis: 1.049 ± 0.556
2.36PheIle: 2.36 ± 0.723
2.885PheLys: 2.885 ± 0.405
3.409PheLeu: 3.409 ± 0.673
1.574PheMet: 1.574 ± 0.469
1.574PheAsn: 1.574 ± 0.551
2.36PhePro: 2.36 ± 1.066
0.787PheGln: 0.787 ± 0.544
2.098PheArg: 2.098 ± 0.663
3.409PheSer: 3.409 ± 0.537
3.409PheThr: 3.409 ± 1.292
2.098PheVal: 2.098 ± 0.576
0.262PheTrp: 0.262 ± 0.352
0.787PheTyr: 0.787 ± 0.44
0.0PheXaa: 0.0 ± 0.0
Gly
3.934GlyAla: 3.934 ± 1.393
1.311GlyCys: 1.311 ± 0.777
2.885GlyAsp: 2.885 ± 1.071
2.885GlyGlu: 2.885 ± 1.197
2.36GlyPhe: 2.36 ± 0.652
3.934GlyGly: 3.934 ± 2.614
2.36GlyHis: 2.36 ± 0.821
2.098GlyIle: 2.098 ± 1.002
4.196GlyLys: 4.196 ± 1.069
5.507GlyLeu: 5.507 ± 1.434
2.623GlyMet: 2.623 ± 0.943
2.36GlyAsn: 2.36 ± 0.779
1.574GlyPro: 1.574 ± 0.986
1.311GlyGln: 1.311 ± 0.6
3.672GlyArg: 3.672 ± 0.439
5.77GlySer: 5.77 ± 0.8
2.098GlyThr: 2.098 ± 0.467
4.196GlyVal: 4.196 ± 1.365
0.525GlyTrp: 0.525 ± 0.299
2.623GlyTyr: 2.623 ± 0.571
0.0GlyXaa: 0.0 ± 0.0
His
0.787HisAla: 0.787 ± 0.315
0.0HisCys: 0.0 ± 0.0
1.836HisAsp: 1.836 ± 0.529
0.262HisGlu: 0.262 ± 0.352
1.049HisPhe: 1.049 ± 0.414
1.574HisGly: 1.574 ± 0.551
1.049HisHis: 1.049 ± 0.414
1.049HisIle: 1.049 ± 0.414
1.574HisLys: 1.574 ± 0.672
2.36HisLeu: 2.36 ± 0.835
1.311HisMet: 1.311 ± 0.554
1.049HisAsn: 1.049 ± 0.442
1.049HisPro: 1.049 ± 0.594
0.525HisGln: 0.525 ± 0.557
0.787HisArg: 0.787 ± 0.315
1.836HisSer: 1.836 ± 0.602
0.0HisThr: 0.0 ± 0.0
1.574HisVal: 1.574 ± 0.727
0.0HisTrp: 0.0 ± 0.0
0.787HisTyr: 0.787 ± 1.155
0.0HisXaa: 0.0 ± 0.0
Ile
2.885IleAla: 2.885 ± 1.825
0.525IleCys: 0.525 ± 0.259
2.623IleAsp: 2.623 ± 1.064
3.934IleGlu: 3.934 ± 1.025
2.098IlePhe: 2.098 ± 0.665
3.934IleGly: 3.934 ± 1.015
2.098IleHis: 2.098 ± 0.873
4.458IleIle: 4.458 ± 1.41
3.672IleLys: 3.672 ± 1.502
5.245IleLeu: 5.245 ± 1.397
2.623IleMet: 2.623 ± 0.861
2.885IleAsn: 2.885 ± 0.853
2.623IlePro: 2.623 ± 1.037
2.885IleGln: 2.885 ± 0.685
3.934IleArg: 3.934 ± 1.283
6.557IleSer: 6.557 ± 1.161
2.623IleThr: 2.623 ± 0.79
3.409IleVal: 3.409 ± 0.537
0.525IleTrp: 0.525 ± 0.259
3.672IleTyr: 3.672 ± 0.68
0.0IleXaa: 0.0 ± 0.0
Lys
3.934LysAla: 3.934 ± 1.479
1.049LysCys: 1.049 ± 0.468
2.623LysAsp: 2.623 ± 1.615
4.196LysGlu: 4.196 ± 1.541
1.311LysPhe: 1.311 ± 1.171
1.836LysGly: 1.836 ± 0.538
1.574LysHis: 1.574 ± 0.429
6.557LysIle: 6.557 ± 0.989
3.409LysLys: 3.409 ± 1.094
6.032LysLeu: 6.032 ± 1.198
3.672LysMet: 3.672 ± 0.844
3.934LysAsn: 3.934 ± 1.22
3.672LysPro: 3.672 ± 1.382
2.098LysGln: 2.098 ± 0.881
1.574LysArg: 1.574 ± 0.672
4.983LysSer: 4.983 ± 1.587
2.885LysThr: 2.885 ± 0.476
4.721LysVal: 4.721 ± 1.406
0.525LysTrp: 0.525 ± 0.259
2.098LysTyr: 2.098 ± 0.593
0.0LysXaa: 0.0 ± 0.0
Leu
6.032LeuAla: 6.032 ± 1.564
3.409LeuCys: 3.409 ± 1.043
5.507LeuAsp: 5.507 ± 1.006
5.245LeuGlu: 5.245 ± 0.692
4.196LeuPhe: 4.196 ± 0.733
4.721LeuGly: 4.721 ± 0.71
1.836LeuHis: 1.836 ± 0.571
4.721LeuIle: 4.721 ± 1.209
5.507LeuLys: 5.507 ± 0.946
9.704LeuLeu: 9.704 ± 2.345
3.672LeuMet: 3.672 ± 0.82
3.147LeuAsn: 3.147 ± 0.648
3.672LeuPro: 3.672 ± 1.318
1.574LeuGln: 1.574 ± 0.999
4.458LeuArg: 4.458 ± 1.721
9.966LeuSer: 9.966 ± 1.807
5.77LeuThr: 5.77 ± 1.4
4.196LeuVal: 4.196 ± 0.835
1.574LeuTrp: 1.574 ± 0.891
3.672LeuTyr: 3.672 ± 1.301
0.0LeuXaa: 0.0 ± 0.0
Met
2.36MetAla: 2.36 ± 0.748
0.525MetCys: 0.525 ± 0.297
2.098MetAsp: 2.098 ± 0.872
1.574MetGlu: 1.574 ± 0.818
2.36MetPhe: 2.36 ± 1.336
1.836MetGly: 1.836 ± 0.686
1.311MetHis: 1.311 ± 0.538
2.885MetIle: 2.885 ± 0.893
0.787MetLys: 0.787 ± 0.544
2.36MetLeu: 2.36 ± 1.164
0.787MetMet: 0.787 ± 0.746
1.574MetAsn: 1.574 ± 0.672
2.36MetPro: 2.36 ± 0.91
0.787MetGln: 0.787 ± 0.315
4.196MetArg: 4.196 ± 1.233
3.147MetSer: 3.147 ± 0.857
2.36MetThr: 2.36 ± 1.066
2.36MetVal: 2.36 ± 0.975
0.525MetTrp: 0.525 ± 0.449
0.262MetTyr: 0.262 ± 0.285
0.0MetXaa: 0.0 ± 0.0
Asn
2.098AsnAla: 2.098 ± 0.724
0.787AsnCys: 0.787 ± 0.312
3.672AsnAsp: 3.672 ± 1.323
2.623AsnGlu: 2.623 ± 0.652
1.836AsnPhe: 1.836 ± 0.812
2.36AsnGly: 2.36 ± 0.656
0.787AsnHis: 0.787 ± 0.548
1.836AsnIle: 1.836 ± 1.299
1.049AsnLys: 1.049 ± 0.803
4.458AsnLeu: 4.458 ± 0.717
1.311AsnMet: 1.311 ± 0.756
0.787AsnAsn: 0.787 ± 0.381
2.36AsnPro: 2.36 ± 0.779
1.836AsnGln: 1.836 ± 0.601
1.311AsnArg: 1.311 ± 0.537
2.885AsnSer: 2.885 ± 0.908
1.574AsnThr: 1.574 ± 0.544
2.36AsnVal: 2.36 ± 0.756
0.525AsnTrp: 0.525 ± 0.297
0.787AsnTyr: 0.787 ± 0.312
0.0AsnXaa: 0.0 ± 0.0
Pro
1.574ProAla: 1.574 ± 1.129
0.262ProCys: 0.262 ± 0.607
4.196ProAsp: 4.196 ± 0.953
1.836ProGlu: 1.836 ± 0.544
0.787ProPhe: 0.787 ± 0.494
1.836ProGly: 1.836 ± 1.109
0.787ProHis: 0.787 ± 0.312
2.36ProIle: 2.36 ± 0.791
3.147ProLys: 3.147 ± 0.557
4.196ProLeu: 4.196 ± 1.291
1.311ProMet: 1.311 ± 0.472
2.36ProAsn: 2.36 ± 0.779
1.574ProPro: 1.574 ± 0.727
2.098ProGln: 2.098 ± 0.528
3.147ProArg: 3.147 ± 1.027
4.721ProSer: 4.721 ± 2.074
3.672ProThr: 3.672 ± 1.061
3.147ProVal: 3.147 ± 1.619
0.0ProTrp: 0.0 ± 0.0
2.098ProTyr: 2.098 ± 0.828
0.0ProXaa: 0.0 ± 0.0
Gln
3.147GlnAla: 3.147 ± 1.775
0.525GlnCys: 0.525 ± 0.259
0.787GlnAsp: 0.787 ± 0.609
2.36GlnGlu: 2.36 ± 0.678
1.049GlnPhe: 1.049 ± 0.519
0.262GlnGly: 0.262 ± 0.352
0.0GlnHis: 0.0 ± 0.0
2.36GlnIle: 2.36 ± 1.049
2.098GlnLys: 2.098 ± 1.612
2.885GlnLeu: 2.885 ± 0.581
0.525GlnMet: 0.525 ± 0.557
0.787GlnAsn: 0.787 ± 0.312
2.098GlnPro: 2.098 ± 1.346
0.525GlnGln: 0.525 ± 0.704
2.098GlnArg: 2.098 ± 0.909
2.36GlnSer: 2.36 ± 0.656
2.098GlnThr: 2.098 ± 0.927
2.623GlnVal: 2.623 ± 1.034
0.525GlnTrp: 0.525 ± 0.299
0.787GlnTyr: 0.787 ± 0.664
0.0GlnXaa: 0.0 ± 0.0
Arg
2.885ArgAla: 2.885 ± 1.046
2.098ArgCys: 2.098 ± 0.513
3.147ArgAsp: 3.147 ± 0.592
3.409ArgGlu: 3.409 ± 1.068
1.574ArgPhe: 1.574 ± 0.634
2.885ArgGly: 2.885 ± 0.701
0.787ArgHis: 0.787 ± 0.445
2.098ArgIle: 2.098 ± 0.739
3.147ArgLys: 3.147 ± 0.777
5.77ArgLeu: 5.77 ± 1.973
2.36ArgMet: 2.36 ± 1.04
1.574ArgAsn: 1.574 ± 0.544
2.098ArgPro: 2.098 ± 0.591
2.098ArgGln: 2.098 ± 0.782
4.458ArgArg: 4.458 ± 1.845
6.294ArgSer: 6.294 ± 2.365
3.147ArgThr: 3.147 ± 1.473
1.574ArgVal: 1.574 ± 0.469
1.049ArgTrp: 1.049 ± 0.414
2.36ArgTyr: 2.36 ± 1.04
0.0ArgXaa: 0.0 ± 0.0
Ser
4.721SerAla: 4.721 ± 1.792
2.098SerCys: 2.098 ± 0.786
6.819SerAsp: 6.819 ± 0.594
4.983SerGlu: 4.983 ± 1.181
4.196SerPhe: 4.196 ± 1.464
6.819SerGly: 6.819 ± 1.308
0.525SerHis: 0.525 ± 0.297
4.458SerIle: 4.458 ± 1.646
7.868SerLys: 7.868 ± 1.96
7.868SerLeu: 7.868 ± 1.424
2.885SerMet: 2.885 ± 0.722
2.36SerAsn: 2.36 ± 1.06
4.196SerPro: 4.196 ± 0.324
3.934SerGln: 3.934 ± 1.231
6.032SerArg: 6.032 ± 1.643
7.868SerSer: 7.868 ± 1.692
5.507SerThr: 5.507 ± 1.487
6.294SerVal: 6.294 ± 0.895
1.836SerTrp: 1.836 ± 0.595
3.672SerTyr: 3.672 ± 0.518
0.0SerXaa: 0.0 ± 0.0
Thr
2.885ThrAla: 2.885 ± 1.03
0.525ThrCys: 0.525 ± 0.259
3.934ThrAsp: 3.934 ± 0.648
4.458ThrGlu: 4.458 ± 1.116
1.836ThrPhe: 1.836 ± 0.812
3.934ThrGly: 3.934 ± 0.996
0.787ThrHis: 0.787 ± 0.524
4.196ThrIle: 4.196 ± 1.125
3.409ThrLys: 3.409 ± 1.059
4.458ThrLeu: 4.458 ± 0.856
2.36ThrMet: 2.36 ± 0.764
1.836ThrAsn: 1.836 ± 0.688
1.311ThrPro: 1.311 ± 0.596
1.836ThrGln: 1.836 ± 1.287
2.36ThrArg: 2.36 ± 0.71
3.934ThrSer: 3.934 ± 1.113
2.098ThrThr: 2.098 ± 0.591
4.983ThrVal: 4.983 ± 1.098
1.049ThrTrp: 1.049 ± 0.391
2.623ThrTyr: 2.623 ± 0.805
0.0ThrXaa: 0.0 ± 0.0
Val
3.934ValAla: 3.934 ± 1.084
1.574ValCys: 1.574 ± 0.299
2.885ValAsp: 2.885 ± 1.123
3.672ValGlu: 3.672 ± 1.263
1.574ValPhe: 1.574 ± 0.593
4.721ValGly: 4.721 ± 1.67
2.098ValHis: 2.098 ± 1.676
2.885ValIle: 2.885 ± 1.297
4.458ValLys: 4.458 ± 2.843
6.819ValLeu: 6.819 ± 1.968
2.885ValMet: 2.885 ± 0.861
1.311ValAsn: 1.311 ± 0.411
2.623ValPro: 2.623 ± 0.576
1.311ValGln: 1.311 ± 0.6
4.458ValArg: 4.458 ± 1.646
7.343ValSer: 7.343 ± 3.627
5.245ValThr: 5.245 ± 0.984
4.458ValVal: 4.458 ± 0.851
0.787ValTrp: 0.787 ± 0.315
2.36ValTyr: 2.36 ± 0.524
0.0ValXaa: 0.0 ± 0.0
Trp
0.262TrpAla: 0.262 ± 0.285
0.262TrpCys: 0.262 ± 0.148
1.311TrpAsp: 1.311 ± 0.349
0.525TrpGlu: 0.525 ± 0.297
0.525TrpPhe: 0.525 ± 0.557
1.311TrpGly: 1.311 ± 0.633
0.0TrpHis: 0.0 ± 0.0
2.098TrpIle: 2.098 ± 0.633
0.525TrpLys: 0.525 ± 0.449
1.311TrpLeu: 1.311 ± 0.349
0.525TrpMet: 0.525 ± 0.293
0.525TrpAsn: 0.525 ± 0.297
0.262TrpPro: 0.262 ± 0.285
0.0TrpGln: 0.0 ± 0.0
0.787TrpArg: 0.787 ± 0.445
1.049TrpSer: 1.049 ± 0.571
0.787TrpThr: 0.787 ± 0.445
1.049TrpVal: 1.049 ± 0.598
0.262TrpTrp: 0.262 ± 0.148
0.262TrpTyr: 0.262 ± 0.352
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.098TyrAla: 2.098 ± 0.464
1.049TyrCys: 1.049 ± 0.442
2.098TyrAsp: 2.098 ± 0.591
2.885TyrGlu: 2.885 ± 0.907
0.787TyrPhe: 0.787 ± 0.977
2.098TyrGly: 2.098 ± 0.45
0.787TyrHis: 0.787 ± 0.44
1.836TyrIle: 1.836 ± 0.538
3.409TyrLys: 3.409 ± 2.031
3.409TyrLeu: 3.409 ± 1.618
0.525TyrMet: 0.525 ± 0.297
1.311TyrAsn: 1.311 ± 0.349
2.623TyrPro: 2.623 ± 0.736
0.787TyrGln: 0.787 ± 0.315
1.049TyrArg: 1.049 ± 0.571
3.409TyrSer: 3.409 ± 0.895
2.623TyrThr: 2.623 ± 0.634
2.623TyrVal: 2.623 ± 0.861
0.262TyrTrp: 0.262 ± 0.285
1.049TyrTyr: 1.049 ± 0.372
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3814 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski