Amino acid dipepetide frequency for Wenzhou Crab Virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.03AlaAla: 3.03 ± 0.822
1.01AlaCys: 1.01 ± 0.347
2.525AlaAsp: 2.525 ± 0.862
2.777AlaGlu: 2.777 ± 0.782
1.01AlaPhe: 1.01 ± 0.618
2.272AlaGly: 2.272 ± 0.548
2.525AlaHis: 2.525 ± 0.822
4.292AlaIle: 4.292 ± 0.922
3.787AlaLys: 3.787 ± 0.447
5.049AlaLeu: 5.049 ± 0.853
1.262AlaMet: 1.262 ± 0.352
1.767AlaAsn: 1.767 ± 0.566
1.01AlaPro: 1.01 ± 0.412
2.272AlaGln: 2.272 ± 0.66
1.767AlaArg: 1.767 ± 0.434
4.544AlaSer: 4.544 ± 1.105
3.282AlaThr: 3.282 ± 0.684
3.03AlaVal: 3.03 ± 0.803
0.757AlaTrp: 0.757 ± 0.451
1.515AlaTyr: 1.515 ± 0.905
0.0AlaXaa: 0.0 ± 0.0
Cys
1.262CysAla: 1.262 ± 1.074
0.757CysCys: 0.757 ± 0.3
0.505CysAsp: 0.505 ± 0.301
0.252CysGlu: 0.252 ± 0.253
1.767CysPhe: 1.767 ± 0.471
0.757CysGly: 0.757 ± 0.237
1.262CysHis: 1.262 ± 0.479
1.515CysIle: 1.515 ± 0.6
0.505CysLys: 0.505 ± 0.301
3.03CysLeu: 3.03 ± 0.696
0.252CysMet: 0.252 ± 0.358
1.262CysAsn: 1.262 ± 0.256
1.01CysPro: 1.01 ± 0.412
0.505CysGln: 0.505 ± 0.301
0.757CysArg: 0.757 ± 0.237
2.02CysSer: 2.02 ± 0.826
0.757CysThr: 0.757 ± 0.451
0.757CysVal: 0.757 ± 0.654
0.252CysTrp: 0.252 ± 0.15
1.01CysTyr: 1.01 ± 0.413
0.0CysXaa: 0.0 ± 0.0
Asp
1.262AspAla: 1.262 ± 0.515
1.515AspCys: 1.515 ± 0.619
5.554AspAsp: 5.554 ± 0.616
4.292AspGlu: 4.292 ± 0.593
3.534AspPhe: 3.534 ± 0.615
2.272AspGly: 2.272 ± 0.427
2.777AspHis: 2.777 ± 1.046
5.807AspIle: 5.807 ± 1.076
2.777AspLys: 2.777 ± 1.445
7.069AspLeu: 7.069 ± 0.808
1.01AspMet: 1.01 ± 0.597
1.262AspAsn: 1.262 ± 0.591
2.02AspPro: 2.02 ± 0.092
3.282AspGln: 3.282 ± 0.814
1.767AspArg: 1.767 ± 0.566
6.816AspSer: 6.816 ± 1.393
3.03AspThr: 3.03 ± 0.591
3.787AspVal: 3.787 ± 0.839
0.505AspTrp: 0.505 ± 0.193
2.02AspTyr: 2.02 ± 0.655
0.0AspXaa: 0.0 ± 0.0
Glu
1.515GluAla: 1.515 ± 0.377
1.01GluCys: 1.01 ± 0.602
3.282GluAsp: 3.282 ± 0.557
3.787GluGlu: 3.787 ± 0.973
4.039GluPhe: 4.039 ± 0.836
5.049GluGly: 5.049 ± 0.682
1.262GluHis: 1.262 ± 0.256
4.292GluIle: 4.292 ± 0.571
2.272GluLys: 2.272 ± 1.058
5.302GluLeu: 5.302 ± 0.748
1.262GluMet: 1.262 ± 0.346
2.02GluAsn: 2.02 ± 0.719
1.515GluPro: 1.515 ± 0.297
1.515GluGln: 1.515 ± 0.348
1.01GluArg: 1.01 ± 0.327
1.767GluSer: 1.767 ± 0.574
2.02GluThr: 2.02 ± 0.907
3.282GluVal: 3.282 ± 1.271
1.515GluTrp: 1.515 ± 0.348
1.01GluTyr: 1.01 ± 0.459
0.0GluXaa: 0.0 ± 0.0
Phe
2.02PheAla: 2.02 ± 0.607
0.252PheCys: 0.252 ± 0.253
3.534PheAsp: 3.534 ± 0.476
1.262PheGlu: 1.262 ± 0.346
2.525PhePhe: 2.525 ± 0.807
1.767PheGly: 1.767 ± 0.17
1.515PheHis: 1.515 ± 0.377
6.564PheIle: 6.564 ± 1.229
2.272PheLys: 2.272 ± 0.427
5.807PheLeu: 5.807 ± 1.339
0.757PheMet: 0.757 ± 0.3
2.272PheAsn: 2.272 ± 0.542
2.272PhePro: 2.272 ± 0.739
2.02PheGln: 2.02 ± 0.82
2.02PheArg: 2.02 ± 1.263
7.321PheSer: 7.321 ± 0.719
2.525PheThr: 2.525 ± 0.306
2.525PheVal: 2.525 ± 0.45
0.252PheTrp: 0.252 ± 0.15
0.757PheTyr: 0.757 ± 0.451
0.0PheXaa: 0.0 ± 0.0
Gly
2.777GlyAla: 2.777 ± 0.686
1.767GlyCys: 1.767 ± 0.489
3.03GlyAsp: 3.03 ± 1.461
1.515GlyGlu: 1.515 ± 0.393
2.272GlyPhe: 2.272 ± 0.736
2.525GlyGly: 2.525 ± 0.781
1.01GlyHis: 1.01 ± 0.665
4.039GlyIle: 4.039 ± 0.991
2.272GlyLys: 2.272 ± 0.446
3.282GlyLeu: 3.282 ± 0.808
1.767GlyMet: 1.767 ± 0.53
1.262GlyAsn: 1.262 ± 0.656
0.505GlyPro: 0.505 ± 0.301
2.777GlyGln: 2.777 ± 0.363
1.767GlyArg: 1.767 ± 0.798
4.292GlySer: 4.292 ± 0.773
2.777GlyThr: 2.777 ± 0.712
3.787GlyVal: 3.787 ± 0.654
0.505GlyTrp: 0.505 ± 0.301
2.525GlyTyr: 2.525 ± 0.59
0.0GlyXaa: 0.0 ± 0.0
His
1.262HisAla: 1.262 ± 1.074
0.505HisCys: 0.505 ± 0.413
2.272HisAsp: 2.272 ± 0.808
1.262HisGlu: 1.262 ± 0.256
1.515HisPhe: 1.515 ± 0.667
1.262HisGly: 1.262 ± 0.515
1.767HisHis: 1.767 ± 0.541
2.777HisIle: 2.777 ± 0.758
1.262HisLys: 1.262 ± 0.238
2.777HisLeu: 2.777 ± 0.311
0.252HisMet: 0.252 ± 0.358
2.02HisAsn: 2.02 ± 0.092
1.515HisPro: 1.515 ± 0.474
2.272HisGln: 2.272 ± 0.808
1.262HisArg: 1.262 ± 0.525
2.02HisSer: 2.02 ± 0.092
1.01HisThr: 1.01 ± 0.235
0.757HisVal: 0.757 ± 0.237
0.0HisTrp: 0.0 ± 0.0
1.767HisTyr: 1.767 ± 0.221
0.0HisXaa: 0.0 ± 0.0
Ile
4.797IleAla: 4.797 ± 1.197
1.01IleCys: 1.01 ± 0.412
4.039IleAsp: 4.039 ± 0.785
3.534IleGlu: 3.534 ± 1.319
1.515IlePhe: 1.515 ± 0.474
4.292IleGly: 4.292 ± 0.719
2.272IleHis: 2.272 ± 0.468
6.816IleIle: 6.816 ± 2.174
7.321IleLys: 7.321 ± 1.104
6.816IleLeu: 6.816 ± 1.517
2.272IleMet: 2.272 ± 0.709
5.049IleAsn: 5.049 ± 1.18
5.302IlePro: 5.302 ± 0.601
2.777IleGln: 2.777 ± 0.565
2.02IleArg: 2.02 ± 0.548
8.079IleSer: 8.079 ± 1.104
4.797IleThr: 4.797 ± 1.037
4.292IleVal: 4.292 ± 0.862
0.252IleTrp: 0.252 ± 0.327
1.767IleTyr: 1.767 ± 0.17
0.0IleXaa: 0.0 ± 0.0
Lys
3.282LysAla: 3.282 ± 0.676
1.767LysCys: 1.767 ± 0.489
4.544LysAsp: 4.544 ± 1.807
3.787LysGlu: 3.787 ± 0.561
2.525LysPhe: 2.525 ± 0.607
2.272LysGly: 2.272 ± 0.937
1.262LysHis: 1.262 ± 0.256
6.564LysIle: 6.564 ± 0.835
2.272LysLys: 2.272 ± 0.446
6.059LysLeu: 6.059 ± 0.996
0.757LysMet: 0.757 ± 0.288
3.282LysAsn: 3.282 ± 0.44
2.272LysPro: 2.272 ± 0.397
2.02LysGln: 2.02 ± 0.821
2.777LysArg: 2.777 ± 0.269
3.534LysSer: 3.534 ± 0.287
3.282LysThr: 3.282 ± 0.661
3.787LysVal: 3.787 ± 1.037
0.757LysTrp: 0.757 ± 0.451
3.282LysTyr: 3.282 ± 0.458
0.0LysXaa: 0.0 ± 0.0
Leu
5.049LeuAla: 5.049 ± 2.578
1.767LeuCys: 1.767 ± 0.736
5.049LeuAsp: 5.049 ± 1.359
5.049LeuGlu: 5.049 ± 1.143
4.292LeuPhe: 4.292 ± 0.43
7.321LeuGly: 7.321 ± 0.816
2.02LeuHis: 2.02 ± 0.562
10.098LeuIle: 10.098 ± 1.491
11.108LeuLys: 11.108 ± 1.88
11.613LeuLeu: 11.613 ± 2.287
1.767LeuMet: 1.767 ± 0.394
6.312LeuAsn: 6.312 ± 1.152
5.049LeuPro: 5.049 ± 1.075
3.787LeuGln: 3.787 ± 0.951
7.321LeuArg: 7.321 ± 0.899
9.594LeuSer: 9.594 ± 0.545
7.069LeuThr: 7.069 ± 1.074
5.554LeuVal: 5.554 ± 0.622
2.02LeuTrp: 2.02 ± 0.644
3.282LeuTyr: 3.282 ± 0.505
0.0LeuXaa: 0.0 ± 0.0
Met
1.767MetAla: 1.767 ± 0.764
0.757MetCys: 0.757 ± 0.451
1.515MetAsp: 1.515 ± 0.662
2.525MetGlu: 2.525 ± 0.683
0.505MetPhe: 0.505 ± 0.715
1.01MetGly: 1.01 ± 0.274
0.0MetHis: 0.0 ± 0.0
2.777MetIle: 2.777 ± 0.398
0.757MetLys: 0.757 ± 0.322
2.525MetLeu: 2.525 ± 1.577
1.515MetMet: 1.515 ± 0.516
0.757MetAsn: 0.757 ± 0.425
0.505MetPro: 0.505 ± 0.301
0.505MetGln: 0.505 ± 0.311
1.515MetArg: 1.515 ± 0.619
2.525MetSer: 2.525 ± 0.571
2.272MetThr: 2.272 ± 0.86
1.01MetVal: 1.01 ± 0.413
0.0MetTrp: 0.0 ± 0.0
1.01MetTyr: 1.01 ± 0.602
0.0MetXaa: 0.0 ± 0.0
Asn
1.262AsnAla: 1.262 ± 0.605
1.01AsnCys: 1.01 ± 0.623
3.03AsnAsp: 3.03 ± 0.786
4.544AsnGlu: 4.544 ± 0.681
2.272AsnPhe: 2.272 ± 1.229
1.262AsnGly: 1.262 ± 0.727
1.262AsnHis: 1.262 ± 0.609
3.787AsnIle: 3.787 ± 0.904
0.757AsnLys: 0.757 ± 0.425
5.302AsnLeu: 5.302 ± 0.742
1.515AsnMet: 1.515 ± 0.652
3.03AsnAsn: 3.03 ± 0.805
1.262AsnPro: 1.262 ± 0.605
2.272AsnGln: 2.272 ± 0.937
1.262AsnArg: 1.262 ± 0.562
3.787AsnSer: 3.787 ± 0.495
2.525AsnThr: 2.525 ± 0.499
2.272AsnVal: 2.272 ± 0.542
0.252AsnTrp: 0.252 ± 0.15
2.525AsnTyr: 2.525 ± 0.261
0.0AsnXaa: 0.0 ± 0.0
Pro
1.01ProAla: 1.01 ± 0.274
0.252ProCys: 0.252 ± 0.358
3.282ProAsp: 3.282 ± 0.458
1.767ProGlu: 1.767 ± 0.541
2.777ProPhe: 2.777 ± 0.742
1.262ProGly: 1.262 ± 0.479
1.01ProHis: 1.01 ± 0.347
1.515ProIle: 1.515 ± 0.634
1.767ProLys: 1.767 ± 0.574
6.312ProLeu: 6.312 ± 1.714
1.01ProMet: 1.01 ± 0.387
1.767ProAsn: 1.767 ± 0.583
1.01ProPro: 1.01 ± 0.502
2.525ProGln: 2.525 ± 0.795
2.02ProArg: 2.02 ± 0.453
3.03ProSer: 3.03 ± 0.568
2.02ProThr: 2.02 ± 0.399
2.777ProVal: 2.777 ± 0.845
0.252ProTrp: 0.252 ± 0.15
0.505ProTyr: 0.505 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
3.787GlnAla: 3.787 ± 1.306
0.252GlnCys: 0.252 ± 0.253
4.039GlnAsp: 4.039 ± 1.333
1.262GlnGlu: 1.262 ± 0.479
3.534GlnPhe: 3.534 ± 1.165
0.757GlnGly: 0.757 ± 0.237
1.01GlnHis: 1.01 ± 0.631
1.515GlnIle: 1.515 ± 0.142
1.767GlnLys: 1.767 ± 0.702
5.807GlnLeu: 5.807 ± 0.634
2.02GlnMet: 2.02 ± 0.942
1.01GlnAsn: 1.01 ± 0.602
1.515GlnPro: 1.515 ± 0.543
1.767GlnGln: 1.767 ± 0.394
1.262GlnArg: 1.262 ± 0.238
3.282GlnSer: 3.282 ± 0.733
1.515GlnThr: 1.515 ± 0.474
3.03GlnVal: 3.03 ± 0.417
0.0GlnTrp: 0.0 ± 0.0
1.262GlnTyr: 1.262 ± 0.53
0.0GlnXaa: 0.0 ± 0.0
Arg
2.525ArgAla: 2.525 ± 0.261
1.01ArgCys: 1.01 ± 0.602
2.525ArgAsp: 2.525 ± 1.065
1.262ArgGlu: 1.262 ± 0.53
3.787ArgPhe: 3.787 ± 0.561
1.515ArgGly: 1.515 ± 0.58
0.757ArgHis: 0.757 ± 0.321
2.272ArgIle: 2.272 ± 0.427
2.777ArgLys: 2.777 ± 1.154
4.544ArgLeu: 4.544 ± 1.16
1.262ArgMet: 1.262 ± 0.479
1.262ArgAsn: 1.262 ± 0.303
1.515ArgPro: 1.515 ± 0.297
0.505ArgGln: 0.505 ± 0.42
1.767ArgArg: 1.767 ± 1.053
4.797ArgSer: 4.797 ± 1.483
3.03ArgThr: 3.03 ± 0.871
3.03ArgVal: 3.03 ± 1.043
0.505ArgTrp: 0.505 ± 0.301
0.757ArgTyr: 0.757 ± 0.333
0.0ArgXaa: 0.0 ± 0.0
Ser
5.807SerAla: 5.807 ± 1.76
1.767SerCys: 1.767 ± 0.53
4.797SerAsp: 4.797 ± 0.83
3.787SerGlu: 3.787 ± 0.768
4.292SerPhe: 4.292 ± 0.791
3.534SerGly: 3.534 ± 0.541
3.282SerHis: 3.282 ± 1.39
4.544SerIle: 4.544 ± 0.889
5.807SerLys: 5.807 ± 1.632
11.866SerLeu: 11.866 ± 1.196
2.777SerMet: 2.777 ± 0.311
3.03SerAsn: 3.03 ± 0.326
2.777SerPro: 2.777 ± 1.126
4.544SerGln: 4.544 ± 0.853
4.797SerArg: 4.797 ± 1.138
9.089SerSer: 9.089 ± 1.006
4.292SerThr: 4.292 ± 0.945
4.797SerVal: 4.797 ± 0.626
1.262SerTrp: 1.262 ± 0.554
4.292SerTyr: 4.292 ± 0.733
0.0SerXaa: 0.0 ± 0.0
Thr
1.262ThrAla: 1.262 ± 0.256
1.01ThrCys: 1.01 ± 0.631
3.282ThrAsp: 3.282 ± 0.488
1.515ThrGlu: 1.515 ± 0.902
3.03ThrPhe: 3.03 ± 0.228
4.039ThrGly: 4.039 ± 1.584
1.767ThrHis: 1.767 ± 0.221
3.03ThrIle: 3.03 ± 0.652
2.777ThrLys: 2.777 ± 0.397
6.564ThrLeu: 6.564 ± 1.342
2.02ThrMet: 2.02 ± 0.591
2.525ThrAsn: 2.525 ± 1.207
1.515ThrPro: 1.515 ± 0.527
1.01ThrGln: 1.01 ± 0.327
2.525ThrArg: 2.525 ± 0.251
6.564ThrSer: 6.564 ± 0.844
3.282ThrThr: 3.282 ± 0.536
3.534ThrVal: 3.534 ± 0.442
0.757ThrTrp: 0.757 ± 0.321
2.272ThrTyr: 2.272 ± 0.4
0.0ThrXaa: 0.0 ± 0.0
Val
2.777ValAla: 2.777 ± 0.739
1.515ValCys: 1.515 ± 0.348
3.534ValAsp: 3.534 ± 0.498
3.282ValGlu: 3.282 ± 0.211
2.777ValPhe: 2.777 ± 0.757
2.02ValGly: 2.02 ± 0.092
1.01ValHis: 1.01 ± 0.235
2.777ValIle: 2.777 ± 0.614
3.787ValLys: 3.787 ± 1.681
8.836ValLeu: 8.836 ± 1.144
0.757ValMet: 0.757 ± 0.3
2.525ValAsn: 2.525 ± 0.458
3.03ValPro: 3.03 ± 0.663
2.02ValGln: 2.02 ± 0.745
2.02ValArg: 2.02 ± 0.616
6.059ValSer: 6.059 ± 1.212
3.282ValThr: 3.282 ± 0.948
2.777ValVal: 2.777 ± 0.92
0.0ValTrp: 0.0 ± 0.0
2.777ValTyr: 2.777 ± 0.966
0.0ValXaa: 0.0 ± 0.0
Trp
0.252TrpAla: 0.252 ± 0.15
0.505TrpCys: 0.505 ± 0.301
0.757TrpAsp: 0.757 ± 0.322
0.757TrpGlu: 0.757 ± 0.425
0.505TrpPhe: 0.505 ± 0.193
0.757TrpGly: 0.757 ± 0.333
0.0TrpHis: 0.0 ± 0.0
1.01TrpIle: 1.01 ± 0.412
1.01TrpLys: 1.01 ± 0.574
1.01TrpLeu: 1.01 ± 0.412
0.505TrpMet: 0.505 ± 0.301
1.01TrpAsn: 1.01 ± 0.347
0.252TrpPro: 0.252 ± 0.15
0.0TrpGln: 0.0 ± 0.0
0.252TrpArg: 0.252 ± 0.253
0.505TrpSer: 0.505 ± 0.301
0.505TrpThr: 0.505 ± 0.301
0.505TrpVal: 0.505 ± 0.301
0.252TrpTrp: 0.252 ± 0.15
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.777TyrAla: 2.777 ± 0.845
0.757TyrCys: 0.757 ± 0.237
1.262TyrAsp: 1.262 ± 0.525
0.505TyrGlu: 0.505 ± 0.287
1.767TyrPhe: 1.767 ± 0.851
0.505TyrGly: 0.505 ± 0.301
1.767TyrHis: 1.767 ± 0.517
3.282TyrIle: 3.282 ± 0.44
3.03TyrLys: 3.03 ± 0.417
5.554TyrLeu: 5.554 ± 1.245
0.757TyrMet: 0.757 ± 0.451
1.767TyrAsn: 1.767 ± 0.804
2.02TyrPro: 2.02 ± 0.636
2.02TyrGln: 2.02 ± 0.607
1.515TyrArg: 1.515 ± 0.652
1.515TyrSer: 1.515 ± 0.619
1.01TyrThr: 1.01 ± 0.235
2.272TyrVal: 2.272 ± 0.342
0.252TyrTrp: 0.252 ± 0.15
0.505TyrTyr: 0.505 ± 0.193
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3962 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski