Amino acid dipeptide frequency for Shayang virga-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility, dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
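The counting described above can be sketched in Python. This is a minimal illustration, not the code used to build the table: the function names (`dipeptide_per_mille`, `classify`), the toy sequences, and the choice to count overlapping pairs within each protein and pool them are all assumptions made for this example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X (Xaa)
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Compare a frequency with the uniform expectation (hypothetical helper)."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"    # would be marked red
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"   # would be marked blue
    return "expected"


# Toy input, not the real proteome
freqs = dipeptide_per_mille(["MKVLA", "AAAA"])
print(freqs["AA"], classify(freqs["AA"]))
```

Under this uniform baseline, a table value such as AlaAla (3.905 ‰) would be classified as overrepresented and AlaCys (1.562 ‰) as underrepresented.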

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
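As a short worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla from the table: 3.905 per mille
print(per_mille_to_fraction(3.905))  # about 0.003905
print(per_mille_to_percent(3.905))   # about 0.3905 percent

# The uniform expectation of 0.25 % corresponds to 2.5 per mille
print(per_mille_to_percent(2.5))
```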

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.905AlaAla: 3.905 ± 1.059
1.562AlaCys: 1.562 ± 0.56
3.124AlaAsp: 3.124 ± 1.638
3.385AlaGlu: 3.385 ± 0.805
3.645AlaPhe: 3.645 ± 1.911
1.822AlaGly: 1.822 ± 0.543
1.562AlaHis: 1.562 ± 1.271
2.603AlaIle: 2.603 ± 1.774
2.083AlaLys: 2.083 ± 0.56
6.248AlaLeu: 6.248 ± 0.952
2.343AlaMet: 2.343 ± 1.234
1.822AlaAsn: 1.822 ± 0.543
1.562AlaPro: 1.562 ± 0.412
2.083AlaGln: 2.083 ± 1.36
3.905AlaArg: 3.905 ± 0.529
4.166AlaSer: 4.166 ± 1.072
3.124AlaThr: 3.124 ± 0.685
6.769AlaVal: 6.769 ± 3.437
0.521AlaTrp: 0.521 ± 0.683
1.822AlaTyr: 1.822 ± 0.355
0.0AlaXaa: 0.0 ± 0.0
Cys
1.562CysAla: 1.562 ± 0.819
1.302CysCys: 1.302 ± 0.683
2.603CysAsp: 2.603 ± 0.928
1.562CysGlu: 1.562 ± 0.819
1.041CysPhe: 1.041 ± 0.28
0.521CysGly: 0.521 ± 0.273
0.26CysHis: 0.26 ± 0.436
0.521CysIle: 0.521 ± 0.273
1.302CysLys: 1.302 ± 0.683
2.603CysLeu: 2.603 ± 1.365
0.521CysMet: 0.521 ± 0.273
1.562CysAsn: 1.562 ± 0.436
1.302CysPro: 1.302 ± 0.34
0.26CysGln: 0.26 ± 0.436
1.302CysArg: 1.302 ± 0.34
1.041CysSer: 1.041 ± 0.546
1.041CysThr: 1.041 ± 0.28
1.822CysVal: 1.822 ± 1.446
0.0CysTrp: 0.0 ± 0.0
1.302CysTyr: 1.302 ± 0.683
0.0CysXaa: 0.0 ± 0.0
Asp
4.166AspAla: 4.166 ± 0.271
3.124AspCys: 3.124 ± 1.638
6.769AspAsp: 6.769 ± 1.967
2.083AspGlu: 2.083 ± 0.56
2.864AspPhe: 2.864 ± 0.818
3.385AspGly: 3.385 ± 0.983
2.083AspHis: 2.083 ± 0.347
2.603AspIle: 2.603 ± 0.928
3.905AspLys: 3.905 ± 1.594
5.467AspLeu: 5.467 ± 1.577
1.562AspMet: 1.562 ± 0.436
2.864AspAsn: 2.864 ± 1.06
2.864AspPro: 2.864 ± 0.538
1.822AspGln: 1.822 ± 0.956
3.905AspArg: 3.905 ± 0.205
4.166AspSer: 4.166 ± 1.072
2.343AspThr: 2.343 ± 0.84
9.373AspVal: 9.373 ± 1.873
0.781AspTrp: 0.781 ± 0.28
2.864AspTyr: 2.864 ± 1.085
0.0AspXaa: 0.0 ± 0.0
Glu
1.822GluAla: 1.822 ± 1.065
1.041GluCys: 1.041 ± 0.546
4.947GluAsp: 4.947 ± 2.594
2.864GluGlu: 2.864 ± 0.966
2.603GluPhe: 2.603 ± 1.216
2.343GluGly: 2.343 ± 0.902
2.083GluHis: 2.083 ± 0.808
3.645GluIle: 3.645 ± 0.219
4.426GluLys: 4.426 ± 1.946
4.426GluLeu: 4.426 ± 1.47
2.864GluMet: 2.864 ± 2.412
2.083GluAsn: 2.083 ± 0.56
2.343GluPro: 2.343 ± 1.034
1.562GluGln: 1.562 ± 0.56
2.864GluArg: 2.864 ± 0.771
3.385GluSer: 3.385 ± 0.898
2.083GluThr: 2.083 ± 0.672
5.728GluVal: 5.728 ± 1.481
0.26GluTrp: 0.26 ± 0.137
2.343GluTyr: 2.343 ± 0.389
0.0GluXaa: 0.0 ± 0.0
Phe
4.166PheAla: 4.166 ± 3.081
1.562PheCys: 1.562 ± 0.56
3.124PheAsp: 3.124 ± 1.192
4.947PheGlu: 4.947 ± 0.94
2.343PhePhe: 2.343 ± 1.841
3.124PheGly: 3.124 ± 0.135
0.781PheHis: 0.781 ± 0.77
2.864PheIle: 2.864 ± 0.571
3.124PheLys: 3.124 ± 0.873
6.509PheLeu: 6.509 ± 1.81
0.26PheMet: 0.26 ± 0.126
3.645PheAsn: 3.645 ± 0.738
2.083PhePro: 2.083 ± 0.931
1.822PheGln: 1.822 ± 0.355
2.603PheArg: 2.603 ± 1.7
5.467PheSer: 5.467 ± 1.853
3.124PheThr: 3.124 ± 1.192
6.248PheVal: 6.248 ± 2.424
0.521PheTrp: 0.521 ± 0.846
3.124PheTyr: 3.124 ± 1.656
0.0PheXaa: 0.0 ± 0.0
Gly
1.041GlyAla: 1.041 ± 0.546
0.781GlyCys: 0.781 ± 0.28
3.124GlyAsp: 3.124 ± 1.638
2.343GlyGlu: 2.343 ± 0.389
3.645GlyPhe: 3.645 ± 1.603
3.124GlyGly: 3.124 ± 0.685
0.26GlyHis: 0.26 ± 0.137
1.041GlyIle: 1.041 ± 0.607
2.864GlyLys: 2.864 ± 0.771
3.645GlyLeu: 3.645 ± 1.83
1.041GlyMet: 1.041 ± 0.607
3.124GlyAsn: 3.124 ± 1.638
1.041GlyPro: 1.041 ± 1.365
1.041GlyGln: 1.041 ± 1.365
4.426GlyArg: 4.426 ± 0.377
1.822GlySer: 1.822 ± 0.956
1.302GlyThr: 1.302 ± 0.626
2.864GlyVal: 2.864 ± 0.538
0.781GlyTrp: 0.781 ± 0.41
2.343GlyTyr: 2.343 ± 0.799
0.0GlyXaa: 0.0 ± 0.0
His
0.781HisAla: 0.781 ± 0.636
1.302HisCys: 1.302 ± 0.34
0.521HisAsp: 0.521 ± 0.683
1.562HisGlu: 1.562 ± 1.246
1.822HisPhe: 1.822 ± 0.355
1.041HisGly: 1.041 ± 0.981
0.781HisHis: 0.781 ± 1.43
0.781HisIle: 0.781 ± 0.41
1.041HisLys: 1.041 ± 0.616
2.603HisLeu: 2.603 ± 1.756
0.521HisMet: 0.521 ± 0.273
0.26HisAsn: 0.26 ± 0.137
0.781HisPro: 0.781 ± 1.308
0.26HisGln: 0.26 ± 0.137
1.562HisArg: 1.562 ± 0.664
1.822HisSer: 1.822 ± 0.543
0.521HisThr: 0.521 ± 0.34
2.603HisVal: 2.603 ± 0.818
0.0HisTrp: 0.0 ± 0.0
2.083HisTyr: 2.083 ± 0.931
0.0HisXaa: 0.0 ± 0.0
Ile
3.645IleAla: 3.645 ± 1.459
1.041IleCys: 1.041 ± 0.28
2.083IleAsp: 2.083 ± 0.883
3.645IleGlu: 3.645 ± 0.945
3.385IlePhe: 3.385 ± 0.302
0.781IleGly: 0.781 ± 0.28
2.083IleHis: 2.083 ± 0.883
2.603IleIle: 2.603 ± 0.667
1.041IleLys: 1.041 ± 0.28
6.509IleLeu: 6.509 ± 4.874
2.083IleMet: 2.083 ± 0.347
2.083IleAsn: 2.083 ± 0.808
2.083IlePro: 2.083 ± 0.56
1.822IleGln: 1.822 ± 0.573
1.562IleArg: 1.562 ± 0.819
3.124IleSer: 3.124 ± 0.873
2.343IleThr: 2.343 ± 1.405
2.083IleVal: 2.083 ± 0.808
0.26IleTrp: 0.26 ± 0.137
2.083IleTyr: 2.083 ± 0.347
0.0IleXaa: 0.0 ± 0.0
Lys
2.083LysAla: 2.083 ± 0.808
0.521LysCys: 0.521 ± 0.273
2.864LysAsp: 2.864 ± 1.283
3.385LysGlu: 3.385 ± 1.325
6.248LysPhe: 6.248 ± 1.919
1.302LysGly: 1.302 ± 0.34
0.781LysHis: 0.781 ± 0.28
3.385LysIle: 3.385 ± 0.983
2.343LysLys: 2.343 ± 0.902
6.509LysLeu: 6.509 ± 0.17
1.041LysMet: 1.041 ± 0.546
2.343LysAsn: 2.343 ± 0.608
1.822LysPro: 1.822 ± 1.164
0.781LysGln: 0.781 ± 0.41
2.343LysArg: 2.343 ± 1.229
3.124LysSer: 3.124 ± 1.192
2.343LysThr: 2.343 ± 0.799
4.166LysVal: 4.166 ± 1.072
0.521LysTrp: 0.521 ± 0.872
3.645LysTyr: 3.645 ± 0.931
0.0LysXaa: 0.0 ± 0.0
Leu
5.207LeuAla: 5.207 ± 1.915
1.822LeuCys: 1.822 ± 0.543
5.207LeuAsp: 5.207 ± 2.431
6.509LeuGlu: 6.509 ± 1.067
4.686LeuPhe: 4.686 ± 2.673
3.645LeuGly: 3.645 ± 0.738
2.603LeuHis: 2.603 ± 2.052
4.686LeuIle: 4.686 ± 0.895
5.467LeuLys: 5.467 ± 0.511
10.154LeuLeu: 10.154 ± 4.42
4.947LeuMet: 4.947 ± 0.441
2.864LeuAsn: 2.864 ± 0.075
3.124LeuPro: 3.124 ± 0.824
5.467LeuGln: 5.467 ± 3.389
7.55LeuArg: 7.55 ± 1.092
8.331LeuSer: 8.331 ± 0.297
4.166LeuThr: 4.166 ± 0.556
9.893LeuVal: 9.893 ± 0.471
0.781LeuTrp: 0.781 ± 0.41
3.385LeuTyr: 3.385 ± 1.095
0.0LeuXaa: 0.0 ± 0.0
Met
3.124MetAla: 3.124 ± 1.666
0.26MetCys: 0.26 ± 0.137
1.302MetAsp: 1.302 ± 0.683
1.562MetGlu: 1.562 ± 0.436
1.302MetPhe: 1.302 ± 0.608
1.822MetGly: 1.822 ± 0.55
0.521MetHis: 0.521 ± 0.273
1.822MetIle: 1.822 ± 2.097
2.083MetLys: 2.083 ± 0.672
4.426MetLeu: 4.426 ± 1.316
1.822MetMet: 1.822 ± 0.573
1.041MetAsn: 1.041 ± 1.365
0.26MetPro: 0.26 ± 0.137
1.302MetGln: 1.302 ± 0.5
1.562MetArg: 1.562 ± 0.56
1.822MetSer: 1.822 ± 0.727
1.562MetThr: 1.562 ± 0.56
2.083MetVal: 2.083 ± 1.231
1.041MetTrp: 1.041 ± 0.28
0.26MetTyr: 0.26 ± 0.137
0.0MetXaa: 0.0 ± 0.0
Asn
3.645AsnAla: 3.645 ± 1.1
0.781AsnCys: 0.781 ± 0.41
4.426AsnAsp: 4.426 ± 0.8
1.562AsnGlu: 1.562 ± 0.709
2.603AsnPhe: 2.603 ± 0.469
1.562AsnGly: 1.562 ± 0.819
0.521AsnHis: 0.521 ± 0.273
1.562AsnIle: 1.562 ± 0.709
1.302AsnLys: 1.302 ± 0.34
3.385AsnLeu: 3.385 ± 1.755
0.521AsnMet: 0.521 ± 0.34
2.083AsnAsn: 2.083 ± 0.672
0.781AsnPro: 0.781 ± 0.41
0.521AsnGln: 0.521 ± 1.504
2.864AsnArg: 2.864 ± 1.502
1.562AsnSer: 1.562 ± 0.664
1.822AsnThr: 1.822 ± 0.55
4.426AsnVal: 4.426 ± 1.506
0.0AsnTrp: 0.0 ± 0.0
1.562AsnTyr: 1.562 ± 0.819
0.0AsnXaa: 0.0 ± 0.0
Pro
2.083ProAla: 2.083 ± 1.536
0.26ProCys: 0.26 ± 0.137
2.603ProAsp: 2.603 ± 0.991
1.562ProGlu: 1.562 ± 1.447
2.864ProPhe: 2.864 ± 0.818
1.562ProGly: 1.562 ± 0.56
0.26ProHis: 0.26 ± 0.436
2.083ProIle: 2.083 ± 1.36
2.603ProLys: 2.603 ± 0.469
2.864ProLeu: 2.864 ± 0.571
0.521ProMet: 0.521 ± 0.273
0.781ProAsn: 0.781 ± 0.724
1.822ProPro: 1.822 ± 1.993
0.521ProGln: 0.521 ± 0.683
2.343ProArg: 2.343 ± 0.608
2.343ProSer: 2.343 ± 1.104
1.562ProThr: 1.562 ± 0.56
4.166ProVal: 4.166 ± 2.185
0.26ProTrp: 0.26 ± 0.137
0.781ProTyr: 0.781 ± 0.41
0.0ProXaa: 0.0 ± 0.0
Gln
1.041GlnAla: 1.041 ± 0.616
0.521GlnCys: 0.521 ± 0.273
1.302GlnAsp: 1.302 ± 0.34
1.562GlnGlu: 1.562 ± 1.246
1.822GlnPhe: 1.822 ± 0.944
1.041GlnGly: 1.041 ± 0.546
0.781GlnHis: 0.781 ± 0.77
1.562GlnIle: 1.562 ± 0.56
1.302GlnLys: 1.302 ± 0.34
4.426GlnLeu: 4.426 ± 1.36
0.26GlnMet: 0.26 ± 0.436
0.781GlnAsn: 0.781 ± 0.41
1.041GlnPro: 1.041 ± 0.28
1.822GlnGln: 1.822 ± 0.355
3.385GlnArg: 3.385 ± 1.576
3.385GlnSer: 3.385 ± 2.465
1.302GlnThr: 1.302 ± 1.312
3.385GlnVal: 3.385 ± 1.992
0.26GlnTrp: 0.26 ± 0.137
1.302GlnTyr: 1.302 ± 0.34
0.0GlnXaa: 0.0 ± 0.0
Arg
3.905ArgAla: 3.905 ± 1.021
2.603ArgCys: 2.603 ± 0.681
3.385ArgAsp: 3.385 ± 0.805
4.166ArgGlu: 4.166 ± 0.664
3.905ArgPhe: 3.905 ± 1.095
2.083ArgGly: 2.083 ± 1.092
1.822ArgHis: 1.822 ± 0.573
2.864ArgIle: 2.864 ± 1.06
2.864ArgLys: 2.864 ± 0.907
6.769ArgLeu: 6.769 ± 1.336
1.822ArgMet: 1.822 ± 0.543
2.603ArgAsn: 2.603 ± 0.991
0.26ArgPro: 0.26 ± 0.137
2.603ArgGln: 2.603 ± 0.928
5.988ArgArg: 5.988 ± 1.156
6.248ArgSer: 6.248 ± 0.28
3.645ArgThr: 3.645 ± 0.394
6.248ArgVal: 6.248 ± 2.384
0.781ArgTrp: 0.781 ± 0.28
2.864ArgTyr: 2.864 ± 0.538
0.0ArgXaa: 0.0 ± 0.0
Ser
2.343SerAla: 2.343 ± 1.104
1.041SerCys: 1.041 ± 0.546
3.905SerAsp: 3.905 ± 1.095
3.905SerGlu: 3.905 ± 0.529
4.426SerPhe: 4.426 ± 0.978
3.385SerGly: 3.385 ± 1.847
1.562SerHis: 1.562 ± 0.56
3.385SerIle: 3.385 ± 0.261
4.426SerLys: 4.426 ± 1.205
6.248SerLeu: 6.248 ± 0.28
2.083SerMet: 2.083 ± 0.438
2.083SerAsn: 2.083 ± 1.092
3.385SerPro: 3.385 ± 0.847
2.603SerGln: 2.603 ± 0.681
7.29SerArg: 7.29 ± 1.424
5.988SerSer: 5.988 ± 0.705
3.124SerThr: 3.124 ± 1.513
6.769SerVal: 6.769 ± 1.611
0.0SerTrp: 0.0 ± 0.0
2.603SerTyr: 2.603 ± 0.818
0.0SerXaa: 0.0 ± 0.0
Thr
3.645ThrAla: 3.645 ± 1.459
0.521ThrCys: 0.521 ± 0.273
3.124ThrAsp: 3.124 ± 0.414
1.822ThrGlu: 1.822 ± 0.727
4.947ThrPhe: 4.947 ± 1.304
2.083ThrGly: 2.083 ± 0.347
0.521ThrHis: 0.521 ± 0.34
3.124ThrIle: 3.124 ± 0.873
0.781ThrLys: 0.781 ± 0.41
3.645ThrLeu: 3.645 ± 1.086
1.302ThrMet: 1.302 ± 0.608
1.562ThrAsn: 1.562 ± 0.56
1.562ThrPro: 1.562 ± 1.447
2.083ThrGln: 2.083 ± 0.56
3.124ThrArg: 3.124 ± 0.873
2.864ThrSer: 2.864 ± 0.075
3.645ThrThr: 3.645 ± 2.328
4.166ThrVal: 4.166 ± 0.556
0.781ThrTrp: 0.781 ± 0.77
1.822ThrTyr: 1.822 ± 0.55
0.0ThrXaa: 0.0 ± 0.0
Val
6.248ValAla: 6.248 ± 0.27
2.083ValCys: 2.083 ± 0.672
10.414ValAsp: 10.414 ± 1.923
6.248ValGlu: 6.248 ± 2.384
4.686ValPhe: 4.686 ± 2.593
4.686ValGly: 4.686 ± 0.431
1.822ValHis: 1.822 ± 0.573
2.603ValIle: 2.603 ± 1.001
5.467ValLys: 5.467 ± 1.066
8.852ValLeu: 8.852 ± 3.457
4.686ValMet: 4.686 ± 1.993
2.083ValAsn: 2.083 ± 0.883
3.905ValPro: 3.905 ± 0.689
2.083ValGln: 2.083 ± 0.438
5.207ValArg: 5.207 ± 1.109
5.207ValSer: 5.207 ± 0.756
6.248ValThr: 6.248 ± 0.926
11.195ValVal: 11.195 ± 0.614
0.26ValTrp: 0.26 ± 0.436
4.686ValTyr: 4.686 ± 2.457
0.0ValXaa: 0.0 ± 0.0
Trp
0.26TrpAla: 0.26 ± 0.137
0.781TrpCys: 0.781 ± 0.41
0.26TrpAsp: 0.26 ± 0.137
0.26TrpGlu: 0.26 ± 0.137
0.781TrpPhe: 0.781 ± 0.636
0.781TrpGly: 0.781 ± 0.636
0.26TrpHis: 0.26 ± 0.137
0.26TrpIle: 0.26 ± 0.436
0.781TrpLys: 0.781 ± 0.28
0.781TrpLeu: 0.781 ± 0.28
0.26TrpMet: 0.26 ± 0.137
0.521TrpAsn: 0.521 ± 0.872
0.0TrpPro: 0.0 ± 0.0
0.521TrpGln: 0.521 ± 0.34
0.26TrpArg: 0.26 ± 0.436
0.26TrpSer: 0.26 ± 0.137
0.26TrpThr: 0.26 ± 0.436
0.781TrpVal: 0.781 ± 0.28
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.385TyrAla: 3.385 ± 1.325
0.521TyrCys: 0.521 ± 0.273
3.905TyrAsp: 3.905 ± 1.174
1.041TyrGlu: 1.041 ± 0.546
1.822TyrPhe: 1.822 ± 0.55
1.562TyrGly: 1.562 ± 0.664
1.302TyrHis: 1.302 ± 0.5
2.343TyrIle: 2.343 ± 0.798
2.343TyrLys: 2.343 ± 0.389
4.426TyrLeu: 4.426 ± 0.724
0.26TyrMet: 0.26 ± 0.436
1.302TyrAsn: 1.302 ± 1.312
1.822TyrPro: 1.822 ± 0.55
1.302TyrGln: 1.302 ± 0.608
3.385TyrArg: 3.385 ± 0.884
4.426TyrSer: 4.426 ± 1.205
1.562TyrThr: 1.562 ± 0.819
3.905TyrVal: 3.905 ± 0.529
0.26TyrTrp: 0.26 ± 0.137
2.343TyrTyr: 2.343 ± 0.799
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3842 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
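A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread is summarised as a standard deviation. The function names, the fixed seed, the toy sequences, and the use of the population standard deviation are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, pooled over the proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy input, not the real proteome
toy = ["AAAA", "MKVL", "AAKK"]
print(dipeptide_freq(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 3 proteins, as here, each resample is drawn from very few distinct units, so the estimated errors can be large relative to the frequencies. A dipeptide that never occurs yields 0.0 in every resample, which matches the `0.0 ± 0.0` entries in the table.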

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski