Amino acid dipeptide frequency for Siegesbeckia yellow vein Guangxi virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
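The counting described above can be sketched in a few lines of Python. This is only a minimal illustration, not the code used to build this page: the function name `dipeptide_frequencies`, the one-letter alphabet, the folding of non-standard residues into `X`, and the toy sequences are all assumptions. In particular, the page does not state how protein boundaries are handled; the sketch counts pairs within each protein only.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumption: pairs are counted within each protein, never across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not the actual viral proteins.
toy = ["MSSTAA", "AASS"]
freqs = dipeptide_frequencies(toy)

# Uniform expectation for 20 residues: 1/400 = 0.25% = 2.5 per mille.
expected = 1000.0 / len(STANDARD) ** 2
```

Here `freqs` maps each observed pair (for example `"SS"`) to its frequency in per mille, which can then be compared against `expected`.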

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
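As a small worked example, the conversion from per mille and the comparison with the uniform expectation might look as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the rule shown (a plain comparison with 2.5‰) is an assumption; the page does not state the exact threshold used for the red and blue marking.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value (as in the table) to a plain fraction."""
    return value * 1e-3

def classify(value_per_mille, n_residues=20):
    """Label a dipeptide relative to the uniform expectation 1 / n_residues**2.

    Assumption: "over" corresponds to red and "under" to blue in the table.
    """
    expected = 1000.0 / n_residues ** 2  # 2.5 per mille for 20 residues
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"

# Values taken from the table: AlaAla (6.306) and AlaGlu (0.901).
ala_ala_fraction = per_mille_to_fraction(6.306)
ala_ala_label = classify(6.306)
ala_glu_label = classify(0.901)
```

With these inputs, AlaAla is labelled as overrepresented and AlaGlu as underrepresented relative to the uniform model.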

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.306AlaAla: 6.306 ± 2.362
0.0AlaCys: 0.0 ± 0.0
1.802AlaAsp: 1.802 ± 0.661
0.901AlaGlu: 0.901 ± 0.646
1.802AlaPhe: 1.802 ± 1.195
0.901AlaGly: 0.901 ± 1.111
2.703AlaHis: 2.703 ± 1.244
4.505AlaIle: 4.505 ± 1.053
2.703AlaLys: 2.703 ± 0.915
4.505AlaLeu: 4.505 ± 2.292
0.901AlaMet: 0.901 ± 0.706
2.703AlaAsn: 2.703 ± 1.255
2.703AlaPro: 2.703 ± 0.816
5.405AlaGln: 5.405 ± 1.581
5.405AlaArg: 5.405 ± 2.971
5.405AlaSer: 5.405 ± 2.411
3.604AlaThr: 3.604 ± 1.212
0.901AlaVal: 0.901 ± 0.904
1.802AlaTrp: 1.802 ± 1.291
0.901AlaTyr: 0.901 ± 0.646
0.0AlaXaa: 0.0 ± 0.0
Cys
0.901CysAla: 0.901 ± 1.111
1.802CysCys: 1.802 ± 1.586
0.0CysAsp: 0.0 ± 0.0
0.901CysGlu: 0.901 ± 0.706
0.901CysPhe: 0.901 ± 0.904
1.802CysGly: 1.802 ± 1.059
0.0CysHis: 0.0 ± 0.0
1.802CysIle: 1.802 ± 1.183
0.901CysLys: 0.901 ± 0.706
0.0CysLeu: 0.0 ± 0.0
0.901CysMet: 0.901 ± 0.793
0.901CysAsn: 0.901 ± 0.646
1.802CysPro: 1.802 ± 1.586
1.802CysGln: 1.802 ± 1.291
1.802CysArg: 1.802 ± 0.928
2.703CysSer: 2.703 ± 1.244
0.901CysThr: 0.901 ± 1.111
0.901CysVal: 0.901 ± 0.706
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.703AspAla: 2.703 ± 1.937
0.901AspCys: 0.901 ± 1.111
0.901AspAsp: 0.901 ± 0.646
2.703AspGlu: 2.703 ± 1.099
1.802AspPhe: 1.802 ± 1.059
3.604AspGly: 3.604 ± 1.843
1.802AspHis: 1.802 ± 1.808
2.703AspIle: 2.703 ± 0.878
0.0AspLys: 0.0 ± 0.0
4.505AspLeu: 4.505 ± 1.053
1.802AspMet: 1.802 ± 1.046
3.604AspAsn: 3.604 ± 0.976
1.802AspPro: 1.802 ± 0.928
4.505AspGln: 4.505 ± 2.149
3.604AspArg: 3.604 ± 1.321
7.207AspSer: 7.207 ± 1.504
1.802AspThr: 1.802 ± 1.358
5.405AspVal: 5.405 ± 1.702
0.901AspTrp: 0.901 ± 0.646
1.802AspTyr: 1.802 ± 1.358
0.0AspXaa: 0.0 ± 0.0
Glu
3.604GluAla: 3.604 ± 1.119
0.0GluCys: 0.0 ± 0.0
2.703GluAsp: 2.703 ± 1.485
5.405GluGlu: 5.405 ± 3.194
3.604GluPhe: 3.604 ± 1.312
3.604GluGly: 3.604 ± 1.677
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
3.604GluLys: 3.604 ± 1.677
5.405GluLeu: 5.405 ± 1.838
0.0GluMet: 0.0 ± 0.0
4.505GluAsn: 4.505 ± 1.806
0.901GluPro: 0.901 ± 0.706
1.802GluGln: 1.802 ± 1.412
0.0GluArg: 0.0 ± 0.0
0.901GluSer: 0.901 ± 1.022
2.703GluThr: 2.703 ± 1.229
4.505GluVal: 4.505 ± 1.787
1.802GluTrp: 1.802 ± 1.059
0.901GluTyr: 0.901 ± 0.646
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.901PheCys: 0.901 ± 0.706
2.703PheAsp: 2.703 ± 1.099
1.802PheGlu: 1.802 ± 0.89
0.901PhePhe: 0.901 ± 0.646
0.901PheGly: 0.901 ± 0.706
3.604PheHis: 3.604 ± 1.331
1.802PheIle: 1.802 ± 1.291
3.604PheLys: 3.604 ± 1.78
5.405PheLeu: 5.405 ± 1.891
0.901PheMet: 0.901 ± 0.646
3.604PheAsn: 3.604 ± 2.781
0.901PhePro: 0.901 ± 0.793
1.802PheGln: 1.802 ± 1.059
3.604PheArg: 3.604 ± 1.499
1.802PheSer: 1.802 ± 2.044
2.703PheThr: 2.703 ± 0.915
1.802PheVal: 1.802 ± 1.291
0.901PheTrp: 0.901 ± 0.706
1.802PheTyr: 1.802 ± 1.412
0.0PheXaa: 0.0 ± 0.0
Gly
1.802GlyAla: 1.802 ± 1.291
2.703GlyCys: 2.703 ± 0.915
3.604GlyAsp: 3.604 ± 1.843
1.802GlyGlu: 1.802 ± 1.129
2.703GlyPhe: 2.703 ± 1.351
2.703GlyGly: 2.703 ± 1.099
1.802GlyHis: 1.802 ± 0.928
2.703GlyIle: 2.703 ± 1.13
4.505GlyLys: 4.505 ± 1.695
2.703GlyLeu: 2.703 ± 1.293
0.0GlyMet: 0.0 ± 0.0
0.901GlyAsn: 0.901 ± 0.706
5.405GlyPro: 5.405 ± 1.635
0.901GlyGln: 0.901 ± 0.706
0.901GlyArg: 0.901 ± 0.646
1.802GlySer: 1.802 ± 1.059
4.505GlyThr: 4.505 ± 1.849
2.703GlyVal: 2.703 ± 1.674
0.901GlyTrp: 0.901 ± 0.706
0.901GlyTyr: 0.901 ± 0.793
0.0GlyXaa: 0.0 ± 0.0
His
1.802HisAla: 1.802 ± 0.955
2.703HisCys: 2.703 ± 2.072
0.901HisAsp: 0.901 ± 0.904
0.901HisGlu: 0.901 ± 0.646
2.703HisPhe: 2.703 ± 1.388
0.901HisGly: 0.901 ± 0.793
1.802HisHis: 1.802 ± 1.195
1.802HisIle: 1.802 ± 1.1
2.703HisLys: 2.703 ± 1.535
2.703HisLeu: 2.703 ± 1.265
0.901HisMet: 0.901 ± 1.022
1.802HisAsn: 1.802 ± 1.291
1.802HisPro: 1.802 ± 0.89
0.901HisGln: 0.901 ± 0.706
1.802HisArg: 1.802 ± 1.195
1.802HisSer: 1.802 ± 1.358
2.703HisThr: 2.703 ± 2.118
2.703HisVal: 2.703 ± 1.266
0.0HisTrp: 0.0 ± 0.0
0.901HisTyr: 0.901 ± 0.646
0.0HisXaa: 0.0 ± 0.0
Ile
1.802IleAla: 1.802 ± 1.059
0.901IleCys: 0.901 ± 0.646
1.802IleAsp: 1.802 ± 1.059
1.802IleGlu: 1.802 ± 1.291
4.505IlePhe: 4.505 ± 1.297
0.901IleGly: 0.901 ± 1.111
1.802IleHis: 1.802 ± 1.353
0.0IleIle: 0.0 ± 0.0
7.207IleLys: 7.207 ± 1.284
4.505IleLeu: 4.505 ± 2.56
2.703IleMet: 2.703 ± 1.375
2.703IleAsn: 2.703 ± 1.265
0.901IlePro: 0.901 ± 0.646
6.306IleGln: 6.306 ± 2.291
5.405IleArg: 5.405 ± 2.147
3.604IleSer: 3.604 ± 1.159
3.604IleThr: 3.604 ± 2.539
2.703IleVal: 2.703 ± 1.205
2.703IleTrp: 2.703 ± 1.449
5.405IleTyr: 5.405 ± 3.003
0.0IleXaa: 0.0 ± 0.0
Lys
4.505LysAla: 4.505 ± 1.992
1.802LysCys: 1.802 ± 0.89
3.604LysAsp: 3.604 ± 1.331
4.505LysGlu: 4.505 ± 1.192
3.604LysPhe: 3.604 ± 1.049
0.901LysGly: 0.901 ± 0.646
0.901LysHis: 0.901 ± 0.646
4.505LysIle: 4.505 ± 2.298
6.306LysLys: 6.306 ± 2.939
2.703LysLeu: 2.703 ± 1.228
0.0LysMet: 0.0 ± 0.0
5.405LysAsn: 5.405 ± 2.411
2.703LysPro: 2.703 ± 1.042
0.901LysGln: 0.901 ± 0.793
0.901LysArg: 0.901 ± 0.706
3.604LysSer: 3.604 ± 2.1
4.505LysThr: 4.505 ± 1.192
4.505LysVal: 4.505 ± 2.029
0.0LysTrp: 0.0 ± 0.0
2.703LysTyr: 2.703 ± 0.816
0.0LysXaa: 0.0 ± 0.0
Leu
1.802LeuAla: 1.802 ± 0.928
2.703LeuCys: 2.703 ± 1.099
7.207LeuAsp: 7.207 ± 3.133
1.802LeuGlu: 1.802 ± 1.1
0.901LeuPhe: 0.901 ± 0.646
3.604LeuGly: 3.604 ± 1.753
2.703LeuHis: 2.703 ± 1.265
6.306LeuIle: 6.306 ± 2.167
4.505LeuLys: 4.505 ± 1.067
3.604LeuLeu: 3.604 ± 1.676
0.0LeuMet: 0.0 ± 0.0
4.505LeuAsn: 4.505 ± 1.635
2.703LeuPro: 2.703 ± 1.229
6.306LeuGln: 6.306 ± 1.553
6.306LeuArg: 6.306 ± 2.078
5.405LeuSer: 5.405 ± 1.625
5.405LeuThr: 5.405 ± 1.401
1.802LeuVal: 1.802 ± 1.412
0.0LeuTrp: 0.0 ± 0.0
3.604LeuTyr: 3.604 ± 1.655
0.0LeuXaa: 0.0 ± 0.0
Met
2.703MetAla: 2.703 ± 1.619
0.0MetCys: 0.0 ± 0.0
1.802MetAsp: 1.802 ± 1.129
3.604MetGlu: 3.604 ± 2.199
1.802MetPhe: 1.802 ± 1.412
2.703MetGly: 2.703 ± 1.042
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.802MetLeu: 1.802 ± 1.358
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.802MetPro: 1.802 ± 1.1
1.802MetGln: 1.802 ± 1.353
0.901MetArg: 0.901 ± 0.706
0.0MetSer: 0.0 ± 0.0
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
1.802MetTrp: 1.802 ± 0.928
2.703MetTyr: 2.703 ± 1.652
0.0MetXaa: 0.0 ± 0.0
Asn
3.604AsnAla: 3.604 ± 1.266
0.901AsnCys: 0.901 ± 0.793
3.604AsnAsp: 3.604 ± 1.321
1.802AsnGlu: 1.802 ± 0.955
0.901AsnPhe: 0.901 ± 0.706
3.604AsnGly: 3.604 ± 1.04
3.604AsnHis: 3.604 ± 2.824
5.405AsnIle: 5.405 ± 1.51
2.703AsnLys: 2.703 ± 1.674
5.405AsnLeu: 5.405 ± 1.87
1.802AsnMet: 1.802 ± 1.368
3.604AsnAsn: 3.604 ± 0.976
3.604AsnPro: 3.604 ± 0.976
0.901AsnGln: 0.901 ± 0.646
3.604AsnArg: 3.604 ± 1.04
4.505AsnSer: 4.505 ± 1.83
2.703AsnThr: 2.703 ± 0.816
1.802AsnVal: 1.802 ± 0.89
0.0AsnTrp: 0.0 ± 0.0
2.703AsnTyr: 2.703 ± 1.265
0.0AsnXaa: 0.0 ± 0.0
Pro
2.703ProAla: 2.703 ± 1.205
0.901ProCys: 0.901 ± 0.706
3.604ProAsp: 3.604 ± 1.025
2.703ProGlu: 2.703 ± 1.927
1.802ProPhe: 1.802 ± 0.89
0.901ProGly: 0.901 ± 0.646
3.604ProHis: 3.604 ± 2.582
3.604ProIle: 3.604 ± 2.1
4.505ProLys: 4.505 ± 1.992
4.505ProLeu: 4.505 ± 1.789
1.802ProMet: 1.802 ± 0.661
3.604ProAsn: 3.604 ± 1.331
1.802ProPro: 1.802 ± 1.1
4.505ProGln: 4.505 ± 2.117
3.604ProArg: 3.604 ± 1.743
4.505ProSer: 4.505 ± 2.3
3.604ProThr: 3.604 ± 1.677
3.604ProVal: 3.604 ± 1.245
0.901ProTrp: 0.901 ± 0.646
1.802ProTyr: 1.802 ± 1.412
0.0ProXaa: 0.0 ± 0.0
Gln
3.604GlnAla: 3.604 ± 1.771
0.901GlnCys: 0.901 ± 0.793
3.604GlnAsp: 3.604 ± 2.056
2.703GlnGlu: 2.703 ± 1.099
2.703GlnPhe: 2.703 ± 1.265
1.802GlnGly: 1.802 ± 1.1
1.802GlnHis: 1.802 ± 1.393
2.703GlnIle: 2.703 ± 1.265
0.0GlnLys: 0.0 ± 0.0
2.703GlnLeu: 2.703 ± 1.569
0.901GlnMet: 0.901 ± 1.111
3.604GlnAsn: 3.604 ± 1.467
3.604GlnPro: 3.604 ± 2.717
4.505GlnGln: 4.505 ± 2.754
2.703GlnArg: 2.703 ± 0.915
6.306GlnSer: 6.306 ± 1.4
5.405GlnThr: 5.405 ± 3.499
4.505GlnVal: 4.505 ± 1.028
0.0GlnTrp: 0.0 ± 0.0
1.802GlnTyr: 1.802 ± 1.129
0.0GlnXaa: 0.0 ± 0.0
Arg
0.901ArgAla: 0.901 ± 0.904
0.901ArgCys: 0.901 ± 0.793
2.703ArgAsp: 2.703 ± 1.205
3.604ArgGlu: 3.604 ± 1.524
3.604ArgPhe: 3.604 ± 1.321
4.505ArgGly: 4.505 ± 1.806
0.0ArgHis: 0.0 ± 0.0
6.306ArgIle: 6.306 ± 0.82
0.901ArgLys: 0.901 ± 0.706
3.604ArgLeu: 3.604 ± 1.686
0.901ArgMet: 0.901 ± 0.706
0.901ArgAsn: 0.901 ± 0.646
7.207ArgPro: 7.207 ± 1.353
3.604ArgGln: 3.604 ± 2.25
7.207ArgArg: 7.207 ± 4.36
6.306ArgSer: 6.306 ± 1.85
7.207ArgThr: 7.207 ± 2.454
1.802ArgVal: 1.802 ± 1.353
0.0ArgTrp: 0.0 ± 0.0
0.901ArgTyr: 0.901 ± 0.793
0.0ArgXaa: 0.0 ± 0.0
Ser
6.306SerAla: 6.306 ± 2.61
0.901SerCys: 0.901 ± 0.793
4.505SerAsp: 4.505 ± 1.975
1.802SerGlu: 1.802 ± 0.928
0.901SerPhe: 0.901 ± 0.646
3.604SerGly: 3.604 ± 1.248
1.802SerHis: 1.802 ± 1.586
4.505SerIle: 4.505 ± 2.03
6.306SerLys: 6.306 ± 2.884
4.505SerLeu: 4.505 ± 2.086
0.901SerMet: 0.901 ± 1.022
5.405SerAsn: 5.405 ± 2.278
7.207SerPro: 7.207 ± 3.017
2.703SerGln: 2.703 ± 1.358
4.505SerArg: 4.505 ± 1.672
9.91SerSer: 9.91 ± 3.464
9.009SerThr: 9.009 ± 2.892
0.901SerVal: 0.901 ± 0.706
0.0SerTrp: 0.0 ± 0.0
1.802SerTyr: 1.802 ± 0.661
0.0SerXaa: 0.0 ± 0.0
Thr
3.604ThrAla: 3.604 ± 0.976
1.802ThrCys: 1.802 ± 1.237
2.703ThrAsp: 2.703 ± 2.201
3.604ThrGlu: 3.604 ± 1.072
1.802ThrPhe: 1.802 ± 1.1
5.405ThrGly: 5.405 ± 1.853
4.505ThrHis: 4.505 ± 2.407
3.604ThrIle: 3.604 ± 0.976
3.604ThrLys: 3.604 ± 1.119
2.703ThrLeu: 2.703 ± 1.042
1.802ThrMet: 1.802 ± 1.018
3.604ThrAsn: 3.604 ± 1.321
7.207ThrPro: 7.207 ± 1.442
2.703ThrGln: 2.703 ± 2.04
2.703ThrArg: 2.703 ± 1.409
4.505ThrSer: 4.505 ± 1.884
2.703ThrThr: 2.703 ± 2.095
6.306ThrVal: 6.306 ± 2.511
0.901ThrTrp: 0.901 ± 0.904
4.505ThrTyr: 4.505 ± 1.255
0.0ThrXaa: 0.0 ± 0.0
Val
0.901ValAla: 0.901 ± 1.022
0.0ValCys: 0.0 ± 0.0
2.703ValAsp: 2.703 ± 1.265
1.802ValGlu: 1.802 ± 1.586
1.802ValPhe: 1.802 ± 0.89
0.901ValGly: 0.901 ± 0.793
0.901ValHis: 0.901 ± 0.793
5.405ValIle: 5.405 ± 1.756
2.703ValLys: 2.703 ± 0.878
2.703ValLeu: 2.703 ± 0.878
1.802ValMet: 1.802 ± 1.129
0.901ValAsn: 0.901 ± 0.706
3.604ValPro: 3.604 ± 0.976
3.604ValGln: 3.604 ± 1.674
2.703ValArg: 2.703 ± 1.205
4.505ValSer: 4.505 ± 1.635
6.306ValThr: 6.306 ± 3.423
0.901ValVal: 0.901 ± 0.793
0.901ValTrp: 0.901 ± 0.904
5.405ValTyr: 5.405 ± 1.695
0.0ValXaa: 0.0 ± 0.0
Trp
2.703TrpAla: 2.703 ± 1.099
0.0TrpCys: 0.0 ± 0.0
0.901TrpAsp: 0.901 ± 0.793
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.901TrpLys: 0.901 ± 1.022
0.901TrpLeu: 0.901 ± 0.706
0.901TrpMet: 0.901 ± 0.706
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.901TrpGln: 0.901 ± 0.646
2.703TrpArg: 2.703 ± 1.13
0.0TrpSer: 0.0 ± 0.0
1.802TrpThr: 1.802 ± 1.808
0.901TrpVal: 0.901 ± 0.646
0.0TrpTrp: 0.0 ± 0.0
0.901TrpTyr: 0.901 ± 0.646
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.604TyrAla: 3.604 ± 1.212
0.0TyrCys: 0.0 ± 0.0
2.703TyrAsp: 2.703 ± 1.481
1.802TyrGlu: 1.802 ± 1.412
2.703TyrPhe: 2.703 ± 0.878
2.703TyrGly: 2.703 ± 0.816
0.901TyrHis: 0.901 ± 0.706
3.604TyrIle: 3.604 ± 0.976
0.901TyrLys: 0.901 ± 0.706
6.306TyrLeu: 6.306 ± 1.069
3.604TyrMet: 3.604 ± 1.079
4.505TyrAsn: 4.505 ± 1.644
0.901TyrPro: 0.901 ± 0.646
0.0TyrGln: 0.0 ± 0.0
2.703TyrArg: 2.703 ± 2.118
2.703TyrSer: 2.703 ± 1.388
0.0TyrThr: 0.0 ± 0.0
1.802TyrVal: 1.802 ± 0.89
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1111 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
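A protein-level bootstrap of this kind could be sketched as below. This is a hedged illustration rather than the database's actual procedure: the page only states that bootstrapping was done 100 times at the protein level, so the resampling scheme (whole proteins drawn with replacement, same proteome size), the use of the standard deviation as the reported error, the function names, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide, counting pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not the actual viral proteins.
toy = ["MSSTAA", "AASS", "MKLSS"]
point_estimate = dipeptide_per_mille(toy, "SS")
err = bootstrap_error(toy, "SS")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome. With only 6 proteins, as here, such errors can be large relative to the values themselves.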

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski