Amino acid dipepetide frequency for Sanxia water strider virus 19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.506AlaAla: 7.506 ± 2.785
0.0AlaCys: 0.0 ± 0.0
1.732AlaAsp: 1.732 ± 1.314
5.196AlaGlu: 5.196 ± 0.691
2.309AlaPhe: 2.309 ± 0.455
5.196AlaGly: 5.196 ± 2.436
2.887AlaHis: 2.887 ± 0.632
3.464AlaIle: 3.464 ± 0.868
5.774AlaLys: 5.774 ± 1.108
6.928AlaLeu: 6.928 ± 2.546
0.577AlaMet: 0.577 ± 0.284
0.577AlaAsn: 0.577 ± 0.284
5.774AlaPro: 5.774 ± 1.108
3.464AlaGln: 3.464 ± 0.868
2.887AlaArg: 2.887 ± 0.632
6.928AlaSer: 6.928 ± 2.237
5.196AlaThr: 5.196 ± 1.256
2.309AlaVal: 2.309 ± 1.137
2.309AlaTrp: 2.309 ± 0.455
4.042AlaTyr: 4.042 ± 1.127
0.0AlaXaa: 0.0 ± 0.0
Cys
0.577CysAla: 0.577 ± 0.284
0.0CysCys: 0.0 ± 0.0
0.577CysAsp: 0.577 ± 0.284
1.155CysGlu: 1.155 ± 0.568
0.577CysPhe: 0.577 ± 0.773
0.0CysGly: 0.0 ± 0.0
0.577CysHis: 0.577 ± 0.284
0.0CysIle: 0.0 ± 0.0
1.155CysLys: 1.155 ± 0.568
1.155CysLeu: 1.155 ± 0.568
0.577CysMet: 0.577 ± 0.284
0.0CysAsn: 0.0 ± 0.0
2.887CysPro: 2.887 ± 1.421
0.0CysGln: 0.0 ± 0.0
2.309CysArg: 2.309 ± 0.455
0.577CysSer: 0.577 ± 1.566
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.619AspAla: 4.619 ± 0.909
0.0AspCys: 0.0 ± 0.0
2.309AspAsp: 2.309 ± 1.106
1.155AspGlu: 1.155 ± 0.568
3.464AspPhe: 3.464 ± 0.838
4.619AspGly: 4.619 ± 0.869
1.732AspHis: 1.732 ± 0.419
1.732AspIle: 1.732 ± 2.975
3.464AspLys: 3.464 ± 1.448
7.506AspLeu: 7.506 ± 1.817
0.577AspMet: 0.577 ± 0.284
1.732AspAsn: 1.732 ± 0.419
4.042AspPro: 4.042 ± 1.181
2.887AspGln: 2.887 ± 1.122
2.309AspArg: 2.309 ± 1.27
1.732AspSer: 1.732 ± 0.419
1.732AspThr: 1.732 ± 0.419
4.042AspVal: 4.042 ± 0.827
2.887AspTrp: 2.887 ± 0.632
0.577AspTyr: 0.577 ± 0.773
0.0AspXaa: 0.0 ± 0.0
Glu
3.464GluAla: 3.464 ± 1.705
0.0GluCys: 0.0 ± 0.0
4.619GluAsp: 4.619 ± 0.869
4.619GluGlu: 4.619 ± 1.66
2.309GluPhe: 2.309 ± 1.137
4.619GluGly: 4.619 ± 3.228
1.732GluHis: 1.732 ± 0.853
2.309GluIle: 2.309 ± 1.137
4.042GluLys: 4.042 ± 0.827
6.351GluLeu: 6.351 ± 0.452
4.042GluMet: 4.042 ± 1.127
1.732GluAsn: 1.732 ± 0.853
3.464GluPro: 3.464 ± 0.964
2.887GluGln: 2.887 ± 0.632
3.464GluArg: 3.464 ± 1.705
2.309GluSer: 2.309 ± 2.083
3.464GluThr: 3.464 ± 1.705
5.774GluVal: 5.774 ± 1.229
0.577GluTrp: 0.577 ± 0.284
1.155GluTyr: 1.155 ± 1.418
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
1.155PheAsp: 1.155 ± 0.568
0.577PheGlu: 0.577 ± 0.284
0.577PhePhe: 0.577 ± 0.284
2.309PheGly: 2.309 ± 0.455
1.155PheHis: 1.155 ± 1.418
0.577PheIle: 0.577 ± 0.773
2.309PheLys: 2.309 ± 0.455
3.464PheLeu: 3.464 ± 0.964
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
2.887PhePro: 2.887 ± 0.939
3.464PheGln: 3.464 ± 0.838
1.732PheArg: 1.732 ± 0.419
4.619PheSer: 4.619 ± 0.869
1.732PheThr: 1.732 ± 0.853
2.887PheVal: 2.887 ± 0.939
0.577PheTrp: 0.577 ± 0.284
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.774GlyAla: 5.774 ± 3.99
0.577GlyCys: 0.577 ± 0.284
4.619GlyAsp: 4.619 ± 1.749
4.042GlyGlu: 4.042 ± 2.033
2.887GlyPhe: 2.887 ± 1.122
5.196GlyGly: 5.196 ± 2.436
0.577GlyHis: 0.577 ± 1.566
1.732GlyIle: 1.732 ± 1.314
6.351GlyLys: 6.351 ± 2.304
4.619GlyLeu: 4.619 ± 1.395
1.155GlyMet: 1.155 ± 0.553
0.577GlyAsn: 0.577 ± 0.284
5.774GlyPro: 5.774 ± 1.264
1.732GlyGln: 1.732 ± 0.853
2.309GlyArg: 2.309 ± 3.066
5.774GlySer: 5.774 ± 1.18
5.774GlyThr: 5.774 ± 1.878
4.042GlyVal: 4.042 ± 1.127
0.577GlyTrp: 0.577 ± 0.773
1.155GlyTyr: 1.155 ± 0.553
0.0GlyXaa: 0.0 ± 0.0
His
2.887HisAla: 2.887 ± 1.421
0.577HisCys: 0.577 ± 0.773
0.577HisAsp: 0.577 ± 0.284
2.309HisGlu: 2.309 ± 1.137
1.155HisPhe: 1.155 ± 0.553
2.309HisGly: 2.309 ± 0.455
0.0HisHis: 0.0 ± 0.0
0.577HisIle: 0.577 ± 0.773
0.0HisLys: 0.0 ± 0.0
2.309HisLeu: 2.309 ± 1.27
0.0HisMet: 0.0 ± 0.0
0.577HisAsn: 0.577 ± 0.284
1.155HisPro: 1.155 ± 0.568
0.577HisGln: 0.577 ± 0.284
2.309HisArg: 2.309 ± 3.066
1.155HisSer: 1.155 ± 0.568
1.155HisThr: 1.155 ± 1.546
0.577HisVal: 0.577 ± 0.284
1.155HisTrp: 1.155 ± 0.568
1.732HisTyr: 1.732 ± 0.853
0.0HisXaa: 0.0 ± 0.0
Ile
1.155IleAla: 1.155 ± 1.418
0.577IleCys: 0.577 ± 0.284
1.155IleAsp: 1.155 ± 0.568
2.309IleGlu: 2.309 ± 2.083
1.155IlePhe: 1.155 ± 1.793
2.309IleGly: 2.309 ± 1.106
0.0IleHis: 0.0 ± 0.0
1.155IleIle: 1.155 ± 0.553
1.155IleLys: 1.155 ± 0.568
1.155IleLeu: 1.155 ± 0.568
2.309IleMet: 2.309 ± 0.744
0.577IleAsn: 0.577 ± 0.773
4.619IlePro: 4.619 ± 0.909
0.577IleGln: 0.577 ± 0.284
3.464IleArg: 3.464 ± 0.838
1.155IleSer: 1.155 ± 0.568
2.309IleThr: 2.309 ± 1.137
4.619IleVal: 4.619 ± 3.71
0.0IleTrp: 0.0 ± 0.0
1.155IleTyr: 1.155 ± 0.568
0.0IleXaa: 0.0 ± 0.0
Lys
4.042LysAla: 4.042 ± 1.127
1.732LysCys: 1.732 ± 0.853
4.042LysAsp: 4.042 ± 0.873
4.042LysGlu: 4.042 ± 1.127
1.732LysPhe: 1.732 ± 1.316
6.351LysGly: 6.351 ± 1.492
1.155LysHis: 1.155 ± 0.553
1.732LysIle: 1.732 ± 0.419
6.351LysLys: 6.351 ± 3.126
5.774LysLeu: 5.774 ± 0.51
0.577LysMet: 0.577 ± 0.606
1.732LysAsn: 1.732 ± 1.314
6.928LysPro: 6.928 ± 1.736
2.309LysGln: 2.309 ± 1.27
5.196LysArg: 5.196 ± 0.691
1.155LysSer: 1.155 ± 0.568
5.196LysThr: 5.196 ± 2.558
4.042LysVal: 4.042 ± 1.989
2.309LysTrp: 2.309 ± 0.455
0.577LysTyr: 0.577 ± 0.773
0.0LysXaa: 0.0 ± 0.0
Leu
9.815LeuAla: 9.815 ± 1.803
2.309LeuCys: 2.309 ± 1.137
7.506LeuAsp: 7.506 ± 3.807
6.351LeuGlu: 6.351 ± 0.896
2.887LeuPhe: 2.887 ± 1.421
2.887LeuGly: 2.887 ± 1.421
2.309LeuHis: 2.309 ± 1.137
1.732LeuIle: 1.732 ± 0.853
2.887LeuLys: 2.887 ± 1.287
13.857LeuLeu: 13.857 ± 3.93
1.155LeuMet: 1.155 ± 0.568
2.309LeuAsn: 2.309 ± 1.137
6.928LeuPro: 6.928 ± 2.129
4.042LeuGln: 4.042 ± 2.867
4.619LeuArg: 4.619 ± 1.749
11.547LeuSer: 11.547 ± 0.812
4.619LeuThr: 4.619 ± 0.869
2.309LeuVal: 2.309 ± 0.455
2.309LeuTrp: 2.309 ± 1.106
2.887LeuTyr: 2.887 ± 1.861
0.0LeuXaa: 0.0 ± 0.0
Met
4.042MetAla: 4.042 ± 1.483
0.577MetCys: 0.577 ± 0.284
2.887MetAsp: 2.887 ± 0.632
1.732MetGlu: 1.732 ± 0.419
1.155MetPhe: 1.155 ± 0.568
1.732MetGly: 1.732 ± 2.975
1.155MetHis: 1.155 ± 0.553
0.577MetIle: 0.577 ± 0.284
0.0MetLys: 0.0 ± 0.0
0.577MetLeu: 0.577 ± 0.284
0.577MetMet: 0.577 ± 0.284
0.577MetAsn: 0.577 ± 0.284
1.155MetPro: 1.155 ± 0.553
0.577MetGln: 0.577 ± 0.284
1.155MetArg: 1.155 ± 0.568
1.732MetSer: 1.732 ± 0.853
1.732MetThr: 1.732 ± 0.853
1.155MetVal: 1.155 ± 0.553
0.0MetTrp: 0.0 ± 0.0
0.577MetTyr: 0.577 ± 0.284
0.0MetXaa: 0.0 ± 0.0
Asn
2.309AsnAla: 2.309 ± 0.455
0.0AsnCys: 0.0 ± 0.0
1.155AsnAsp: 1.155 ± 0.553
1.155AsnGlu: 1.155 ± 0.568
0.0AsnPhe: 0.0 ± 0.0
0.577AsnGly: 0.577 ± 0.773
0.577AsnHis: 0.577 ± 0.284
0.0AsnIle: 0.0 ± 0.0
1.732AsnLys: 1.732 ± 0.853
2.309AsnLeu: 2.309 ± 0.455
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
5.196AsnPro: 5.196 ± 2.963
1.155AsnGln: 1.155 ± 0.553
1.155AsnArg: 1.155 ± 0.568
1.732AsnSer: 1.732 ± 0.853
1.732AsnThr: 1.732 ± 1.314
1.155AsnVal: 1.155 ± 0.553
0.577AsnTrp: 0.577 ± 0.284
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
4.619ProAla: 4.619 ± 1.395
0.577ProCys: 0.577 ± 0.284
4.042ProAsp: 4.042 ± 0.827
4.042ProGlu: 4.042 ± 1.989
1.155ProPhe: 1.155 ± 0.568
6.351ProGly: 6.351 ± 5.922
3.464ProHis: 3.464 ± 1.448
3.464ProIle: 3.464 ± 1.365
6.351ProLys: 6.351 ± 2.224
8.083ProLeu: 8.083 ± 1.745
1.732ProMet: 1.732 ± 0.419
1.732ProAsn: 1.732 ± 1.314
4.619ProPro: 4.619 ± 0.869
3.464ProGln: 3.464 ± 1.448
6.928ProArg: 6.928 ± 3.725
8.083ProSer: 8.083 ± 2.253
5.196ProThr: 5.196 ± 1.064
6.351ProVal: 6.351 ± 0.896
0.0ProTrp: 0.0 ± 0.0
1.732ProTyr: 1.732 ± 0.853
0.0ProXaa: 0.0 ± 0.0
Gln
4.619GlnAla: 4.619 ± 0.909
0.0GlnCys: 0.0 ± 0.0
2.309GlnAsp: 2.309 ± 1.322
1.155GlnGlu: 1.155 ± 0.568
0.0GlnPhe: 0.0 ± 0.0
4.042GlnGly: 4.042 ± 1.127
0.0GlnHis: 0.0 ± 0.0
3.464GlnIle: 3.464 ± 0.964
2.887GlnLys: 2.887 ± 1.861
4.619GlnLeu: 4.619 ± 0.869
1.732GlnMet: 1.732 ± 0.792
0.577GlnAsn: 0.577 ± 0.773
3.464GlnPro: 3.464 ± 0.964
1.155GlnGln: 1.155 ± 1.546
2.887GlnArg: 2.887 ± 1.287
2.887GlnSer: 2.887 ± 0.939
1.155GlnThr: 1.155 ± 0.553
2.887GlnVal: 2.887 ± 1.287
1.732GlnTrp: 1.732 ± 0.853
1.732GlnTyr: 1.732 ± 0.853
0.0GlnXaa: 0.0 ± 0.0
Arg
2.309ArgAla: 2.309 ± 0.455
1.155ArgCys: 1.155 ± 1.418
2.309ArgAsp: 2.309 ± 0.455
5.774ArgGlu: 5.774 ± 2.575
2.309ArgPhe: 2.309 ± 0.455
3.464ArgGly: 3.464 ± 1.659
0.577ArgHis: 0.577 ± 0.284
2.887ArgIle: 2.887 ± 2.855
3.464ArgLys: 3.464 ± 1.365
6.928ArgLeu: 6.928 ± 3.638
3.464ArgMet: 3.464 ± 0.838
3.464ArgAsn: 3.464 ± 3.627
5.774ArgPro: 5.774 ± 3.882
1.732ArgGln: 1.732 ± 0.853
6.351ArgArg: 6.351 ± 4.921
5.196ArgSer: 5.196 ± 1.256
2.887ArgThr: 2.887 ± 1.421
4.042ArgVal: 4.042 ± 1.181
0.577ArgTrp: 0.577 ± 1.566
1.732ArgTyr: 1.732 ± 0.853
0.0ArgXaa: 0.0 ± 0.0
Ser
4.042SerAla: 4.042 ± 0.873
1.155SerCys: 1.155 ± 0.553
4.619SerAsp: 4.619 ± 0.869
2.309SerGlu: 2.309 ± 1.27
1.732SerPhe: 1.732 ± 0.419
2.887SerGly: 2.887 ± 0.632
1.155SerHis: 1.155 ± 0.553
2.887SerIle: 2.887 ± 0.939
7.506SerLys: 7.506 ± 1.509
4.042SerLeu: 4.042 ± 1.127
3.464SerMet: 3.464 ± 0.868
1.155SerAsn: 1.155 ± 0.568
5.196SerPro: 5.196 ± 2.97
5.196SerGln: 5.196 ± 1.064
6.928SerArg: 6.928 ± 1.594
5.196SerSer: 5.196 ± 2.436
6.351SerThr: 6.351 ± 1.756
2.887SerVal: 2.887 ± 0.632
1.155SerTrp: 1.155 ± 0.568
2.887SerTyr: 2.887 ± 0.632
0.0SerXaa: 0.0 ± 0.0
Thr
5.196ThrAla: 5.196 ± 1.669
0.577ThrCys: 0.577 ± 0.284
3.464ThrAsp: 3.464 ± 1.448
2.309ThrGlu: 2.309 ± 0.455
2.309ThrPhe: 2.309 ± 1.106
5.196ThrGly: 5.196 ± 2.032
1.155ThrHis: 1.155 ± 0.568
2.309ThrIle: 2.309 ± 1.106
4.042ThrLys: 4.042 ± 1.127
5.774ThrLeu: 5.774 ± 1.264
0.577ThrMet: 0.577 ± 0.284
1.732ThrAsn: 1.732 ± 0.419
6.351ThrPro: 6.351 ± 1.267
2.309ThrGln: 2.309 ± 1.137
3.464ThrArg: 3.464 ± 0.868
5.196ThrSer: 5.196 ± 1.256
4.042ThrThr: 4.042 ± 0.827
4.042ThrVal: 4.042 ± 1.989
1.155ThrTrp: 1.155 ± 0.553
0.577ThrTyr: 0.577 ± 0.284
0.0ThrXaa: 0.0 ± 0.0
Val
6.928ValAla: 6.928 ± 1.594
1.155ValCys: 1.155 ± 0.568
2.309ValAsp: 2.309 ± 1.106
5.774ValGlu: 5.774 ± 1.946
1.732ValPhe: 1.732 ± 0.853
0.577ValGly: 0.577 ± 0.284
1.732ValHis: 1.732 ± 0.853
1.155ValIle: 1.155 ± 0.553
4.042ValLys: 4.042 ± 1.483
6.351ValLeu: 6.351 ± 1.267
0.0ValMet: 0.0 ± 0.0
2.309ValAsn: 2.309 ± 1.137
4.042ValPro: 4.042 ± 1.181
5.196ValGln: 5.196 ± 0.691
4.042ValArg: 4.042 ± 0.873
3.464ValSer: 3.464 ± 0.838
4.042ValThr: 4.042 ± 1.127
5.774ValVal: 5.774 ± 1.264
1.155ValTrp: 1.155 ± 0.568
2.887ValTyr: 2.887 ± 2.855
0.0ValXaa: 0.0 ± 0.0
Trp
0.577TrpAla: 0.577 ± 0.284
0.577TrpCys: 0.577 ± 0.284
0.0TrpAsp: 0.0 ± 0.0
2.887TrpGlu: 2.887 ± 0.939
0.577TrpPhe: 0.577 ± 0.773
1.732TrpGly: 1.732 ± 0.853
0.577TrpHis: 0.577 ± 0.284
1.155TrpIle: 1.155 ± 0.568
1.732TrpLys: 1.732 ± 0.853
1.155TrpLeu: 1.155 ± 0.553
0.577TrpMet: 0.577 ± 1.566
0.577TrpAsn: 0.577 ± 0.284
0.577TrpPro: 0.577 ± 0.284
0.0TrpGln: 0.0 ± 0.0
0.577TrpArg: 0.577 ± 0.284
0.577TrpSer: 0.577 ± 0.284
2.309TrpThr: 2.309 ± 0.455
2.887TrpVal: 2.887 ± 0.939
0.0TrpTrp: 0.0 ± 0.0
0.577TrpTyr: 0.577 ± 0.284
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.577TyrAla: 0.577 ± 0.284
0.577TyrCys: 0.577 ± 0.284
1.732TyrAsp: 1.732 ± 1.314
4.042TyrGlu: 4.042 ± 1.989
0.0TyrPhe: 0.0 ± 0.0
2.887TyrGly: 2.887 ± 0.632
0.577TyrHis: 0.577 ± 0.284
0.577TyrIle: 0.577 ± 0.773
1.732TyrLys: 1.732 ± 0.419
2.309TyrLeu: 2.309 ± 1.322
0.0TyrMet: 0.0 ± 0.0
0.577TyrAsn: 0.577 ± 0.284
1.155TyrPro: 1.155 ± 0.568
0.577TyrGln: 0.577 ± 0.284
2.309TyrArg: 2.309 ± 0.455
1.732TyrSer: 1.732 ± 0.419
1.155TyrThr: 1.155 ± 1.546
2.887TyrVal: 2.887 ± 0.939
0.577TyrTrp: 0.577 ± 0.284
0.577TyrTyr: 0.577 ± 0.284
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1733 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski