Amino acid dipeptide frequency for Beihai sphaeromadae virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
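As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping adjacent residue pairs and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the choice to count pairs within each protein and normalise by the total number of pairs counted are assumptions made for this example; the page does not state the exact procedure used by the database.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: pairs are counted within each protein (overlapping,
    adjacent residues) and normalised by the total number of pairs.
    """
    alphabet = STANDARD + "X"  # X stands for Xaa
    counts = Counter()
    for seq in proteins:
        # Map anything outside the standard 20 residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # zip(s, s[1:]) yields every overlapping adjacent pair.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence of length >= 2")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


# Expected frequency if all 400 standard dipeptides were equally likely.
uniform_expectation = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

# Hypothetical toy input, not taken from the proteome on this page.
toy = ["MKAAL", "AAGB"]
freqs = dipeptide_permille(toy)
```

Comparing each entry of `freqs` with `uniform_expectation` gives one simple way to label a dipeptide as over- or underrepresented, under the uniform assumption stated in the text.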

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.279 ± 4.539
AlaCys: 0.0 ± 0.0
AlaAsp: 4.283 ± 3.158
AlaGlu: 2.141 ± 0.197
AlaPhe: 4.283 ± 0.989
AlaGly: 7.138 ± 4.342
AlaHis: 1.428 ± 0.592
AlaIle: 3.569 ± 0.789
AlaLys: 4.283 ± 2.371
AlaLeu: 3.569 ± 0.593
AlaMet: 1.428 ± 0.626
AlaAsn: 5.71 ± 0.986
AlaPro: 4.283 ± 1.776
AlaGln: 0.714 ± 0.395
AlaArg: 4.996 ± 1.384
AlaSer: 8.565 ± 0.787
AlaThr: 7.852 ± 0.2
AlaVal: 3.569 ± 0.593
AlaTrp: 1.428 ± 0.592
AlaTyr: 2.855 ± 1.58
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.428 ± 0.79
CysCys: 0.0 ± 0.0
CysAsp: 0.714 ± 0.395
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.714 ± 0.395
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.714 ± 0.395
CysLeu: 1.428 ± 0.79
CysMet: 0.0 ± 0.0
CysAsn: 0.714 ± 0.395
CysPro: 0.714 ± 0.987
CysGln: 0.0 ± 0.0
CysArg: 0.714 ± 0.395
CysSer: 0.0 ± 0.0
CysThr: 2.855 ± 1.58
CysVal: 0.714 ± 0.395
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.569 ± 0.593
AspCys: 1.428 ± 0.79
AspAsp: 2.141 ± 0.197
AspGlu: 0.714 ± 0.395
AspPhe: 2.141 ± 0.197
AspGly: 10.707 ± 0.984
AspHis: 1.428 ± 0.79
AspIle: 3.569 ± 0.593
AspLys: 1.428 ± 0.592
AspLeu: 6.424 ± 2.174
AspMet: 0.714 ± 0.395
AspAsn: 2.855 ± 1.184
AspPro: 4.996 ± 0.001
AspGln: 2.141 ± 1.185
AspArg: 2.141 ± 0.197
AspSer: 2.141 ± 0.197
AspThr: 2.855 ± 0.198
AspVal: 3.569 ± 0.593
AspTrp: 0.0 ± 0.0
AspTyr: 2.855 ± 0.198
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.569 ± 0.789
GluCys: 0.0 ± 0.0
GluAsp: 2.141 ± 0.197
GluGlu: 3.569 ± 0.789
GluPhe: 1.428 ± 0.592
GluGly: 1.428 ± 0.79
GluHis: 0.0 ± 0.0
GluIle: 0.714 ± 0.395
GluLys: 1.428 ± 0.79
GluLeu: 6.424 ± 0.792
GluMet: 0.0 ± 0.0
GluAsn: 2.855 ± 2.566
GluPro: 1.428 ± 0.592
GluGln: 3.569 ± 1.976
GluArg: 1.428 ± 0.79
GluSer: 6.424 ± 2.174
GluThr: 2.855 ± 1.184
GluVal: 6.424 ± 0.59
GluTrp: 0.714 ± 0.395
GluTyr: 2.141 ± 1.185
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.428 ± 0.79
PheCys: 0.0 ± 0.0
PheAsp: 2.141 ± 1.185
PheGlu: 3.569 ± 0.789
PhePhe: 1.428 ± 0.79
PheGly: 0.714 ± 0.395
PheHis: 1.428 ± 0.79
PheIle: 1.428 ± 0.592
PheLys: 0.714 ± 0.395
PheLeu: 2.855 ± 0.198
PheMet: 0.714 ± 0.247
PheAsn: 2.141 ± 1.579
PhePro: 1.428 ± 0.592
PheGln: 2.141 ± 0.197
PheArg: 0.0 ± 0.0
PheSer: 2.141 ± 1.579
PheThr: 2.855 ± 1.184
PheVal: 4.283 ± 1.776
PheTrp: 1.428 ± 0.79
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.569 ± 3.553
GlyCys: 0.0 ± 0.0
GlyAsp: 2.141 ± 1.185
GlyGlu: 4.283 ± 1.776
GlyPhe: 3.569 ± 0.593
GlyGly: 4.996 ± 0.001
GlyHis: 3.569 ± 0.593
GlyIle: 4.283 ± 2.371
GlyLys: 2.855 ± 0.198
GlyLeu: 2.141 ± 1.185
GlyMet: 0.714 ± 0.395
GlyAsn: 2.141 ± 0.197
GlyPro: 5.71 ± 0.397
GlyGln: 4.283 ± 0.989
GlyArg: 6.424 ± 1.973
GlySer: 4.283 ± 3.158
GlyThr: 3.569 ± 0.593
GlyVal: 3.569 ± 0.789
GlyTrp: 1.428 ± 0.79
GlyTyr: 0.714 ± 0.987
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.428 ± 0.592
HisCys: 0.0 ± 0.0
HisAsp: 1.428 ± 0.79
HisGlu: 0.0 ± 0.0
HisPhe: 1.428 ± 0.79
HisGly: 0.714 ± 0.395
HisHis: 0.0 ± 0.0
HisIle: 2.141 ± 0.197
HisLys: 2.141 ± 1.185
HisLeu: 3.569 ± 1.976
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 3.569 ± 0.593
HisGln: 1.428 ± 0.79
HisArg: 3.569 ± 0.593
HisSer: 2.855 ± 1.58
HisThr: 0.714 ± 0.395
HisVal: 0.714 ± 0.395
HisTrp: 0.0 ± 0.0
HisTyr: 1.428 ± 0.79
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.141 ± 1.185
IleCys: 2.141 ± 1.185
IleAsp: 1.428 ± 0.79
IleGlu: 2.855 ± 0.198
IlePhe: 0.0 ± 0.0
IleGly: 5.71 ± 0.397
IleHis: 2.855 ± 1.58
IleIle: 5.71 ± 0.397
IleLys: 1.428 ± 0.592
IleLeu: 0.714 ± 0.395
IleMet: 0.0 ± 0.0
IleAsn: 3.569 ± 0.593
IlePro: 0.714 ± 0.395
IleGln: 1.428 ± 0.592
IleArg: 1.428 ± 0.592
IleSer: 8.565 ± 1.977
IleThr: 5.71 ± 2.368
IleVal: 5.71 ± 0.986
IleTrp: 0.0 ± 0.0
IleTyr: 2.141 ± 1.579
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.71 ± 1.779
LysCys: 0.714 ± 0.395
LysAsp: 4.996 ± 1.384
LysGlu: 4.283 ± 0.989
LysPhe: 0.0 ± 0.0
LysGly: 2.141 ± 0.197
LysHis: 2.141 ± 1.185
LysIle: 4.283 ± 2.371
LysLys: 2.141 ± 1.185
LysLeu: 2.855 ± 1.58
LysMet: 1.428 ± 0.79
LysAsn: 0.714 ± 0.395
LysPro: 2.141 ± 0.197
LysGln: 0.714 ± 0.987
LysArg: 2.141 ± 0.197
LysSer: 1.428 ± 0.592
LysThr: 4.996 ± 1.384
LysVal: 0.714 ± 0.395
LysTrp: 0.714 ± 0.987
LysTyr: 0.714 ± 0.395
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.852 ± 2.565
LeuCys: 0.0 ± 0.0
LeuAsp: 2.855 ± 1.58
LeuGlu: 2.855 ± 1.184
LeuPhe: 0.714 ± 0.395
LeuGly: 3.569 ± 1.976
LeuHis: 1.428 ± 0.79
LeuIle: 1.428 ± 0.592
LeuLys: 6.424 ± 2.174
LeuLeu: 7.852 ± 2.565
LeuMet: 0.714 ± 0.395
LeuAsn: 1.428 ± 1.974
LeuPro: 4.996 ± 1.381
LeuGln: 2.855 ± 1.184
LeuArg: 7.138 ± 1.187
LeuSer: 7.138 ± 0.195
LeuThr: 7.138 ± 0.195
LeuVal: 4.996 ± 1.384
LeuTrp: 0.0 ± 0.0
LeuTyr: 0.714 ± 0.395
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.714 ± 0.987
MetCys: 0.0 ± 0.0
MetAsp: 0.714 ± 0.395
MetGlu: 1.428 ± 0.79
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 1.428 ± 0.79
MetIle: 1.428 ± 0.79
MetLys: 0.0 ± 0.0
MetLeu: 0.714 ± 0.987
MetMet: 0.714 ± 0.987
MetAsn: 2.141 ± 0.197
MetPro: 0.714 ± 0.395
MetGln: 0.0 ± 0.0
MetArg: 2.141 ± 0.197
MetSer: 1.428 ± 0.592
MetThr: 0.714 ± 0.395
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.714 ± 0.987
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.71 ± 2.368
AsnCys: 2.141 ± 1.185
AsnAsp: 6.424 ± 0.792
AsnGlu: 2.855 ± 1.184
AsnPhe: 0.0 ± 0.0
AsnGly: 4.996 ± 0.001
AsnHis: 0.714 ± 0.395
AsnIle: 2.855 ± 0.198
AsnLys: 1.428 ± 0.79
AsnLeu: 2.141 ± 1.185
AsnMet: 1.428 ± 0.592
AsnAsn: 3.569 ± 2.171
AsnPro: 4.283 ± 4.54
AsnGln: 3.569 ± 2.171
AsnArg: 4.283 ± 0.394
AsnSer: 4.996 ± 0.001
AsnThr: 0.714 ± 0.395
AsnVal: 2.855 ± 1.184
AsnTrp: 0.714 ± 0.395
AsnTyr: 2.141 ± 0.197
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.141 ± 1.185
ProCys: 2.141 ± 0.197
ProAsp: 1.428 ± 0.592
ProGlu: 2.141 ± 1.579
ProPhe: 2.855 ± 0.198
ProGly: 3.569 ± 0.593
ProHis: 1.428 ± 0.592
ProIle: 2.855 ± 1.184
ProLys: 2.141 ± 1.579
ProLeu: 4.283 ± 0.989
ProMet: 0.0 ± 0.0
ProAsn: 4.283 ± 0.394
ProPro: 3.569 ± 4.935
ProGln: 2.855 ± 0.198
ProArg: 5.71 ± 0.986
ProSer: 5.71 ± 2.368
ProThr: 4.996 ± 1.381
ProVal: 4.996 ± 0.001
ProTrp: 0.714 ± 0.395
ProTyr: 2.141 ± 0.197
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.855 ± 1.184
GlnCys: 0.0 ± 0.0
GlnAsp: 2.855 ± 1.58
GlnGlu: 2.141 ± 0.197
GlnPhe: 0.714 ± 0.395
GlnGly: 0.714 ± 0.395
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 3.569 ± 1.976
GlnLeu: 4.283 ± 0.394
GlnMet: 0.714 ± 0.987
GlnAsn: 3.569 ± 0.593
GlnPro: 4.283 ± 0.989
GlnGln: 2.855 ± 1.58
GlnArg: 5.71 ± 0.986
GlnSer: 2.855 ± 1.58
GlnThr: 3.569 ± 0.789
GlnVal: 1.428 ± 0.79
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.996 ± 0.001
ArgCys: 0.714 ± 0.395
ArgAsp: 4.283 ± 2.371
ArgGlu: 2.141 ± 1.185
ArgPhe: 4.996 ± 1.381
ArgGly: 2.141 ± 1.579
ArgHis: 4.283 ± 2.371
ArgIle: 4.283 ± 0.989
ArgLys: 2.855 ± 1.58
ArgLeu: 5.71 ± 2.368
ArgMet: 1.428 ± 0.592
ArgAsn: 7.138 ± 2.569
ArgPro: 4.283 ± 1.776
ArgGln: 2.141 ± 0.197
ArgArg: 5.71 ± 3.75
ArgSer: 3.569 ± 0.789
ArgThr: 4.283 ± 0.989
ArgVal: 2.855 ± 0.198
ArgTrp: 0.714 ± 0.395
ArgTyr: 1.428 ± 1.974
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.71 ± 0.986
SerCys: 0.0 ± 0.0
SerAsp: 7.852 ± 2.565
SerGlu: 3.569 ± 1.976
SerPhe: 2.855 ± 2.566
SerGly: 5.71 ± 0.397
SerHis: 2.141 ± 1.185
SerIle: 6.424 ± 3.355
SerLys: 4.283 ± 0.394
SerLeu: 7.852 ± 1.182
SerMet: 0.714 ± 0.987
SerAsn: 4.283 ± 0.989
SerPro: 3.569 ± 0.593
SerGln: 0.714 ± 0.395
SerArg: 5.71 ± 0.986
SerSer: 5.71 ± 3.75
SerThr: 3.569 ± 1.976
SerVal: 7.138 ± 2.569
SerTrp: 2.855 ± 1.184
SerTyr: 2.855 ± 1.58
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.138 ± 2.569
ThrCys: 0.714 ± 0.395
ThrAsp: 5.71 ± 0.397
ThrGlu: 4.996 ± 1.384
ThrPhe: 4.283 ± 1.776
ThrGly: 2.855 ± 2.566
ThrHis: 0.714 ± 0.987
ThrIle: 4.283 ± 0.394
ThrLys: 1.428 ± 0.79
ThrLeu: 4.283 ± 1.776
ThrMet: 1.428 ± 0.592
ThrAsn: 2.141 ± 0.197
ThrPro: 4.996 ± 0.001
ThrGln: 3.569 ± 0.593
ThrArg: 3.569 ± 1.976
ThrSer: 4.996 ± 2.763
ThrThr: 2.855 ± 1.58
ThrVal: 6.424 ± 2.174
ThrTrp: 0.714 ± 0.395
ThrTyr: 2.141 ± 1.185
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.138 ± 2.96
ValCys: 0.0 ± 0.0
ValAsp: 2.141 ± 0.197
ValGlu: 4.996 ± 2.766
ValPhe: 1.428 ± 0.79
ValGly: 4.996 ± 0.001
ValHis: 1.428 ± 0.79
ValIle: 2.855 ± 1.58
ValLys: 2.855 ± 0.198
ValLeu: 2.141 ± 2.961
ValMet: 1.428 ± 0.79
ValAsn: 4.283 ± 3.158
ValPro: 3.569 ± 0.593
ValGln: 2.855 ± 1.58
ValArg: 5.71 ± 1.779
ValSer: 8.565 ± 1.977
ValThr: 5.71 ± 0.397
ValVal: 4.283 ± 1.776
ValTrp: 0.714 ± 0.395
ValTyr: 0.714 ± 0.395
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.428 ± 0.79
TrpCys: 0.714 ± 0.395
TrpAsp: 0.714 ± 0.987
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.714 ± 0.395
TrpLys: 0.714 ± 0.987
TrpLeu: 0.714 ± 0.395
TrpMet: 0.0 ± 0.0
TrpAsn: 0.714 ± 0.395
TrpPro: 0.714 ± 0.395
TrpGln: 1.428 ± 0.592
TrpArg: 0.0 ± 0.0
TrpSer: 0.714 ± 0.395
TrpThr: 0.0 ± 0.0
TrpVal: 2.141 ± 0.197
TrpTrp: 0.714 ± 0.987
TrpTyr: 1.428 ± 0.79
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.569 ± 1.976
TyrCys: 0.0 ± 0.0
TyrAsp: 1.428 ± 0.592
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.428 ± 1.974
TyrGly: 1.428 ± 0.79
TyrHis: 0.714 ± 0.395
TyrIle: 1.428 ± 0.592
TyrLys: 2.141 ± 1.185
TyrLeu: 2.141 ± 1.579
TyrMet: 0.714 ± 0.395
TyrAsn: 3.569 ± 0.789
TyrPro: 0.0 ± 0.0
TyrGln: 2.855 ± 1.58
TyrArg: 1.428 ± 0.79
TyrSer: 1.428 ± 0.79
TyrThr: 1.428 ± 0.592
TyrVal: 1.428 ± 0.79
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.714 ± 0.395
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1402 amino acids)
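With so few residues, every table entry corresponds to a small whole number of occurrences. The sketch below converts between per mille values and counts. The denominator of 1401 (1402 residues minus one) is an inference from the reported numbers, which match integer counts divided by 1401 to three decimals; it is not stated on the page, and the helper names are hypothetical.

```python
TOTAL_RESIDUES = 1402  # from the statistics line on this page

# Assumption: inferred from the table values, not documented by the source.
ASSUMED_DENOMINATOR = TOTAL_RESIDUES - 1


def permille_to_count(value_permille, denominator=ASSUMED_DENOMINATOR):
    """Recover the approximate number of occurrences behind a table value."""
    return round(value_permille / 1000.0 * denominator)


def count_to_permille(count, denominator=ASSUMED_DENOMINATOR):
    """Convert an occurrence count to per mille, rounded as in the table."""
    return round(1000.0 * count / denominator, 3)


ala_ala_count = permille_to_count(9.279)   # AlaAla entry
asp_gly_count = permille_to_count(10.707)  # AspGly entry
single_hit = count_to_permille(1)          # smallest non-zero table value
```

Under this assumption, the smallest non-zero value in the table (0.714‰) is a single occurrence, so many of the apparent over- and underrepresentations rest on only a handful of observations.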

Note: The error has been estimated by bootstrapping (×100) at the protein level.
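Protein-level bootstrapping can be sketched roughly as follows: resample whole proteins with replacement, recompute the dipeptide frequency on each resample, and take the spread of the resulting estimates. This is only an illustration under assumptions; the page does not specify the resampling details or which spread statistic is reported, and the function names, the fixed seed, and the use of the population standard deviation are choices made for this example.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: each round draws len(proteins) proteins with replacement,
    and the error is the population standard deviation of the estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome with two very different proteins.
toy = ["AAAAAA", "GGGGGG"]
error_aa = bootstrap_error(toy, "AA")
```

With only two proteins, as for this proteome, a resample can contain just three distinct combinations, which helps explain why the reported errors are often large relative to the values themselves.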

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski