Amino acid dipeptide frequency for Hubei partiti-like virus 11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
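The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names, the toy sequences, and the choice to count overlapping residue pairs within each protein (never across protein boundaries) are all assumptions, as is the use of the uniform 1/400 baseline to decide what counts as over- or underrepresented. One-letter amino acid codes are used for brevity, whereas the table uses three-letter codes.

```python
from collections import Counter

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping windows of length 2 are counted within each
    sequence, so no dipeptide spans two proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Hypothetical toy input, not taken from the proteome.
toy_sequences = ["MKAAE", "AEK"]
freqs = dipeptide_permille(toy_sequences)
```

With values from the table, `classify(7.866)` labels AlaGlu as overrepresented and `classify(0.983)` labels AlaGly as underrepresented relative to 2.5‰, and `permille_to_fraction(7.866)` gives roughly 0.007866.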

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.967 ± 1.523
AlaCys: 1.967 ± 0.095
AlaAsp: 1.967 ± 0.095
AlaGlu: 7.866 ± 1.809
AlaPhe: 3.933 ± 0.191
AlaGly: 0.983 ± 0.666
AlaHis: 0.0 ± 0.0
AlaIle: 2.95 ± 0.571
AlaLys: 3.933 ± 2.664
AlaLeu: 0.983 ± 0.761
AlaMet: 0.983 ± 0.666
AlaAsn: 0.983 ± 0.761
AlaPro: 3.933 ± 1.618
AlaGln: 1.967 ± 1.523
AlaArg: 2.95 ± 0.571
AlaSer: 3.933 ± 0.191
AlaThr: 3.933 ± 0.191
AlaVal: 5.9 ± 1.141
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.933 ± 0.191
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.983 ± 0.666
CysCys: 0.983 ± 0.761
CysAsp: 0.0 ± 0.0
CysGlu: 0.983 ± 0.761
CysPhe: 0.983 ± 0.761
CysGly: 2.95 ± 0.857
CysHis: 0.983 ± 0.666
CysIle: 0.983 ± 0.666
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.983 ± 0.666
CysAsn: 1.967 ± 0.095
CysPro: 0.0 ± 0.0
CysGln: 1.967 ± 0.095
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.983 ± 0.761
CysVal: 0.983 ± 0.666
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.9 ± 0.286
AspCys: 0.0 ± 0.0
AspAsp: 5.9 ± 1.141
AspGlu: 1.967 ± 0.095
AspPhe: 6.883 ± 1.048
AspGly: 4.916 ± 0.475
AspHis: 0.0 ± 0.0
AspIle: 2.95 ± 1.998
AspLys: 7.866 ± 0.382
AspLeu: 2.95 ± 1.998
AspMet: 2.95 ± 1.998
AspAsn: 1.967 ± 0.095
AspPro: 1.967 ± 0.095
AspGln: 1.967 ± 1.332
AspArg: 6.883 ± 0.38
AspSer: 0.983 ± 0.666
AspThr: 1.967 ± 1.332
AspVal: 4.916 ± 1.902
AspTrp: 1.967 ± 0.095
AspTyr: 2.95 ± 0.857
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.967 ± 1.332
GluCys: 0.0 ± 0.0
GluAsp: 3.933 ± 0.191
GluGlu: 4.916 ± 0.475
GluPhe: 0.983 ± 0.761
GluGly: 4.916 ± 2.38
GluHis: 0.983 ± 0.761
GluIle: 2.95 ± 0.857
GluLys: 3.933 ± 1.618
GluLeu: 1.967 ± 0.095
GluMet: 1.967 ± 0.415
GluAsn: 3.933 ± 1.236
GluPro: 1.967 ± 1.523
GluGln: 2.95 ± 0.857
GluArg: 5.9 ± 1.714
GluSer: 3.933 ± 0.191
GluThr: 0.983 ± 0.761
GluVal: 3.933 ± 1.618
GluTrp: 0.983 ± 0.666
GluTyr: 1.967 ± 1.332
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.95 ± 0.571
PheCys: 0.0 ± 0.0
PheAsp: 5.9 ± 1.714
PheGlu: 1.967 ± 0.095
PhePhe: 0.983 ± 0.666
PheGly: 9.833 ± 1.905
PheHis: 0.0 ± 0.0
PheIle: 1.967 ± 1.332
PheLys: 0.0 ± 0.0
PheLeu: 2.95 ± 1.998
PheMet: 1.967 ± 1.332
PheAsn: 3.933 ± 1.236
PhePro: 3.933 ± 1.618
PheGln: 1.967 ± 1.523
PheArg: 2.95 ± 0.571
PheSer: 1.967 ± 1.523
PheThr: 2.95 ± 0.571
PheVal: 3.933 ± 1.618
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.9 ± 0.286
GlyCys: 0.0 ± 0.0
GlyAsp: 4.916 ± 0.475
GlyGlu: 2.95 ± 1.998
GlyPhe: 1.967 ± 1.332
GlyGly: 2.95 ± 1.998
GlyHis: 0.983 ± 0.761
GlyIle: 6.883 ± 3.234
GlyLys: 2.95 ± 0.571
GlyLeu: 4.916 ± 0.952
GlyMet: 1.967 ± 1.332
GlyAsn: 4.916 ± 0.952
GlyPro: 0.983 ± 0.761
GlyGln: 2.95 ± 0.571
GlyArg: 1.967 ± 1.332
GlySer: 6.883 ± 1.048
GlyThr: 1.967 ± 1.332
GlyVal: 1.967 ± 0.095
GlyTrp: 1.967 ± 1.523
GlyTyr: 3.933 ± 1.236
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.983 ± 0.761
HisGlu: 1.967 ± 0.095
HisPhe: 0.983 ± 0.761
HisGly: 0.983 ± 0.666
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 2.95 ± 2.284
HisMet: 0.0 ± 0.0
HisAsn: 0.983 ± 0.666
HisPro: 2.95 ± 0.571
HisGln: 0.0 ± 0.0
HisArg: 1.967 ± 0.095
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 0.983 ± 0.761
HisTrp: 0.983 ± 0.761
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.95 ± 1.998
IleCys: 0.983 ± 0.761
IleAsp: 5.9 ± 2.568
IleGlu: 3.933 ± 1.618
IlePhe: 3.933 ± 1.236
IleGly: 3.933 ± 1.236
IleHis: 0.983 ± 0.761
IleIle: 2.95 ± 0.571
IleLys: 0.0 ± 0.0
IleLeu: 1.967 ± 0.095
IleMet: 0.0 ± 0.0
IleAsn: 3.933 ± 1.236
IlePro: 4.916 ± 0.952
IleGln: 2.95 ± 1.998
IleArg: 2.95 ± 0.571
IleSer: 3.933 ± 1.236
IleThr: 2.95 ± 0.857
IleVal: 3.933 ± 2.664
IleTrp: 0.983 ± 0.761
IleTyr: 0.983 ± 0.761
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.967 ± 0.095
LysCys: 0.983 ± 0.666
LysAsp: 0.0 ± 0.0
LysGlu: 4.916 ± 0.475
LysPhe: 1.967 ± 1.523
LysGly: 2.95 ± 0.571
LysHis: 0.0 ± 0.0
LysIle: 2.95 ± 0.571
LysLys: 3.933 ± 3.046
LysLeu: 5.9 ± 1.141
LysMet: 1.967 ± 1.523
LysAsn: 0.983 ± 0.761
LysPro: 5.9 ± 4.568
LysGln: 0.983 ± 0.666
LysArg: 3.933 ± 0.191
LysSer: 8.85 ± 1.143
LysThr: 4.916 ± 2.38
LysVal: 2.95 ± 0.857
LysTrp: 0.983 ± 0.666
LysTyr: 2.95 ± 0.571
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.95 ± 0.571
LeuCys: 1.967 ± 0.095
LeuAsp: 0.983 ± 0.666
LeuGlu: 3.933 ± 1.618
LeuPhe: 1.967 ± 1.523
LeuGly: 7.866 ± 3.9
LeuHis: 1.967 ± 1.332
LeuIle: 4.916 ± 0.475
LeuLys: 5.9 ± 0.286
LeuLeu: 3.933 ± 1.618
LeuMet: 2.95 ± 0.571
LeuAsn: 3.933 ± 1.618
LeuPro: 6.883 ± 1.807
LeuGln: 1.967 ± 0.095
LeuArg: 4.916 ± 0.475
LeuSer: 5.9 ± 1.714
LeuThr: 2.95 ± 0.857
LeuVal: 1.967 ± 0.095
LeuTrp: 0.983 ± 0.761
LeuTyr: 0.983 ± 0.761
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.967 ± 1.332
MetCys: 0.983 ± 0.666
MetAsp: 3.933 ± 1.236
MetGlu: 2.95 ± 1.998
MetPhe: 2.95 ± 0.857
MetGly: 0.983 ± 0.761
MetHis: 0.0 ± 0.0
MetIle: 2.95 ± 0.571
MetLys: 0.983 ± 0.761
MetLeu: 2.95 ± 0.571
MetMet: 0.983 ± 0.761
MetAsn: 2.95 ± 0.571
MetPro: 0.983 ± 0.761
MetGln: 0.983 ± 0.666
MetArg: 2.95 ± 0.571
MetSer: 4.916 ± 2.38
MetThr: 2.95 ± 1.998
MetVal: 1.967 ± 1.332
MetTrp: 1.967 ± 1.332
MetTyr: 0.983 ± 0.666
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.95 ± 0.571
AsnCys: 0.0 ± 0.0
AsnAsp: 5.9 ± 1.141
AsnGlu: 0.983 ± 0.666
AsnPhe: 3.933 ± 0.191
AsnGly: 2.95 ± 1.998
AsnHis: 0.0 ± 0.0
AsnIle: 2.95 ± 1.998
AsnLys: 0.983 ± 0.761
AsnLeu: 5.9 ± 3.141
AsnMet: 2.95 ± 1.998
AsnAsn: 3.933 ± 3.046
AsnPro: 1.967 ± 0.095
AsnGln: 0.0 ± 0.0
AsnArg: 4.916 ± 2.38
AsnSer: 1.967 ± 0.095
AsnThr: 4.916 ± 0.952
AsnVal: 2.95 ± 0.857
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.916 ± 3.807
ProCys: 0.0 ± 0.0
ProAsp: 3.933 ± 1.236
ProGlu: 2.95 ± 0.857
ProPhe: 0.983 ± 0.761
ProGly: 2.95 ± 1.998
ProHis: 0.0 ± 0.0
ProIle: 0.983 ± 0.666
ProLys: 4.916 ± 3.807
ProLeu: 5.9 ± 1.714
ProMet: 0.983 ± 0.761
ProAsn: 1.967 ± 1.523
ProPro: 3.933 ± 1.236
ProGln: 0.983 ± 0.761
ProArg: 0.983 ± 0.666
ProSer: 2.95 ± 0.571
ProThr: 4.916 ± 0.952
ProVal: 4.916 ± 0.475
ProTrp: 3.933 ± 2.664
ProTyr: 1.967 ± 0.095
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.95 ± 0.571
GlnCys: 0.983 ± 0.666
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.0 ± 0.0
GlnPhe: 4.916 ± 1.902
GlnGly: 0.983 ± 0.761
GlnHis: 0.983 ± 0.761
GlnIle: 0.0 ± 0.0
GlnLys: 1.967 ± 0.095
GlnLeu: 0.983 ± 0.666
GlnMet: 0.983 ± 0.666
GlnAsn: 0.983 ± 0.761
GlnPro: 0.983 ± 0.666
GlnGln: 0.0 ± 0.0
GlnArg: 0.983 ± 0.666
GlnSer: 6.883 ± 1.807
GlnThr: 3.933 ± 1.618
GlnVal: 1.967 ± 1.523
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.983 ± 0.761
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.95 ± 1.998
ArgCys: 0.0 ± 0.0
ArgAsp: 5.9 ± 0.286
ArgGlu: 0.983 ± 0.666
ArgPhe: 2.95 ± 0.857
ArgGly: 4.916 ± 0.475
ArgHis: 1.967 ± 1.523
ArgIle: 4.916 ± 0.952
ArgLys: 2.95 ± 0.571
ArgLeu: 1.967 ± 1.332
ArgMet: 5.9 ± 1.141
ArgAsn: 1.967 ± 0.095
ArgPro: 5.9 ± 1.141
ArgGln: 0.983 ± 0.761
ArgArg: 2.95 ± 0.571
ArgSer: 7.866 ± 1.809
ArgThr: 1.967 ± 1.332
ArgVal: 2.95 ± 1.998
ArgTrp: 0.983 ± 0.666
ArgTyr: 5.9 ± 2.568
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.933 ± 3.046
SerCys: 0.983 ± 0.761
SerAsp: 4.916 ± 0.475
SerGlu: 3.933 ± 1.618
SerPhe: 3.933 ± 0.191
SerGly: 2.95 ± 0.571
SerHis: 4.916 ± 2.38
SerIle: 3.933 ± 1.618
SerLys: 6.883 ± 2.475
SerLeu: 4.916 ± 0.475
SerMet: 6.883 ± 1.048
SerAsn: 1.967 ± 1.332
SerPro: 0.983 ± 0.666
SerGln: 4.916 ± 0.475
SerArg: 6.883 ± 0.38
SerSer: 2.95 ± 2.284
SerThr: 0.983 ± 0.666
SerVal: 3.933 ± 0.191
SerTrp: 1.967 ± 1.523
SerTyr: 2.95 ± 0.571
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.0 ± 0.0
ThrCys: 1.967 ± 1.332
ThrAsp: 3.933 ± 1.236
ThrGlu: 1.967 ± 0.095
ThrPhe: 0.983 ± 0.761
ThrGly: 1.967 ± 0.095
ThrHis: 0.0 ± 0.0
ThrIle: 3.933 ± 1.618
ThrLys: 4.916 ± 0.952
ThrLeu: 7.866 ± 1.809
ThrMet: 0.983 ± 0.512
ThrAsn: 0.983 ± 0.666
ThrPro: 2.95 ± 0.571
ThrGln: 0.0 ± 0.0
ThrArg: 3.933 ± 1.236
ThrSer: 7.866 ± 3.236
ThrThr: 1.967 ± 0.095
ThrVal: 2.95 ± 0.857
ThrTrp: 0.983 ± 0.666
ThrTyr: 1.967 ± 0.095
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.916 ± 0.952
ValCys: 0.983 ± 0.761
ValAsp: 4.916 ± 0.475
ValGlu: 2.95 ± 2.284
ValPhe: 1.967 ± 1.332
ValGly: 0.983 ± 0.666
ValHis: 0.0 ± 0.0
ValIle: 2.95 ± 0.571
ValLys: 3.933 ± 1.618
ValLeu: 4.916 ± 0.475
ValMet: 2.95 ± 0.571
ValAsn: 4.916 ± 0.475
ValPro: 3.933 ± 0.191
ValGln: 3.933 ± 1.236
ValArg: 4.916 ± 1.902
ValSer: 2.95 ± 0.857
ValThr: 2.95 ± 2.284
ValVal: 3.933 ± 1.236
ValTrp: 1.967 ± 1.332
ValTyr: 0.983 ± 0.666
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.983 ± 0.761
TrpCys: 0.0 ± 0.0
TrpAsp: 2.95 ± 1.998
TrpGlu: 0.983 ± 0.761
TrpPhe: 0.983 ± 0.666
TrpGly: 0.983 ± 0.666
TrpHis: 0.983 ± 0.666
TrpIle: 0.983 ± 0.666
TrpLys: 2.95 ± 0.571
TrpLeu: 0.983 ± 0.666
TrpMet: 0.0 ± 0.0
TrpAsn: 2.95 ± 0.857
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.983 ± 0.666
TrpThr: 1.967 ± 0.095
TrpVal: 0.983 ± 0.761
TrpTrp: 0.983 ± 0.666
TrpTyr: 0.983 ± 0.761
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.967 ± 1.523
TyrCys: 2.95 ± 0.857
TyrAsp: 0.983 ± 0.666
TyrGlu: 2.95 ± 2.284
TyrPhe: 1.967 ± 1.332
TyrGly: 1.967 ± 0.095
TyrHis: 0.983 ± 0.666
TyrIle: 1.967 ± 0.095
TyrLys: 0.983 ± 0.666
TyrLeu: 4.916 ± 1.902
TyrMet: 2.95 ± 0.571
TyrAsn: 0.0 ± 0.0
TyrPro: 0.0 ± 0.0
TyrGln: 0.0 ± 0.0
TyrArg: 3.933 ± 2.664
TyrSer: 0.0 ± 0.0
TyrThr: 1.967 ± 0.095
TyrVal: 3.933 ± 0.191
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1018 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
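A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's own procedure, which is not described here; it assumes that "x100" means 100 replicates, that whole proteins are resampled with replacement, and that the reported error is the standard deviation of the resampled frequencies. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency_permille(sequences, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs in all sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of a dipeptide frequency across bootstrap replicates.

    Assumption: each replicate draws len(sequences) whole proteins with
    replacement, i.e. resampling happens at the protein level.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome of two short "proteins".
toy_proteome = ["MKAAE", "AEK"]
ae_error = bootstrap_error(toy_proteome, "AE")
```

With only two proteins, as in this proteome, each replicate can contain just three distinct combinations of proteins, so the resulting error estimates are coarse and should be read with caution.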

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski