Amino acid dipeptide frequency for Sanxia tombus-like virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
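As a rough illustration of what such a calculation involves, here is a minimal Python sketch. It assumes one-letter amino acid codes, overlapping two-residue windows that do not span protein boundaries, and normalisation by the total number of dipeptides counted; the function names and toy sequences are hypothetical, and the database's exact procedure is not stated on this page.

```python
from collections import Counter


def dipeptide_counts(sequences):
    """Count overlapping dipeptides within each sequence (never across sequences)."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_per_mille(sequences):
    """Return dipeptide frequencies in per mille of all dipeptides counted."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the proteome described here).
toy = ["MKKLA", "AKLA"]
freqs = dipeptide_per_mille(toy)
```

With this toy input there are seven dipeptides in total, so a pair seen twice (such as KL) gets 2/7 of the total.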

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
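The arithmetic behind the expected value can be sketched as follows. This assumes the baseline is a uniform distribution over all ordered pairs, as the paragraph above implies; the function names are hypothetical, and the exact rule used to colour the table is not stated on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(freq_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed baseline)."""
    if freq_per_mille > expected_per_mille:
        return "over"
    if freq_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_per_mille(20)   # 400 possible pairs
expected_21 = uniform_expectation_per_mille(21)   # 441 possible pairs, with Xaa

# Two values taken from the table: AlaLys and AlaCys.
ala_lys = classify(11.842, expected_20)
ala_cys = classify(1.316, expected_20)
```

Under this assumption AlaLys (11.842‰) lies above the 2.5‰ baseline and AlaCys (1.316‰) lies below it.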

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.211 ± 2.165
AlaCys: 1.316 ± 0.794
AlaAsp: 5.263 ± 0.686
AlaGlu: 5.263 ± 0.686
AlaPhe: 2.632 ± 1.587
AlaGly: 5.263 ± 2.616
AlaHis: 2.632 ± 0.343
AlaIle: 5.263 ± 2.616
AlaLys: 11.842 ± 2.508
AlaLeu: 3.947 ± 0.451
AlaMet: 5.263 ± 0.686
AlaAsn: 2.632 ± 0.343
AlaPro: 3.947 ± 1.479
AlaGln: 3.947 ± 0.451
AlaArg: 5.263 ± 0.686
AlaSer: 0.0 ± 0.0
AlaThr: 5.263 ± 0.686
AlaVal: 2.632 ± 2.273
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.316 ± 1.136
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.316 ± 0.794
CysGly: 3.947 ± 2.381
CysHis: 1.316 ± 0.794
CysIle: 1.316 ± 1.136
CysLys: 1.316 ± 1.136
CysLeu: 1.316 ± 0.794
CysMet: 0.0 ± 0.0
CysAsn: 1.316 ± 1.136
CysPro: 1.316 ± 0.794
CysGln: 1.316 ± 0.794
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 2.632 ± 0.343
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.947 ± 0.451
AspCys: 1.316 ± 0.794
AspAsp: 5.263 ± 1.244
AspGlu: 2.632 ± 1.587
AspPhe: 5.263 ± 1.244
AspGly: 1.316 ± 0.794
AspHis: 1.316 ± 0.794
AspIle: 2.632 ± 0.343
AspLys: 3.947 ± 1.479
AspLeu: 2.632 ± 1.587
AspMet: 1.316 ± 1.774
AspAsn: 1.316 ± 0.794
AspPro: 1.316 ± 0.794
AspGln: 1.316 ± 1.136
AspArg: 3.947 ± 2.381
AspSer: 2.632 ± 1.587
AspThr: 1.316 ± 1.136
AspVal: 5.263 ± 0.686
AspTrp: 0.0 ± 0.0
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.263 ± 4.546
GluCys: 0.0 ± 0.0
GluAsp: 1.316 ± 0.794
GluGlu: 7.895 ± 1.028
GluPhe: 2.632 ± 1.587
GluGly: 1.316 ± 0.794
GluHis: 6.579 ± 2.038
GluIle: 2.632 ± 0.343
GluLys: 7.895 ± 0.901
GluLeu: 3.947 ± 1.479
GluMet: 0.0 ± 0.0
GluAsn: 1.316 ± 0.794
GluPro: 0.0 ± 0.0
GluGln: 1.316 ± 0.794
GluArg: 1.316 ± 0.794
GluSer: 3.947 ± 2.381
GluThr: 3.947 ± 1.479
GluVal: 2.632 ± 0.343
GluTrp: 0.0 ± 0.0
GluTyr: 1.316 ± 0.794
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.316 ± 0.794
PheAsp: 3.947 ± 2.381
PheGlu: 2.632 ± 1.587
PhePhe: 1.316 ± 0.794
PheGly: 0.0 ± 0.0
PheHis: 0.0 ± 0.0
PheIle: 1.316 ± 0.794
PheLys: 3.947 ± 1.479
PheLeu: 9.211 ± 1.695
PheMet: 0.0 ± 0.0
PheAsn: 1.316 ± 0.794
PhePro: 0.0 ± 0.0
PheGln: 2.632 ± 2.273
PheArg: 2.632 ± 1.587
PheSer: 3.947 ± 1.479
PheThr: 0.0 ± 0.0
PheVal: 1.316 ± 0.794
PheTrp: 1.316 ± 0.794
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.632 ± 2.273
GlyCys: 0.0 ± 0.0
GlyAsp: 5.263 ± 0.686
GlyGlu: 1.316 ± 0.794
GlyPhe: 2.632 ± 0.343
GlyGly: 1.316 ± 0.794
GlyHis: 5.263 ± 0.686
GlyIle: 2.632 ± 1.587
GlyLys: 3.947 ± 0.451
GlyLeu: 6.579 ± 2.038
GlyMet: 2.632 ± 1.285
GlyAsn: 1.316 ± 0.794
GlyPro: 1.316 ± 1.136
GlyGln: 1.316 ± 1.136
GlyArg: 3.947 ± 3.409
GlySer: 1.316 ± 0.794
GlyThr: 2.632 ± 1.587
GlyVal: 6.579 ± 0.108
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.947 ± 2.381
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.632 ± 1.587
HisCys: 1.316 ± 0.794
HisAsp: 0.0 ± 0.0
HisGlu: 1.316 ± 1.136
HisPhe: 1.316 ± 0.794
HisGly: 2.632 ± 2.273
HisHis: 1.316 ± 0.794
HisIle: 3.947 ± 2.381
HisLys: 1.316 ± 0.794
HisLeu: 3.947 ± 0.451
HisMet: 1.316 ± 1.136
HisAsn: 0.0 ± 0.0
HisPro: 1.316 ± 0.794
HisGln: 1.316 ± 0.794
HisArg: 2.632 ± 0.343
HisSer: 2.632 ± 1.587
HisThr: 3.947 ± 3.409
HisVal: 2.632 ± 0.343
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.947 ± 0.451
IleCys: 1.316 ± 0.794
IleAsp: 5.263 ± 1.244
IleGlu: 1.316 ± 1.136
IlePhe: 1.316 ± 0.794
IleGly: 2.632 ± 1.587
IleHis: 1.316 ± 0.794
IleIle: 3.947 ± 1.479
IleLys: 3.947 ± 2.381
IleLeu: 1.316 ± 1.136
IleMet: 0.0 ± 0.0
IleAsn: 2.632 ± 0.343
IlePro: 1.316 ± 1.136
IleGln: 2.632 ± 0.343
IleArg: 3.947 ± 0.451
IleSer: 2.632 ± 1.587
IleThr: 5.263 ± 0.686
IleVal: 6.579 ± 1.822
IleTrp: 0.0 ± 0.0
IleTyr: 3.947 ± 0.451
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.895 ± 1.028
LysCys: 2.632 ± 0.343
LysAsp: 1.316 ± 1.136
LysGlu: 0.0 ± 0.0
LysPhe: 3.947 ± 2.381
LysGly: 5.263 ± 1.244
LysHis: 2.632 ± 0.343
LysIle: 5.263 ± 0.686
LysLys: 2.632 ± 2.273
LysLeu: 6.579 ± 1.822
LysMet: 1.316 ± 1.136
LysAsn: 3.947 ± 0.451
LysPro: 10.526 ± 3.301
LysGln: 1.316 ± 0.794
LysArg: 5.263 ± 0.686
LysSer: 5.263 ± 0.686
LysThr: 2.632 ± 2.273
LysVal: 5.263 ± 1.244
LysTrp: 1.316 ± 0.794
LysTyr: 2.632 ± 1.587
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.211 ± 0.235
LeuCys: 1.316 ± 0.794
LeuAsp: 3.947 ± 2.381
LeuGlu: 3.947 ± 0.451
LeuPhe: 1.316 ± 1.136
LeuGly: 9.211 ± 4.095
LeuHis: 3.947 ± 1.479
LeuIle: 2.632 ± 0.343
LeuLys: 10.526 ± 1.371
LeuLeu: 13.158 ± 7.504
LeuMet: 1.316 ± 0.794
LeuAsn: 6.579 ± 0.108
LeuPro: 1.316 ± 1.136
LeuGln: 5.263 ± 2.616
LeuArg: 5.263 ± 1.244
LeuSer: 3.947 ± 0.451
LeuThr: 2.632 ± 0.343
LeuVal: 5.263 ± 1.244
LeuTrp: 1.316 ± 0.794
LeuTyr: 3.947 ± 2.381
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.316 ± 0.794
MetCys: 2.632 ± 2.273
MetAsp: 1.316 ± 0.794
MetGlu: 3.947 ± 1.479
MetPhe: 1.316 ± 1.136
MetGly: 2.632 ± 1.587
MetHis: 0.0 ± 0.0
MetIle: 1.316 ± 0.794
MetLys: 0.0 ± 0.0
MetLeu: 1.316 ± 1.136
MetMet: 0.0 ± 0.0
MetAsn: 1.316 ± 0.794
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 1.316 ± 1.136
MetSer: 1.316 ± 0.794
MetThr: 1.316 ± 0.794
MetVal: 1.316 ± 0.794
MetTrp: 0.0 ± 0.0
MetTyr: 1.316 ± 0.794
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.316 ± 0.794
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 2.632 ± 1.587
AsnPhe: 1.316 ± 1.136
AsnGly: 2.632 ± 1.587
AsnHis: 0.0 ± 0.0
AsnIle: 2.632 ± 0.343
AsnLys: 0.0 ± 0.0
AsnLeu: 7.895 ± 0.901
AsnMet: 0.0 ± 0.0
AsnAsn: 3.947 ± 0.451
AsnPro: 5.263 ± 1.244
AsnGln: 0.0 ± 0.0
AsnArg: 5.263 ± 1.244
AsnSer: 2.632 ± 0.343
AsnThr: 1.316 ± 0.794
AsnVal: 2.632 ± 0.343
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.316 ± 0.794
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.579 ± 5.682
ProCys: 0.0 ± 0.0
ProAsp: 1.316 ± 0.794
ProGlu: 0.0 ± 0.0
ProPhe: 2.632 ± 0.343
ProGly: 1.316 ± 0.794
ProHis: 0.0 ± 0.0
ProIle: 5.263 ± 0.686
ProLys: 2.632 ± 0.343
ProLeu: 5.263 ± 1.244
ProMet: 0.0 ± 0.0
ProAsn: 1.316 ± 1.136
ProPro: 6.579 ± 5.682
ProGln: 1.316 ± 1.136
ProArg: 2.632 ± 1.587
ProSer: 3.947 ± 1.479
ProThr: 1.316 ± 1.136
ProVal: 6.579 ± 2.038
ProTrp: 0.0 ± 0.0
ProTyr: 1.316 ± 0.794
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.947 ± 1.479
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.316 ± 0.794
GlnPhe: 0.0 ± 0.0
GlnGly: 1.316 ± 1.136
GlnHis: 1.316 ± 1.136
GlnIle: 1.316 ± 0.794
GlnLys: 2.632 ± 2.273
GlnLeu: 3.947 ± 1.479
GlnMet: 1.316 ± 0.794
GlnAsn: 0.0 ± 0.0
GlnPro: 1.316 ± 1.136
GlnGln: 0.0 ± 0.0
GlnArg: 2.632 ± 2.273
GlnSer: 6.579 ± 0.108
GlnThr: 0.0 ± 0.0
GlnVal: 1.316 ± 0.794
GlnTrp: 1.316 ± 1.136
GlnTyr: 1.316 ± 0.794
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 11.842 ± 1.352
ArgCys: 0.0 ± 0.0
ArgAsp: 3.947 ± 1.479
ArgGlu: 6.579 ± 2.038
ArgPhe: 0.0 ± 0.0
ArgGly: 2.632 ± 1.587
ArgHis: 1.316 ± 0.794
ArgIle: 2.632 ± 0.343
ArgLys: 2.632 ± 0.343
ArgLeu: 6.579 ± 5.682
ArgMet: 3.947 ± 2.381
ArgAsn: 3.947 ± 0.451
ArgPro: 0.0 ± 0.0
ArgGln: 3.947 ± 1.479
ArgArg: 2.632 ± 0.343
ArgSer: 7.895 ± 2.831
ArgThr: 5.263 ± 2.616
ArgVal: 6.579 ± 3.968
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.632 ± 0.343
SerCys: 2.632 ± 1.587
SerAsp: 1.316 ± 0.794
SerGlu: 1.316 ± 0.794
SerPhe: 3.947 ± 1.479
SerGly: 6.579 ± 0.108
SerHis: 1.316 ± 0.794
SerIle: 1.316 ± 0.794
SerLys: 6.579 ± 2.038
SerLeu: 2.632 ± 1.587
SerMet: 0.0 ± 0.0
SerAsn: 2.632 ± 1.587
SerPro: 5.263 ± 0.686
SerGln: 0.0 ± 0.0
SerArg: 5.263 ± 0.686
SerSer: 1.316 ± 0.794
SerThr: 3.947 ± 2.381
SerVal: 6.579 ± 1.822
SerTrp: 0.0 ± 0.0
SerTyr: 2.632 ± 1.587
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.632 ± 0.343
ThrCys: 0.0 ± 0.0
ThrAsp: 3.947 ± 1.479
ThrGlu: 3.947 ± 3.409
ThrPhe: 0.0 ± 0.0
ThrGly: 2.632 ± 0.343
ThrHis: 2.632 ± 0.343
ThrIle: 1.316 ± 0.794
ThrLys: 3.947 ± 1.479
ThrLeu: 3.947 ± 1.479
ThrMet: 2.632 ± 0.343
ThrAsn: 1.316 ± 0.794
ThrPro: 0.0 ± 0.0
ThrGln: 0.0 ± 0.0
ThrArg: 7.895 ± 0.901
ThrSer: 1.316 ± 1.136
ThrThr: 2.632 ± 0.343
ThrVal: 2.632 ± 2.273
ThrTrp: 1.316 ± 0.794
ThrTyr: 3.947 ± 1.479
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.263 ± 2.616
ValCys: 0.0 ± 0.0
ValAsp: 5.263 ± 3.174
ValGlu: 7.895 ± 0.901
ValPhe: 2.632 ± 1.587
ValGly: 1.316 ± 1.136
ValHis: 1.316 ± 1.136
ValIle: 5.263 ± 1.244
ValLys: 5.263 ± 0.686
ValLeu: 6.579 ± 0.108
ValMet: 0.0 ± 0.0
ValAsn: 1.316 ± 0.794
ValPro: 6.579 ± 0.108
ValGln: 2.632 ± 0.343
ValArg: 7.895 ± 0.901
ValSer: 3.947 ± 0.451
ValThr: 3.947 ± 3.409
ValVal: 3.947 ± 1.479
ValTrp: 0.0 ± 0.0
ValTyr: 2.632 ± 1.587
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.316 ± 0.794
TrpHis: 1.316 ± 0.794
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.316 ± 1.136
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.316 ± 0.794
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.316 ± 0.794
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.632 ± 0.343
TyrCys: 1.316 ± 1.136
TyrAsp: 1.316 ± 1.136
TyrGlu: 1.316 ± 0.794
TyrPhe: 1.316 ± 0.794
TyrGly: 1.316 ± 0.794
TyrHis: 0.0 ± 0.0
TyrIle: 2.632 ± 1.587
TyrLys: 2.632 ± 1.587
TyrLeu: 3.947 ± 2.381
TyrMet: 1.316 ± 0.794
TyrAsn: 2.632 ± 1.587
TyrPro: 2.632 ± 1.587
TyrGln: 1.316 ± 1.136
TyrArg: 2.632 ± 1.587
TyrSer: 1.316 ± 0.794
TyrThr: 1.316 ± 0.794
TyrVal: 1.316 ± 0.794
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (761 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
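One way such a protein-level bootstrap could look is sketched below. It assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation of the per-mille frequency across replicates; the function names, the seed, and the toy sequences are hypothetical, and the database's actual implementation may differ.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide among all dipeptides in the sequences."""
    total = 0
    hits = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_sd(proteins, pair, n_boot=100, seed=0):
    """Resample whole proteins with replacement and return the SD of the estimates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not the proteome described here).
toy_proteins = ["MKKLA", "AKLA"]
sd_kk = bootstrap_sd(toy_proteins, "KK")
```

With only two proteins, as in this proteome, each replicate can contain just one of three distinct combinations, so the resulting error estimates are necessarily coarse.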

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski