Amino acid dipeptide frequency for Rodent Torque teno virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
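The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: the function name `dipeptide_per_mille` is hypothetical, sequences are given in one-letter code, pairs are counted within each protein only, and the two input sequences are toy data rather than the actual proteins of this proteome.

```python
from collections import Counter
from itertools import product

# One-letter codes in the column order used by the table; X stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping adjacent pairs; assumption: pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy sequences (hypothetical), not the Rodent Torque teno virus 1 proteins.
freqs = dipeptide_per_mille(["MARRK", "RRAY"])

# Uniform expectation for the 400 standard dipeptides: 2.5 per mille.
uniform = 1000.0 / 400
overrepresented = sorted(dp for dp, f in freqs.items() if f > uniform)
```

The result always has 441 entries, one per cell of the table, and the entries sum to 1000‰ whenever at least one dipeptide was counted.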

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
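As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical, and using the uniform value as the reference is an assumption made for illustration; the page does not state exactly how "expected" is defined for the colouring.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_over_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation."""
    uniform_per_mille = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    return value_per_mille / uniform_per_mille


arg_arg = 20.468  # ArgArg value from the table, in per mille
arg_arg_fraction = per_mille_to_fraction(arg_arg)  # about 0.0205
arg_arg_fold = fold_over_uniform(arg_arg)          # about 8.2 times uniform
```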

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.924 ± 1.39
AlaCys: 0.0 ± 0.0
AlaAsp: 2.924 ± 1.39
AlaGlu: 0.0 ± 0.0
AlaPhe: 1.462 ± 1.326
AlaGly: 2.924 ± 1.387
AlaHis: 0.0 ± 0.0
AlaIle: 1.462 ± 0.813
AlaLys: 1.462 ± 1.326
AlaLeu: 2.924 ± 3.223
AlaMet: 0.0 ± 0.0
AlaAsn: 1.462 ± 1.326
AlaPro: 1.462 ± 1.564
AlaGln: 2.924 ± 1.626
AlaArg: 1.462 ± 0.813
AlaSer: 1.462 ± 0.813
AlaThr: 2.924 ± 1.626
AlaVal: 0.0 ± 0.0
AlaTrp: 2.924 ± 1.39
AlaTyr: 5.848 ± 3.252
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.462 ± 1.326
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.462 ± 0.813
CysHis: 0.0 ± 0.0
CysIle: 1.462 ± 1.326
CysLys: 1.462 ± 1.564
CysLeu: 0.0 ± 0.0
CysMet: 1.462 ± 1.612
CysAsn: 5.848 ± 0.921
CysPro: 1.462 ± 0.813
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 2.924 ± 2.108
CysThr: 1.462 ± 1.612
CysVal: 2.924 ± 1.387
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.0 ± 0.0
AspCys: 0.0 ± 0.0
AspAsp: 0.0 ± 0.0
AspGlu: 8.772 ± 4.229
AspPhe: 5.848 ± 2.774
AspGly: 4.386 ± 1.852
AspHis: 0.0 ± 0.0
AspIle: 1.462 ± 0.813
AspLys: 0.0 ± 0.0
AspLeu: 1.462 ± 0.813
AspMet: 2.924 ± 1.626
AspAsn: 2.924 ± 1.387
AspPro: 1.462 ± 1.564
AspGln: 1.462 ± 1.564
AspArg: 4.386 ± 1.497
AspSer: 5.848 ± 2.801
AspThr: 4.386 ± 1.409
AspVal: 4.386 ± 1.603
AspTrp: 0.0 ± 0.0
AspTyr: 1.462 ± 0.813
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.924 ± 1.926
GluCys: 2.924 ± 1.926
GluAsp: 8.772 ± 4.631
GluGlu: 5.848 ± 2.892
GluPhe: 0.0 ± 0.0
GluGly: 2.924 ± 3.223
GluHis: 4.386 ± 1.852
GluIle: 1.462 ± 0.813
GluLys: 4.386 ± 2.887
GluLeu: 5.848 ± 4.476
GluMet: 1.462 ± 1.564
GluAsn: 2.924 ± 2.108
GluPro: 1.462 ± 1.564
GluGln: 2.924 ± 1.387
GluArg: 1.462 ± 0.813
GluSer: 2.924 ± 3.127
GluThr: 2.924 ± 1.39
GluVal: 0.0 ± 0.0
GluTrp: 2.924 ± 1.926
GluTyr: 4.386 ± 1.232
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.924 ± 1.626
PheCys: 1.462 ± 1.612
PheAsp: 4.386 ± 2.439
PheGlu: 0.0 ± 0.0
PhePhe: 1.462 ± 0.813
PheGly: 0.0 ± 0.0
PheHis: 0.0 ± 0.0
PheIle: 4.386 ± 1.409
PheLys: 0.0 ± 0.0
PheLeu: 1.462 ± 1.564
PheMet: 1.462 ± 0.813
PheAsn: 1.462 ± 1.612
PhePro: 1.462 ± 0.813
PheGln: 4.386 ± 1.497
PheArg: 0.0 ± 0.0
PheSer: 4.386 ± 1.497
PheThr: 1.462 ± 0.813
PheVal: 1.462 ± 0.813
PheTrp: 0.0 ± 0.0
PheTyr: 1.462 ± 0.813
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.924 ± 3.223
GlyCys: 0.0 ± 0.0
GlyAsp: 4.386 ± 4.835
GlyGlu: 4.386 ± 1.597
GlyPhe: 1.462 ± 0.813
GlyGly: 7.31 ± 2.004
GlyHis: 0.0 ± 0.0
GlyIle: 2.924 ± 1.626
GlyLys: 4.386 ± 3.344
GlyLeu: 4.386 ± 2.621
GlyMet: 1.462 ± 0.813
GlyAsn: 4.386 ± 1.232
GlyPro: 2.924 ± 1.626
GlyGln: 4.386 ± 1.497
GlyArg: 1.462 ± 0.813
GlySer: 4.386 ± 1.232
GlyThr: 4.386 ± 1.603
GlyVal: 1.462 ± 1.612
GlyTrp: 2.924 ± 1.149
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.462 ± 0.813
HisGlu: 1.462 ± 1.612
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.462 ± 1.612
HisMet: 1.462 ± 1.32
HisAsn: 1.462 ± 1.326
HisPro: 5.848 ± 2.131
HisGln: 2.924 ± 1.39
HisArg: 1.462 ± 1.326
HisSer: 1.462 ± 0.813
HisThr: 2.924 ± 1.962
HisVal: 0.0 ± 0.0
HisTrp: 1.462 ± 0.813
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 1.462 ± 1.326
IleAsp: 1.462 ± 0.813
IleGlu: 1.462 ± 1.326
IlePhe: 1.462 ± 0.813
IleGly: 1.462 ± 1.612
IleHis: 0.0 ± 0.0
IleIle: 0.0 ± 0.0
IleLys: 5.848 ± 2.131
IleLeu: 7.31 ± 2.429
IleMet: 2.924 ± 1.343
IleAsn: 0.0 ± 0.0
IlePro: 1.462 ± 0.813
IleGln: 1.462 ± 0.813
IleArg: 1.462 ± 0.813
IleSer: 2.924 ± 1.149
IleThr: 4.386 ± 1.852
IleVal: 2.924 ± 1.626
IleTrp: 0.0 ± 0.0
IleTyr: 2.924 ± 1.387
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.924 ± 1.39
LysCys: 1.462 ± 1.564
LysAsp: 2.924 ± 1.626
LysGlu: 4.386 ± 1.597
LysPhe: 1.462 ± 0.813
LysGly: 2.924 ± 1.149
LysHis: 4.386 ± 1.409
LysIle: 0.0 ± 0.0
LysLys: 10.234 ± 5.989
LysLeu: 7.31 ± 7.818
LysMet: 0.0 ± 0.0
LysAsn: 2.924 ± 1.149
LysPro: 1.462 ± 1.612
LysGln: 4.386 ± 1.597
LysArg: 5.848 ± 2.727
LysSer: 2.924 ± 1.626
LysThr: 7.31 ± 2.798
LysVal: 8.772 ± 4.171
LysTrp: 2.924 ± 1.626
LysTyr: 1.462 ± 1.612
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.0 ± 0.0
LeuCys: 4.386 ± 3.412
LeuAsp: 4.386 ± 3.295
LeuGlu: 5.848 ± 2.801
LeuPhe: 2.924 ± 1.387
LeuGly: 1.462 ± 1.612
LeuHis: 1.462 ± 0.813
LeuIle: 4.386 ± 1.603
LeuLys: 7.31 ± 3.493
LeuLeu: 2.924 ± 2.108
LeuMet: 2.924 ± 1.626
LeuAsn: 2.924 ± 1.962
LeuPro: 1.462 ± 1.564
LeuGln: 2.924 ± 1.149
LeuArg: 5.848 ± 2.727
LeuSer: 8.772 ± 2.818
LeuThr: 4.386 ± 1.232
LeuVal: 1.462 ± 0.813
LeuTrp: 1.462 ± 0.813
LeuTyr: 1.462 ± 0.813
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.462 ± 0.813
MetCys: 1.462 ± 0.813
MetAsp: 2.924 ± 1.626
MetGlu: 1.462 ± 0.813
MetPhe: 1.462 ± 1.564
MetGly: 2.924 ± 1.626
MetHis: 0.0 ± 0.0
MetIle: 1.462 ± 1.612
MetLys: 2.924 ± 1.149
MetLeu: 2.924 ± 1.626
MetMet: 0.0 ± 0.0
MetAsn: 2.924 ± 2.108
MetPro: 0.0 ± 0.0
MetGln: 1.462 ± 1.326
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 2.924 ± 1.149
MetVal: 4.386 ± 1.852
MetTrp: 1.462 ± 0.813
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.462 ± 0.813
AsnCys: 2.924 ± 1.387
AsnAsp: 1.462 ± 0.813
AsnGlu: 0.0 ± 0.0
AsnPhe: 0.0 ± 0.0
AsnGly: 4.386 ± 3.344
AsnHis: 1.462 ± 1.612
AsnIle: 0.0 ± 0.0
AsnLys: 7.31 ± 1.728
AsnLeu: 4.386 ± 2.439
AsnMet: 1.462 ± 0.813
AsnAsn: 4.386 ± 2.439
AsnPro: 4.386 ± 1.597
AsnGln: 1.462 ± 0.813
AsnArg: 4.386 ± 1.852
AsnSer: 0.0 ± 0.0
AsnThr: 2.924 ± 3.127
AsnVal: 2.924 ± 2.108
AsnTrp: 2.924 ± 1.149
AsnTyr: 1.462 ± 0.813
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.462 ± 1.326
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 2.924 ± 3.223
ProPhe: 1.462 ± 0.813
ProGly: 4.386 ± 3.412
ProHis: 0.0 ± 0.0
ProIle: 4.386 ± 1.497
ProLys: 1.462 ± 0.813
ProLeu: 4.386 ± 1.232
ProMet: 1.462 ± 1.326
ProAsn: 4.386 ± 1.232
ProPro: 0.0 ± 0.0
ProGln: 2.924 ± 3.127
ProArg: 8.772 ± 3.982
ProSer: 5.848 ± 2.298
ProThr: 1.462 ± 1.612
ProVal: 1.462 ± 1.564
ProTrp: 1.462 ± 0.813
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.924 ± 1.626
GlnCys: 1.462 ± 1.564
GlnAsp: 1.462 ± 1.612
GlnGlu: 2.924 ± 1.962
GlnPhe: 2.924 ± 1.149
GlnGly: 5.848 ± 2.727
GlnHis: 2.924 ± 1.39
GlnIle: 0.0 ± 0.0
GlnLys: 2.924 ± 2.651
GlnLeu: 2.924 ± 1.962
GlnMet: 2.924 ± 1.149
GlnAsn: 2.924 ± 1.626
GlnPro: 7.31 ± 1.016
GlnGln: 2.924 ± 1.149
GlnArg: 0.0 ± 0.0
GlnSer: 0.0 ± 0.0
GlnThr: 4.386 ± 1.409
GlnVal: 0.0 ± 0.0
GlnTrp: 1.462 ± 1.326
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.31 ± 2.425
ArgCys: 1.462 ± 0.813
ArgAsp: 1.462 ± 0.813
ArgGlu: 5.848 ± 1.503
ArgPhe: 2.924 ± 1.149
ArgGly: 7.31 ± 1.016
ArgHis: 1.462 ± 1.326
ArgIle: 4.386 ± 3.977
ArgLys: 5.848 ± 1.743
ArgLeu: 0.0 ± 0.0
ArgMet: 0.0 ± 0.0
ArgAsn: 5.848 ± 3.252
ArgPro: 4.386 ± 2.592
ArgGln: 4.386 ± 1.232
ArgArg: 20.468 ± 7.644
ArgSer: 1.462 ± 1.612
ArgThr: 7.31 ± 4.065
ArgVal: 2.924 ± 1.39
ArgTrp: 1.462 ± 0.813
ArgTyr: 2.924 ± 1.149
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.924 ± 1.39
SerCys: 0.0 ± 0.0
SerAsp: 2.924 ± 1.149
SerGlu: 4.386 ± 3.412
SerPhe: 1.462 ± 0.813
SerGly: 2.924 ± 1.149
SerHis: 1.462 ± 1.564
SerIle: 2.924 ± 1.626
SerLys: 8.772 ± 3.448
SerLeu: 7.31 ± 2.318
SerMet: 2.924 ± 1.166
SerAsn: 1.462 ± 0.813
SerPro: 2.924 ± 1.149
SerGln: 2.924 ± 2.108
SerArg: 4.386 ± 1.409
SerSer: 7.31 ± 7.818
SerThr: 4.386 ± 2.621
SerVal: 2.924 ± 1.149
SerTrp: 5.848 ± 2.801
SerTyr: 1.462 ± 0.813
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.924 ± 1.626
ThrCys: 0.0 ± 0.0
ThrAsp: 4.386 ± 1.409
ThrGlu: 7.31 ± 3.733
ThrPhe: 1.462 ± 1.326
ThrGly: 1.462 ± 0.813
ThrHis: 2.924 ± 1.387
ThrIle: 4.386 ± 2.439
ThrLys: 1.462 ± 1.612
ThrLeu: 2.924 ± 3.223
ThrMet: 1.462 ± 1.537
ThrAsn: 0.0 ± 0.0
ThrPro: 2.924 ± 3.127
ThrGln: 2.924 ± 1.962
ThrArg: 7.31 ± 1.728
ThrSer: 8.772 ± 2.818
ThrThr: 5.848 ± 1.743
ThrVal: 2.924 ± 1.149
ThrTrp: 5.848 ± 3.252
ThrTyr: 2.924 ± 1.626
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.462 ± 1.326
ValCys: 2.924 ± 1.626
ValAsp: 2.924 ± 2.108
ValGlu: 1.462 ± 0.813
ValPhe: 4.386 ± 1.232
ValGly: 2.924 ± 1.626
ValHis: 1.462 ± 1.326
ValIle: 2.924 ± 1.39
ValLys: 1.462 ± 1.326
ValLeu: 5.848 ± 2.774
ValMet: 1.462 ± 1.564
ValAsn: 1.462 ± 0.813
ValPro: 2.924 ± 1.387
ValGln: 1.462 ± 1.326
ValArg: 2.924 ± 1.626
ValSer: 4.386 ± 1.232
ValThr: 0.0 ± 0.0
ValVal: 0.0 ± 0.0
ValTrp: 1.462 ± 0.813
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.462 ± 0.813
TrpGlu: 4.386 ± 1.497
TrpPhe: 1.462 ± 0.813
TrpGly: 1.462 ± 0.813
TrpHis: 1.462 ± 0.813
TrpIle: 0.0 ± 0.0
TrpLys: 4.386 ± 2.439
TrpLeu: 1.462 ± 1.612
TrpMet: 2.924 ± 1.626
TrpAsn: 0.0 ± 0.0
TrpPro: 2.924 ± 1.962
TrpGln: 0.0 ± 0.0
TrpArg: 8.772 ± 2.646
TrpSer: 2.924 ± 1.626
TrpThr: 2.924 ± 2.108
TrpVal: 0.0 ± 0.0
TrpTrp: 4.386 ± 1.232
TrpTyr: 1.462 ± 0.813
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.0 ± 0.0
TyrGly: 1.462 ± 0.813
TyrHis: 0.0 ± 0.0
TyrIle: 2.924 ± 1.626
TyrLys: 4.386 ± 2.439
TyrLeu: 1.462 ± 0.813
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 0.0 ± 0.0
TyrGln: 0.0 ± 0.0
TyrArg: 8.772 ± 3.528
TyrSer: 2.924 ± 2.108
TyrThr: 2.924 ± 1.149
TyrVal: 2.924 ± 1.626
TyrTrp: 1.462 ± 0.813
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (685 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
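One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement and recompute the frequency each time. The sketch below illustrates that idea only; the function names are hypothetical, the sequences are toy data, and reporting the spread as a population standard deviation over 100 resamples is an assumption rather than the documented Proteome-pI procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_frequency(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs counted within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    # Assumption: the reported error is a standard deviation of the estimates.
    return statistics.pstdev(estimates)


# Toy proteome (hypothetical), not the four proteins behind the table.
toy_proteins = ["MARRK", "RRAY", "KKLS"]
rr_error = bootstrap_error(toy_proteins, "RR")
ww_error = bootstrap_error(toy_proteins, "WW")  # absent dipeptide, no spread
```

A dipeptide that never occurs has a frequency of zero in every resample, so its error is zero as well, which matches the many 0.0 ± 0.0 entries in the table.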

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski