Amino acid dipeptide frequency for Torque teno sus virus 1b

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
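The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function names, the choice to count overlapping pairs within each protein, and the mapping of unknown residues to `X` (Xaa) are all assumptions, and the two sequences are toy data rather than the viral proteins.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins, alphabet=AMINO_ACIDS):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumptions: overlapping pairs are counted within each protein, and any
    residue outside `alphabet` is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in alphabet else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    letters = alphabet + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(letters, repeat=2)
    }


def expected_uniform_per_mille(n_letters=20):
    """Frequency each dipeptide would have if all were equally likely."""
    return 1000.0 / n_letters ** 2


def classify(freq, expected):
    """Label a dipeptide as over- or underrepresented (red / blue in the table)."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not the actual proteome).
freqs = dipeptide_per_mille(["MKRRA", "RRG"])
expected = expected_uniform_per_mille()  # 2.5 per mille for 400 dipeptides
print("RR", freqs["RR"], classify(freqs["RR"], expected))
```

Passing `n_letters=21` to `expected_uniform_per_mille` gives the expectation for the 441-dipeptide alphabet that includes Xaa.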

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
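As a quick illustration of the unit conversion, the snippet below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the example value is the ArgArg entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


arg_arg = 30.259  # ArgArg frequency in per mille, taken from the table
print(per_mille_to_fraction(arg_arg))  # about 0.030259
print(per_mille_to_percent(arg_arg))   # about 3.0259 %
```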

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.764 ± 15.102
AlaCys: 0.0 ± 0.0
AlaAsp: 1.441 ± 5.222
AlaGlu: 1.441 ± 5.222
AlaPhe: 2.882 ± 1.125
AlaGly: 2.882 ± 1.125
AlaHis: 2.882 ± 4.659
AlaIle: 5.764 ± 3.534
AlaLys: 1.441 ± 0.563
AlaLeu: 1.441 ± 5.222
AlaMet: 0.0 ± 0.0
AlaAsn: 1.441 ± 0.563
AlaPro: 2.882 ± 1.125
AlaGln: 2.882 ± 4.659
AlaArg: 1.441 ± 0.563
AlaSer: 4.323 ± 1.688
AlaThr: 4.323 ± 4.096
AlaVal: 1.441 ± 0.563
AlaTrp: 1.441 ± 0.563
AlaTyr: 2.882 ± 4.659
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.441 ± 5.222
CysCys: 0.0 ± 0.0
CysAsp: 4.323 ± 4.096
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.441 ± 0.563
CysLys: 2.882 ± 4.659
CysLeu: 4.323 ± 4.096
CysMet: 0.0 ± 0.0
CysAsn: 1.441 ± 0.563
CysPro: 1.441 ± 0.563
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 1.441 ± 0.563
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.764 ± 20.886
AspCys: 2.882 ± 4.659
AspAsp: 2.882 ± 1.125
AspGlu: 4.323 ± 1.688
AspPhe: 0.0 ± 0.0
AspGly: 2.882 ± 4.659
AspHis: 1.441 ± 0.563
AspIle: 4.323 ± 4.096
AspLys: 0.0 ± 0.0
AspLeu: 1.441 ± 0.563
AspMet: 0.0 ± 0.0
AspAsn: 2.882 ± 1.125
AspPro: 5.764 ± 3.534
AspGln: 1.441 ± 0.563
AspArg: 2.882 ± 4.659
AspSer: 1.441 ± 0.563
AspThr: 2.882 ± 1.125
AspVal: 0.0 ± 0.0
AspTrp: 2.882 ± 1.125
AspTyr: 2.882 ± 1.125
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.882 ± 1.125
GluCys: 1.441 ± 0.563
GluAsp: 1.441 ± 5.222
GluGlu: 10.086 ± 1.846
GluPhe: 1.441 ± 0.563
GluGly: 5.764 ± 3.534
GluHis: 0.0 ± 0.0
GluIle: 1.441 ± 0.563
GluLys: 7.205 ± 2.971
GluLeu: 7.205 ± 2.813
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 4.323 ± 1.688
GluGln: 2.882 ± 1.125
GluArg: 2.882 ± 4.659
GluSer: 1.441 ± 0.563
GluThr: 4.323 ± 1.688
GluVal: 1.441 ± 0.563
GluTrp: 0.0 ± 0.0
GluTyr: 1.441 ± 0.563
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.441 ± 0.563
PheCys: 1.441 ± 5.222
PheAsp: 2.882 ± 4.659
PheGlu: 1.441 ± 0.563
PhePhe: 2.882 ± 1.125
PheGly: 2.882 ± 1.125
PheHis: 1.441 ± 0.563
PheIle: 1.441 ± 0.563
PheLys: 5.764 ± 2.25
PheLeu: 1.441 ± 0.563
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 0.0 ± 0.0
PheGln: 1.441 ± 0.563
PheArg: 7.205 ± 2.813
PheSer: 0.0 ± 0.0
PheThr: 4.323 ± 1.688
PheVal: 1.441 ± 0.563
PheTrp: 1.441 ± 0.563
PheTyr: 2.882 ± 1.125
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.441 ± 0.563
GlyCys: 1.441 ± 0.563
GlyAsp: 5.764 ± 9.318
GlyGlu: 0.0 ± 0.0
GlyPhe: 1.441 ± 0.563
GlyGly: 12.968 ± 12.289
GlyHis: 4.323 ± 1.688
GlyIle: 5.764 ± 3.534
GlyLys: 5.764 ± 2.25
GlyLeu: 4.323 ± 4.096
GlyMet: 1.441 ± 0.563
GlyAsn: 5.764 ± 2.25
GlyPro: 1.441 ± 0.563
GlyGln: 4.323 ± 1.688
GlyArg: 0.0 ± 0.0
GlySer: 5.764 ± 2.25
GlyThr: 2.882 ± 4.659
GlyVal: 2.882 ± 1.125
GlyTrp: 5.764 ± 2.25
GlyTyr: 2.882 ± 1.125
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.441 ± 0.563
HisPhe: 0.0 ± 0.0
HisGly: 2.882 ± 4.659
HisHis: 1.441 ± 0.563
HisIle: 1.441 ± 0.563
HisLys: 2.882 ± 1.125
HisLeu: 2.882 ± 4.659
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 4.323 ± 1.688
HisGln: 0.0 ± 0.0
HisArg: 1.441 ± 0.563
HisSer: 2.882 ± 1.125
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 2.882 ± 1.125
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.882 ± 4.659
IleCys: 0.0 ± 0.0
IleAsp: 4.323 ± 4.096
IleGlu: 2.882 ± 1.125
IlePhe: 2.882 ± 1.125
IleGly: 5.764 ± 3.534
IleHis: 0.0 ± 0.0
IleIle: 0.0 ± 0.0
IleLys: 2.882 ± 1.125
IleLeu: 1.441 ± 0.563
IleMet: 1.441 ± 0.563
IleAsn: 1.441 ± 0.563
IlePro: 2.882 ± 1.125
IleGln: 4.323 ± 1.688
IleArg: 1.441 ± 0.563
IleSer: 1.441 ± 0.563
IleThr: 0.0 ± 0.0
IleVal: 4.323 ± 1.688
IleTrp: 1.441 ± 0.563
IleTyr: 1.441 ± 0.563
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.323 ± 1.688
LysCys: 1.441 ± 5.222
LysAsp: 4.323 ± 4.096
LysGlu: 4.323 ± 1.688
LysPhe: 4.323 ± 1.688
LysGly: 0.0 ± 0.0
LysHis: 4.323 ± 4.096
LysIle: 1.441 ± 0.563
LysLys: 5.764 ± 3.534
LysLeu: 7.205 ± 2.813
LysMet: 2.882 ± 1.125
LysAsn: 1.441 ± 0.563
LysPro: 5.764 ± 2.25
LysGln: 4.323 ± 1.688
LysArg: 4.323 ± 1.688
LysSer: 2.882 ± 1.125
LysThr: 5.764 ± 2.25
LysVal: 2.882 ± 1.125
LysTrp: 0.0 ± 0.0
LysTyr: 4.323 ± 1.688
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.882 ± 4.659
LeuCys: 1.441 ± 0.563
LeuAsp: 1.441 ± 0.563
LeuGlu: 4.323 ± 4.096
LeuPhe: 2.882 ± 4.659
LeuGly: 7.205 ± 2.813
LeuHis: 1.441 ± 0.563
LeuIle: 7.205 ± 2.813
LeuLys: 10.086 ± 3.938
LeuLeu: 8.646 ± 2.409
LeuMet: 2.882 ± 1.002
LeuAsn: 4.323 ± 1.688
LeuPro: 4.323 ± 1.688
LeuGln: 2.882 ± 1.125
LeuArg: 4.323 ± 1.688
LeuSer: 7.205 ± 2.813
LeuThr: 5.764 ± 9.318
LeuVal: 1.441 ± 0.563
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.323 ± 1.688
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.441 ± 0.563
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 2.882 ± 4.659
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 2.882 ± 1.125
MetLeu: 4.323 ± 1.688
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 2.882 ± 1.125
MetGln: 1.441 ± 0.563
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.323 ± 1.688
AsnCys: 1.441 ± 0.563
AsnAsp: 0.0 ± 0.0
AsnGlu: 1.441 ± 0.563
AsnPhe: 2.882 ± 1.125
AsnGly: 1.441 ± 0.563
AsnHis: 0.0 ± 0.0
AsnIle: 1.441 ± 0.563
AsnLys: 0.0 ± 0.0
AsnLeu: 7.205 ± 2.813
AsnMet: 1.441 ± 0.563
AsnAsn: 0.0 ± 0.0
AsnPro: 8.646 ± 3.375
AsnGln: 1.441 ± 0.563
AsnArg: 2.882 ± 1.125
AsnSer: 0.0 ± 0.0
AsnThr: 5.764 ± 2.25
AsnVal: 1.441 ± 0.563
AsnTrp: 4.323 ± 1.688
AsnTyr: 1.441 ± 0.563
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 2.882 ± 1.125
ProAsp: 0.0 ± 0.0
ProGlu: 1.441 ± 0.563
ProPhe: 4.323 ± 1.688
ProGly: 4.323 ± 1.688
ProHis: 0.0 ± 0.0
ProIle: 1.441 ± 0.563
ProLys: 5.764 ± 3.534
ProLeu: 8.646 ± 3.375
ProMet: 0.0 ± 0.0
ProAsn: 4.323 ± 1.688
ProPro: 4.323 ± 1.688
ProGln: 0.0 ± 0.0
ProArg: 1.441 ± 0.563
ProSer: 7.205 ± 2.813
ProThr: 2.882 ± 1.125
ProVal: 2.882 ± 1.125
ProTrp: 4.323 ± 1.688
ProTyr: 4.323 ± 1.688
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 0.0 ± 0.0
GlnAsp: 4.323 ± 1.688
GlnGlu: 7.205 ± 2.971
GlnPhe: 0.0 ± 0.0
GlnGly: 7.205 ± 2.813
GlnHis: 2.882 ± 1.125
GlnIle: 0.0 ± 0.0
GlnLys: 2.882 ± 1.125
GlnLeu: 1.441 ± 0.563
GlnMet: 0.0 ± 0.0
GlnAsn: 8.646 ± 3.375
GlnPro: 0.0 ± 0.0
GlnGln: 2.882 ± 1.125
GlnArg: 4.323 ± 4.096
GlnSer: 0.0 ± 0.0
GlnThr: 0.0 ± 0.0
GlnVal: 1.441 ± 0.563
GlnTrp: 5.764 ± 2.25
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.441 ± 0.563
ArgCys: 0.0 ± 0.0
ArgAsp: 2.882 ± 1.125
ArgGlu: 2.882 ± 1.125
ArgPhe: 4.323 ± 1.688
ArgGly: 1.441 ± 0.563
ArgHis: 2.882 ± 1.125
ArgIle: 0.0 ± 0.0
ArgLys: 7.205 ± 2.813
ArgLeu: 5.764 ± 2.25
ArgMet: 1.441 ± 2.115
ArgAsn: 2.882 ± 1.125
ArgPro: 2.882 ± 1.125
ArgGln: 1.441 ± 5.222
ArgArg: 30.259 ± 11.814
ArgSer: 1.441 ± 0.563
ArgThr: 4.323 ± 1.688
ArgVal: 1.441 ± 0.563
ArgTrp: 4.323 ± 4.096
ArgTyr: 5.764 ± 2.25
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 0.0 ± 0.0
SerCys: 0.0 ± 0.0
SerAsp: 2.882 ± 1.125
SerGlu: 1.441 ± 0.563
SerPhe: 1.441 ± 0.563
SerGly: 2.882 ± 1.125
SerHis: 0.0 ± 0.0
SerIle: 1.441 ± 0.563
SerLys: 4.323 ± 1.688
SerLeu: 4.323 ± 1.688
SerMet: 0.0 ± 0.0
SerAsn: 5.764 ± 2.25
SerPro: 0.0 ± 0.0
SerGln: 8.646 ± 3.375
SerArg: 2.882 ± 1.125
SerSer: 7.205 ± 2.813
SerThr: 4.323 ± 1.688
SerVal: 2.882 ± 1.125
SerTrp: 0.0 ± 0.0
SerTyr: 1.441 ± 0.563
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.882 ± 1.125
ThrCys: 2.882 ± 1.125
ThrAsp: 2.882 ± 4.659
ThrGlu: 7.205 ± 2.813
ThrPhe: 2.882 ± 4.659
ThrGly: 5.764 ± 3.534
ThrHis: 0.0 ± 0.0
ThrIle: 2.882 ± 1.125
ThrLys: 1.441 ± 0.563
ThrLeu: 4.323 ± 1.688
ThrMet: 1.441 ± 0.563
ThrAsn: 4.323 ± 1.688
ThrPro: 1.441 ± 0.563
ThrGln: 4.323 ± 1.688
ThrArg: 2.882 ± 1.125
ThrSer: 2.882 ± 1.125
ThrThr: 5.764 ± 2.25
ThrVal: 2.882 ± 4.659
ThrTrp: 1.441 ± 0.563
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.882 ± 4.659
ValCys: 0.0 ± 0.0
ValAsp: 1.441 ± 0.563
ValGlu: 1.441 ± 0.563
ValPhe: 0.0 ± 0.0
ValGly: 4.323 ± 1.688
ValHis: 0.0 ± 0.0
ValIle: 2.882 ± 1.125
ValLys: 2.882 ± 1.125
ValLeu: 4.323 ± 1.688
ValMet: 1.441 ± 0.563
ValAsn: 0.0 ± 0.0
ValPro: 0.0 ± 0.0
ValGln: 1.441 ± 0.563
ValArg: 2.882 ± 1.125
ValSer: 1.441 ± 0.563
ValThr: 2.882 ± 1.125
ValVal: 2.882 ± 1.125
ValTrp: 0.0 ± 0.0
ValTyr: 1.441 ± 0.563
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 4.323 ± 1.688
TrpCys: 0.0 ± 0.0
TrpAsp: 2.882 ± 1.125
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.441 ± 0.563
TrpGly: 4.323 ± 1.688
TrpHis: 0.0 ± 0.0
TrpIle: 1.441 ± 0.563
TrpLys: 0.0 ± 0.0
TrpLeu: 4.323 ± 4.096
TrpMet: 0.0 ± 0.0
TrpAsn: 1.441 ± 0.563
TrpPro: 5.764 ± 2.25
TrpGln: 0.0 ± 0.0
TrpArg: 7.205 ± 2.813
TrpSer: 2.882 ± 1.125
TrpThr: 2.882 ± 1.125
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.882 ± 1.125
TyrCys: 1.441 ± 5.222
TyrAsp: 2.882 ± 1.125
TyrGlu: 1.441 ± 0.563
TyrPhe: 5.764 ± 2.25
TyrGly: 1.441 ± 0.563
TyrHis: 1.441 ± 0.563
TyrIle: 1.441 ± 0.563
TyrLys: 0.0 ± 0.0
TyrLeu: 0.0 ± 0.0
TyrMet: 0.0 ± 0.0
TyrAsn: 1.441 ± 0.563
TyrPro: 1.441 ± 0.563
TyrGln: 2.882 ± 1.125
TyrArg: 5.764 ± 2.25
TyrSer: 1.441 ± 0.563
TyrThr: 0.0 ± 0.0
TyrVal: 2.882 ± 1.125
TyrTrp: 2.882 ± 1.125
TyrTyr: 1.441 ± 0.563
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (695 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
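One way such a protein-level bootstrap could look is sketched below. This is an assumption-laden illustration rather than the Proteome-pI implementation: it resamples whole proteins with replacement 100 times, recomputes the per mille frequency of one dipeptide in each resample, and reports the standard deviation of those estimates. The function names, the fixed seed, and the use of the population standard deviation are all choices made for this sketch.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide's frequency.

    Proteins, not residues, are the resampling unit. The seed only makes
    the sketch reproducible.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the actual proteome).
print(bootstrap_error(["MKRRARRG", "GGSGGT"], "RR"))
```

With only two proteins, as in this proteome, each resample can take only a few distinct compositions, which may help explain why some reported errors exceed the frequency itself.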

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski