Amino acid dipeptide frequency for Torque teno virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
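As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs and compares each frequency with the uniform expectation. It is not the Proteome-pI implementation: the names `dipeptide_frequencies` and `classify`, the two toy sequences, and the choice to count pairs only within a protein (never across protein boundaries) are assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
EXPECTED = 1000.0 / 400            # uniform expectation: 2.5 per mille (0.25%)

def dipeptide_frequencies(proteins):
    """Return per-mille frequencies for all 441 dipeptides (20 standard + X)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue is mapped to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"

# Hypothetical toy sequences, not the Torque teno virus 5 proteins.
freqs = dipeptide_frequencies(["MAAGL", "GGSAX"])
print(freqs["AA"], classify(freqs["AA"]))
print(freqs["WW"], classify(freqs["WW"]))
```

In this sketch, "over" corresponds to the cells marked in red and "under" to those marked in blue.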

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
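The unit conversion can be sketched in a few lines of Python. The helper names `per_mille_to_fraction` and `per_mille_to_percent` are hypothetical and exist only for this example; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1

ala_ala = 8.264  # AlaAla frequency in per mille, taken from the table
print(per_mille_to_fraction(ala_ala))  # roughly 0.008264
print(per_mille_to_percent(ala_ala))   # roughly 0.8264
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰.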

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.264AlaAla: 8.264 ± 2.262
1.377AlaCys: 1.377 ± 2.462
5.51AlaAsp: 5.51 ± 0.465
1.377AlaGlu: 1.377 ± 2.462
0.0AlaPhe: 0.0 ± 0.0
8.264AlaGly: 8.264 ± 11.644
2.755AlaHis: 2.755 ± 4.924
0.0AlaIle: 0.0 ± 0.0
2.755AlaLys: 2.755 ± 1.331
6.887AlaLeu: 6.887 ± 6.055
1.377AlaMet: 1.377 ± 0.591
1.377AlaAsn: 1.377 ± 0.666
4.132AlaPro: 4.132 ± 1.131
4.132AlaGln: 4.132 ± 1.997
2.755AlaArg: 2.755 ± 1.796
1.377AlaSer: 1.377 ± 0.666
2.755AlaThr: 2.755 ± 1.796
4.132AlaVal: 4.132 ± 4.258
1.377AlaTrp: 1.377 ± 0.666
2.755AlaTyr: 2.755 ± 1.331
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
2.755CysCys: 2.755 ± 1.796
1.377CysAsp: 1.377 ± 0.666
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
2.755CysGly: 2.755 ± 4.924
1.377CysHis: 1.377 ± 0.666
1.377CysIle: 1.377 ± 0.666
0.0CysLys: 0.0 ± 0.0
1.377CysLeu: 1.377 ± 0.666
0.0CysMet: 0.0 ± 0.0
1.377CysAsn: 1.377 ± 0.666
1.377CysPro: 1.377 ± 0.666
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.377CysSer: 1.377 ± 0.666
1.377CysThr: 1.377 ± 0.666
1.377CysVal: 1.377 ± 2.462
0.0CysTrp: 0.0 ± 0.0
2.755CysTyr: 2.755 ± 1.331
0.0CysXaa: 0.0 ± 0.0
Asp
5.51AspAla: 5.51 ± 9.847
0.0AspCys: 0.0 ± 0.0
1.377AspAsp: 1.377 ± 0.666
2.755AspGlu: 2.755 ± 1.796
2.755AspPhe: 2.755 ± 1.796
2.755AspGly: 2.755 ± 4.924
0.0AspHis: 0.0 ± 0.0
1.377AspIle: 1.377 ± 0.666
4.132AspLys: 4.132 ± 1.997
11.019AspLeu: 11.019 ± 4.058
0.0AspMet: 0.0 ± 0.0
4.132AspAsn: 4.132 ± 1.997
4.132AspPro: 4.132 ± 1.997
1.377AspGln: 1.377 ± 0.666
1.377AspArg: 1.377 ± 0.666
1.377AspSer: 1.377 ± 0.666
4.132AspThr: 4.132 ± 1.997
2.755AspVal: 2.755 ± 1.796
1.377AspTrp: 1.377 ± 0.666
2.755AspTyr: 2.755 ± 1.331
0.0AspXaa: 0.0 ± 0.0
Glu
5.51GluAla: 5.51 ± 2.662
2.755GluCys: 2.755 ± 1.796
2.755GluAsp: 2.755 ± 1.796
4.132GluGlu: 4.132 ± 1.997
0.0GluPhe: 0.0 ± 0.0
2.755GluGly: 2.755 ± 1.331
0.0GluHis: 0.0 ± 0.0
1.377GluIle: 1.377 ± 0.666
0.0GluLys: 0.0 ± 0.0
2.755GluLeu: 2.755 ± 1.331
0.0GluMet: 0.0 ± 0.0
2.755GluAsn: 2.755 ± 1.331
1.377GluPro: 1.377 ± 2.462
4.132GluGln: 4.132 ± 1.997
2.755GluArg: 2.755 ± 1.796
1.377GluSer: 1.377 ± 0.666
0.0GluThr: 0.0 ± 0.0
2.755GluVal: 2.755 ± 1.331
0.0GluTrp: 0.0 ± 0.0
1.377GluTyr: 1.377 ± 0.666
0.0GluXaa: 0.0 ± 0.0
Phe
1.377PheAla: 1.377 ± 0.666
0.0PheCys: 0.0 ± 0.0
1.377PheAsp: 1.377 ± 0.666
1.377PheGlu: 1.377 ± 0.666
1.377PhePhe: 1.377 ± 2.462
6.887PheGly: 6.887 ± 2.927
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
0.0PheLys: 0.0 ± 0.0
4.132PheLeu: 4.132 ± 1.997
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
2.755PheGln: 2.755 ± 1.331
2.755PheArg: 2.755 ± 1.331
2.755PheSer: 2.755 ± 1.331
1.377PheThr: 1.377 ± 0.666
2.755PheVal: 2.755 ± 1.796
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
6.887GlyAla: 6.887 ± 6.055
1.377GlyCys: 1.377 ± 2.462
12.397GlyAsp: 12.397 ± 12.775
1.377GlyGlu: 1.377 ± 0.666
1.377GlyPhe: 1.377 ± 0.666
12.397GlyGly: 12.397 ± 6.52
2.755GlyHis: 2.755 ± 1.796
4.132GlyIle: 4.132 ± 1.131
4.132GlyLys: 4.132 ± 1.131
2.755GlyLeu: 2.755 ± 1.331
1.377GlyMet: 1.377 ± 0.666
2.755GlyAsn: 2.755 ± 1.331
4.132GlyPro: 4.132 ± 1.997
1.377GlyGln: 1.377 ± 0.666
9.642GlyArg: 9.642 ± 4.723
9.642GlySer: 9.642 ± 1.531
1.377GlyThr: 1.377 ± 0.666
0.0GlyVal: 0.0 ± 0.0
0.0GlyTrp: 0.0 ± 0.0
1.377GlyTyr: 1.377 ± 0.666
0.0GlyXaa: 0.0 ± 0.0
His
4.132HisAla: 4.132 ± 1.131
0.0HisCys: 0.0 ± 0.0
1.377HisAsp: 1.377 ± 2.462
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.377HisIle: 1.377 ± 2.462
1.377HisLys: 1.377 ± 0.666
1.377HisLeu: 1.377 ± 0.666
0.0HisMet: 0.0 ± 0.0
2.755HisAsn: 2.755 ± 1.331
1.377HisPro: 1.377 ± 0.666
2.755HisGln: 2.755 ± 1.796
1.377HisArg: 1.377 ± 2.462
2.755HisSer: 2.755 ± 1.331
1.377HisThr: 1.377 ± 2.462
0.0HisVal: 0.0 ± 0.0
1.377HisTrp: 1.377 ± 0.666
1.377HisTyr: 1.377 ± 0.666
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
1.377IleCys: 1.377 ± 0.666
1.377IleAsp: 1.377 ± 0.666
2.755IleGlu: 2.755 ± 1.796
1.377IlePhe: 1.377 ± 0.666
2.755IleGly: 2.755 ± 1.331
1.377IleHis: 1.377 ± 0.666
2.755IleIle: 2.755 ± 1.331
6.887IleLys: 6.887 ± 3.328
2.755IleLeu: 2.755 ± 1.331
0.0IleMet: 0.0 ± 0.0
2.755IleAsn: 2.755 ± 1.796
1.377IlePro: 1.377 ± 0.666
1.377IleGln: 1.377 ± 0.666
1.377IleArg: 1.377 ± 0.666
2.755IleSer: 2.755 ± 1.331
6.887IleThr: 6.887 ± 3.328
1.377IleVal: 1.377 ± 0.666
2.755IleTrp: 2.755 ± 1.331
1.377IleTyr: 1.377 ± 0.666
0.0IleXaa: 0.0 ± 0.0
Lys
4.132LysAla: 4.132 ± 4.258
2.755LysCys: 2.755 ± 1.331
1.377LysAsp: 1.377 ± 0.666
1.377LysGlu: 1.377 ± 0.666
2.755LysPhe: 2.755 ± 1.331
5.51LysGly: 5.51 ± 2.662
2.755LysHis: 2.755 ± 1.331
4.132LysIle: 4.132 ± 1.997
5.51LysLys: 5.51 ± 0.465
1.377LysLeu: 1.377 ± 0.666
1.377LysMet: 1.377 ± 0.666
1.377LysAsn: 1.377 ± 0.666
0.0LysPro: 0.0 ± 0.0
5.51LysGln: 5.51 ± 2.662
6.887LysArg: 6.887 ± 0.2
1.377LysSer: 1.377 ± 0.666
4.132LysThr: 4.132 ± 1.131
1.377LysVal: 1.377 ± 2.462
2.755LysTrp: 2.755 ± 1.331
2.755LysTyr: 2.755 ± 1.331
0.0LysXaa: 0.0 ± 0.0
Leu
8.264LeuAla: 8.264 ± 2.262
2.755LeuCys: 2.755 ± 1.331
1.377LeuAsp: 1.377 ± 2.462
1.377LeuGlu: 1.377 ± 0.666
2.755LeuPhe: 2.755 ± 1.331
0.0LeuGly: 0.0 ± 0.0
2.755LeuHis: 2.755 ± 1.331
1.377LeuIle: 1.377 ± 0.666
5.51LeuLys: 5.51 ± 0.465
6.887LeuLeu: 6.887 ± 2.927
2.755LeuMet: 2.755 ± 1.331
5.51LeuAsn: 5.51 ± 2.662
8.264LeuPro: 8.264 ± 2.262
8.264LeuGln: 8.264 ± 3.993
4.132LeuArg: 4.132 ± 1.997
8.264LeuSer: 8.264 ± 0.866
4.132LeuThr: 4.132 ± 1.131
5.51LeuVal: 5.51 ± 0.465
0.0LeuTrp: 0.0 ± 0.0
1.377LeuTyr: 1.377 ± 0.666
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
2.755MetPhe: 2.755 ± 1.796
2.755MetGly: 2.755 ± 1.796
0.0MetHis: 0.0 ± 0.0
1.377MetIle: 1.377 ± 0.666
1.377MetLys: 1.377 ± 0.666
2.755MetLeu: 2.755 ± 1.331
0.0MetMet: 0.0 ± 0.0
1.377MetAsn: 1.377 ± 2.462
1.377MetPro: 1.377 ± 0.666
0.0MetGln: 0.0 ± 0.0
1.377MetArg: 1.377 ± 0.666
1.377MetSer: 1.377 ± 0.666
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
1.377MetTrp: 1.377 ± 0.666
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.755AsnAla: 2.755 ± 1.331
0.0AsnCys: 0.0 ± 0.0
2.755AsnAsp: 2.755 ± 1.796
1.377AsnGlu: 1.377 ± 0.666
6.887AsnPhe: 6.887 ± 3.328
1.377AsnGly: 1.377 ± 0.666
0.0AsnHis: 0.0 ± 0.0
0.0AsnIle: 0.0 ± 0.0
4.132AsnLys: 4.132 ± 1.997
0.0AsnLeu: 0.0 ± 0.0
2.755AsnMet: 2.755 ± 1.331
4.132AsnAsn: 4.132 ± 1.997
4.132AsnPro: 4.132 ± 1.997
0.0AsnGln: 0.0 ± 0.0
2.755AsnArg: 2.755 ± 1.796
4.132AsnSer: 4.132 ± 1.997
12.397AsnThr: 12.397 ± 5.99
0.0AsnVal: 0.0 ± 0.0
2.755AsnTrp: 2.755 ± 1.331
2.755AsnTyr: 2.755 ± 1.331
0.0AsnXaa: 0.0 ± 0.0
Pro
5.51ProAla: 5.51 ± 9.847
1.377ProCys: 1.377 ± 0.666
0.0ProAsp: 0.0 ± 0.0
2.755ProGlu: 2.755 ± 1.796
2.755ProPhe: 2.755 ± 1.331
4.132ProGly: 4.132 ± 1.131
1.377ProHis: 1.377 ± 0.666
0.0ProIle: 0.0 ± 0.0
4.132ProLys: 4.132 ± 1.997
5.51ProLeu: 5.51 ± 0.465
1.377ProMet: 1.377 ± 0.666
0.0ProAsn: 0.0 ± 0.0
6.887ProPro: 6.887 ± 9.182
5.51ProGln: 5.51 ± 3.593
2.755ProArg: 2.755 ± 1.331
6.887ProSer: 6.887 ± 0.2
6.887ProThr: 6.887 ± 0.2
2.755ProVal: 2.755 ± 1.331
1.377ProTrp: 1.377 ± 2.462
1.377ProTyr: 1.377 ± 0.666
0.0ProXaa: 0.0 ± 0.0
Gln
1.377GlnAla: 1.377 ± 0.666
2.755GlnCys: 2.755 ± 1.331
0.0GlnAsp: 0.0 ± 0.0
6.887GlnGlu: 6.887 ± 3.328
0.0GlnPhe: 0.0 ± 0.0
4.132GlnGly: 4.132 ± 1.997
0.0GlnHis: 0.0 ± 0.0
1.377GlnIle: 1.377 ± 0.666
5.51GlnLys: 5.51 ± 3.593
9.642GlnLeu: 9.642 ± 4.659
0.0GlnMet: 0.0 ± 0.0
4.132GlnAsn: 4.132 ± 1.997
2.755GlnPro: 2.755 ± 1.796
5.51GlnGln: 5.51 ± 2.662
1.377GlnArg: 1.377 ± 0.666
1.377GlnSer: 1.377 ± 0.666
5.51GlnThr: 5.51 ± 2.662
4.132GlnVal: 4.132 ± 1.997
2.755GlnTrp: 2.755 ± 1.796
1.377GlnTyr: 1.377 ± 0.666
0.0GlnXaa: 0.0 ± 0.0
Arg
1.377ArgAla: 1.377 ± 2.462
1.377ArgCys: 1.377 ± 0.666
6.887ArgAsp: 6.887 ± 3.328
1.377ArgGlu: 1.377 ± 0.666
1.377ArgPhe: 1.377 ± 2.462
6.887ArgGly: 6.887 ± 2.927
1.377ArgHis: 1.377 ± 2.462
2.755ArgIle: 2.755 ± 1.331
2.755ArgLys: 2.755 ± 1.796
0.0ArgLeu: 0.0 ± 0.0
1.377ArgMet: 1.377 ± 0.666
2.755ArgAsn: 2.755 ± 1.331
5.51ArgPro: 5.51 ± 3.593
1.377ArgGln: 1.377 ± 2.462
4.132ArgArg: 4.132 ± 1.997
4.132ArgSer: 4.132 ± 1.131
0.0ArgThr: 0.0 ± 0.0
6.887ArgVal: 6.887 ± 0.2
2.755ArgTrp: 2.755 ± 1.796
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
1.377SerAla: 1.377 ± 0.666
0.0SerCys: 0.0 ± 0.0
2.755SerAsp: 2.755 ± 1.331
4.132SerGlu: 4.132 ± 1.997
0.0SerPhe: 0.0 ± 0.0
5.51SerGly: 5.51 ± 0.465
1.377SerHis: 1.377 ± 2.462
8.264SerIle: 8.264 ± 3.993
2.755SerLys: 2.755 ± 1.331
4.132SerLeu: 4.132 ± 1.131
1.377SerMet: 1.377 ± 2.462
6.887SerAsn: 6.887 ± 3.328
4.132SerPro: 4.132 ± 1.131
11.019SerGln: 11.019 ± 5.324
2.755SerArg: 2.755 ± 1.796
11.019SerSer: 11.019 ± 2.197
4.132SerThr: 4.132 ± 1.997
0.0SerVal: 0.0 ± 0.0
0.0SerTrp: 0.0 ± 0.0
1.377SerTyr: 1.377 ± 0.666
0.0SerXaa: 0.0 ± 0.0
Thr
1.377ThrAla: 1.377 ± 0.666
0.0ThrCys: 0.0 ± 0.0
2.755ThrAsp: 2.755 ± 1.331
4.132ThrGlu: 4.132 ± 1.997
0.0ThrPhe: 0.0 ± 0.0
6.887ThrGly: 6.887 ± 3.328
1.377ThrHis: 1.377 ± 0.666
5.51ThrIle: 5.51 ± 2.662
1.377ThrLys: 1.377 ± 0.666
8.264ThrLeu: 8.264 ± 3.993
1.377ThrMet: 1.377 ± 2.462
5.51ThrAsn: 5.51 ± 2.662
6.887ThrPro: 6.887 ± 9.182
2.755ThrGln: 2.755 ± 1.331
2.755ThrArg: 2.755 ± 1.331
2.755ThrSer: 2.755 ± 1.331
8.264ThrThr: 8.264 ± 0.866
1.377ThrVal: 1.377 ± 0.666
0.0ThrTrp: 0.0 ± 0.0
6.887ThrTyr: 6.887 ± 3.328
0.0ThrXaa: 0.0 ± 0.0
Val
2.755ValAla: 2.755 ± 1.796
0.0ValCys: 0.0 ± 0.0
5.51ValAsp: 5.51 ± 0.465
0.0ValGlu: 0.0 ± 0.0
1.377ValPhe: 1.377 ± 0.666
1.377ValGly: 1.377 ± 2.462
2.755ValHis: 2.755 ± 1.796
1.377ValIle: 1.377 ± 0.666
2.755ValLys: 2.755 ± 1.331
8.264ValLeu: 8.264 ± 0.866
1.377ValMet: 1.377 ± 0.666
0.0ValAsn: 0.0 ± 0.0
2.755ValPro: 2.755 ± 1.331
1.377ValGln: 1.377 ± 0.666
1.377ValArg: 1.377 ± 2.462
1.377ValSer: 1.377 ± 0.666
0.0ValThr: 0.0 ± 0.0
2.755ValVal: 2.755 ± 1.331
1.377ValTrp: 1.377 ± 2.462
2.755ValTyr: 2.755 ± 1.796
0.0ValXaa: 0.0 ± 0.0
Trp
1.377TrpAla: 1.377 ± 0.666
0.0TrpCys: 0.0 ± 0.0
1.377TrpAsp: 1.377 ± 0.666
1.377TrpGlu: 1.377 ± 0.666
0.0TrpPhe: 0.0 ± 0.0
1.377TrpGly: 1.377 ± 0.666
1.377TrpHis: 1.377 ± 0.666
2.755TrpIle: 2.755 ± 1.331
1.377TrpLys: 1.377 ± 0.666
1.377TrpLeu: 1.377 ± 0.666
0.0TrpMet: 0.0 ± 0.0
1.377TrpAsn: 1.377 ± 0.666
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.377TrpArg: 1.377 ± 2.462
2.755TrpSer: 2.755 ± 4.924
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
4.132TrpTyr: 4.132 ± 1.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.377TyrAla: 1.377 ± 0.666
0.0TyrCys: 0.0 ± 0.0
2.755TyrAsp: 2.755 ± 1.331
1.377TyrGlu: 1.377 ± 2.462
1.377TyrPhe: 1.377 ± 0.666
2.755TyrGly: 2.755 ± 1.331
1.377TyrHis: 1.377 ± 0.666
4.132TyrIle: 4.132 ± 1.997
2.755TyrLys: 2.755 ± 1.331
0.0TyrLeu: 0.0 ± 0.0
0.0TyrMet: 0.0 ± 0.0
4.132TyrAsn: 4.132 ± 1.997
2.755TyrPro: 2.755 ± 1.331
1.377TyrGln: 1.377 ± 0.666
1.377TyrArg: 1.377 ± 2.462
4.132TyrSer: 4.132 ± 1.997
5.51TyrThr: 5.51 ± 2.662
1.377TyrVal: 1.377 ± 0.666
0.0TyrTrp: 0.0 ± 0.0
4.132TyrTyr: 4.132 ± 1.997
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (727 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
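A minimal sketch of what protein-level bootstrapping could look like is given below. It is an assumption-laden illustration rather than the procedure used by Proteome-pI: the function names, the fixed random seed, the toy sequences, and the use of the standard deviation across resamples as the error estimate are all choices made for this example.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein only.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resampling unit is the protein, not the individual residue.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the error is reported as the standard deviation of the estimates.
    return statistics.pstdev(estimates)

# Hypothetical toy sequences, not the Torque teno virus 5 proteins.
toy_proteome = ["MAAGL", "GGSAX"]
print(bootstrap_error(toy_proteome, "AA"))
print(bootstrap_error(toy_proteome, "WW"))
```

Under this scheme a dipeptide that is absent from every protein always gets an error of 0.0, because no resample can contain it. With only two proteins the resampled proteomes are few and very different from each other, which is one plausible reason for errors that exceed the frequency itself.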

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski