Amino acid dipeptide frequency for Torque teno felis virus (isolate Fc-TTV4)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
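The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function names, the use of one-letter codes (with `X` standing in for Xaa), and the choice to count overlapping pairs only within each protein are all assumptions.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1000 / 400 = 2.5 per mille (0.25%).
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: non-standard or unknown residues are pooled as X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2 that never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

Under this sketch, the ArgArg value from the table (22.049‰) would be labelled "over" and AlaAla (1.297‰) "under". The table uses three-letter codes, so a mapping from one-letter to three-letter codes would be needed to reproduce its labels.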

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
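As a small worked example of the unit conversion, the helpers below turn a per mille value into a fraction or a percentage and compare it with the uniform expectation of 2.5‰. The function names are hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value / 10.0


def fold_over_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / (1000.0 / n_dipeptides)


# ArgArg from the table: 22.049 per mille is roughly 2.2% of all dipeptides,
# several times the 2.5 per mille expected under uniformity.
arg_arg_fraction = per_mille_to_fraction(22.049)
arg_arg_fold = fold_over_uniform(22.049)
```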

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.297 ± 0.777
AlaCys: 1.297 ± 0.777
AlaAsp: 2.594 ± 0.804
AlaGlu: 6.485 ± 0.981
AlaPhe: 0.0 ± 0.0
AlaGly: 0.0 ± 0.0
AlaHis: 0.0 ± 0.0
AlaIle: 0.0 ± 0.0
AlaLys: 1.297 ± 0.777
AlaLeu: 3.891 ± 3.012
AlaMet: 0.0 ± 0.0
AlaAsn: 0.0 ± 0.0
AlaPro: 5.188 ± 3.109
AlaGln: 1.297 ± 0.777
AlaArg: 3.891 ± 2.331
AlaSer: 9.079 ± 2.284
AlaThr: 2.594 ± 1.554
AlaVal: 6.485 ± 3.487
AlaTrp: 1.297 ± 0.777
AlaTyr: 1.297 ± 0.777
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.297 ± 0.777
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.297 ± 0.777
CysPhe: 3.891 ± 2.331
CysGly: 2.594 ± 2.089
CysHis: 1.297 ± 0.777
CysIle: 2.594 ± 2.089
CysLys: 1.297 ± 1.331
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.594 ± 1.554
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 2.594 ± 2.089
CysThr: 1.297 ± 0.777
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.594 ± 0.804
AspCys: 1.297 ± 0.777
AspAsp: 10.376 ± 6.315
AspGlu: 6.485 ± 3.487
AspPhe: 2.594 ± 2.089
AspGly: 9.079 ± 3.502
AspHis: 0.0 ± 0.0
AspIle: 2.594 ± 0.804
AspLys: 1.297 ± 0.777
AspLeu: 5.188 ± 0.942
AspMet: 0.0 ± 0.0
AspAsn: 1.297 ± 0.777
AspPro: 11.673 ± 2.497
AspGln: 0.0 ± 0.0
AspArg: 2.594 ± 1.554
AspSer: 9.079 ± 2.202
AspThr: 0.0 ± 0.0
AspVal: 1.297 ± 0.777
AspTrp: 0.0 ± 0.0
AspTyr: 5.188 ± 0.942
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.782 ± 1.499
GluCys: 2.594 ± 2.089
GluAsp: 7.782 ± 4.251
GluGlu: 6.485 ± 3.487
GluPhe: 0.0 ± 0.0
GluGly: 3.891 ± 3.012
GluHis: 2.594 ± 2.089
GluIle: 1.297 ± 1.331
GluLys: 0.0 ± 0.0
GluLeu: 3.891 ± 3.012
GluMet: 2.594 ± 0.804
GluAsn: 1.297 ± 0.777
GluPro: 5.188 ± 1.607
GluGln: 2.594 ± 1.554
GluArg: 2.594 ± 1.554
GluSer: 6.485 ± 1.461
GluThr: 1.297 ± 0.777
GluVal: 2.594 ± 2.089
GluTrp: 0.0 ± 0.0
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 2.594 ± 2.089
PheAsp: 1.297 ± 0.777
PheGlu: 0.0 ± 0.0
PhePhe: 1.297 ± 0.777
PheGly: 1.297 ± 0.777
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 1.297 ± 0.777
PheLeu: 2.594 ± 1.554
PheMet: 2.594 ± 1.117
PheAsn: 0.0 ± 0.0
PhePro: 3.891 ± 1.422
PheGln: 2.594 ± 1.554
PheArg: 2.594 ± 2.089
PheSer: 3.891 ± 0.853
PheThr: 3.891 ± 1.422
PheVal: 0.0 ± 0.0
PheTrp: 1.297 ± 0.777
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.0 ± 0.0
GlyCys: 0.0 ± 0.0
GlyAsp: 14.267 ± 9.743
GlyGlu: 5.188 ± 2.236
GlyPhe: 0.0 ± 0.0
GlyGly: 7.782 ± 2.632
GlyHis: 2.594 ± 1.554
GlyIle: 6.485 ± 1.464
GlyLys: 0.0 ± 0.0
GlyLeu: 0.0 ± 0.0
GlyMet: 1.297 ± 1.331
GlyAsn: 3.891 ± 2.331
GlyPro: 3.891 ± 2.057
GlyGln: 2.594 ± 1.554
GlyArg: 5.188 ± 1.42
GlySer: 2.594 ± 0.804
GlyThr: 7.782 ± 4.663
GlyVal: 2.594 ± 2.089
GlyTrp: 2.594 ± 1.554
GlyTyr: 1.297 ± 0.777
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.297 ± 1.331
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 2.594 ± 2.089
HisGly: 2.594 ± 1.554
HisHis: 1.297 ± 0.777
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.297 ± 0.777
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 2.594 ± 2.089
HisGln: 0.0 ± 0.0
HisArg: 3.891 ± 1.422
HisSer: 5.188 ± 0.942
HisThr: 0.0 ± 0.0
HisVal: 2.594 ± 2.089
HisTrp: 2.594 ± 1.554
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.594 ± 2.089
IleCys: 1.297 ± 0.777
IleAsp: 0.0 ± 0.0
IleGlu: 0.0 ± 0.0
IlePhe: 0.0 ± 0.0
IleGly: 0.0 ± 0.0
IleHis: 1.297 ± 1.331
IleIle: 2.594 ± 0.804
IleLys: 0.0 ± 0.0
IleLeu: 6.485 ± 2.125
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 1.297 ± 0.777
IleGln: 0.0 ± 0.0
IleArg: 5.188 ± 0.942
IleSer: 1.297 ± 1.331
IleThr: 3.891 ± 0.853
IleVal: 1.297 ± 0.777
IleTrp: 1.297 ± 0.777
IleTyr: 2.594 ± 0.804
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.297 ± 0.777
LysCys: 0.0 ± 0.0
LysAsp: 0.0 ± 0.0
LysGlu: 5.188 ± 4.177
LysPhe: 3.891 ± 1.422
LysGly: 0.0 ± 0.0
LysHis: 0.0 ± 0.0
LysIle: 1.297 ± 0.777
LysLys: 0.0 ± 0.0
LysLeu: 2.594 ± 1.554
LysMet: 0.0 ± 0.0
LysAsn: 1.297 ± 0.777
LysPro: 2.594 ± 0.804
LysGln: 5.188 ± 2.236
LysArg: 5.188 ± 0.942
LysSer: 1.297 ± 1.331
LysThr: 6.485 ± 2.125
LysVal: 1.297 ± 0.777
LysTrp: 2.594 ± 0.804
LysTyr: 1.297 ± 0.777
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.0 ± 0.0
LeuCys: 2.594 ± 0.804
LeuAsp: 3.891 ± 2.331
LeuGlu: 5.188 ± 2.236
LeuPhe: 0.0 ± 0.0
LeuGly: 3.891 ± 2.331
LeuHis: 2.594 ± 2.089
LeuIle: 2.594 ± 0.804
LeuLys: 3.891 ± 2.057
LeuLeu: 6.485 ± 2.825
LeuMet: 5.188 ± 1.263
LeuAsn: 1.297 ± 0.777
LeuPro: 2.594 ± 1.554
LeuGln: 2.594 ± 1.554
LeuArg: 5.188 ± 3.109
LeuSer: 6.485 ± 1.461
LeuThr: 1.297 ± 1.331
LeuVal: 3.891 ± 1.422
LeuTrp: 3.891 ± 3.654
LeuTyr: 1.297 ± 1.331
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.594 ± 2.089
MetCys: 1.297 ± 0.777
MetAsp: 0.0 ± 0.0
MetGlu: 1.297 ± 1.331
MetPhe: 1.297 ± 0.777
MetGly: 3.891 ± 3.993
MetHis: 1.297 ± 0.777
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 1.297 ± 1.331
MetPro: 0.0 ± 0.0
MetGln: 1.297 ± 0.777
MetArg: 0.0 ± 0.0
MetSer: 3.891 ± 3.012
MetThr: 1.297 ± 0.777
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 3.891 ± 2.331
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 0.0 ± 0.0
AsnAsp: 1.297 ± 0.777
AsnGlu: 1.297 ± 1.331
AsnPhe: 0.0 ± 0.0
AsnGly: 0.0 ± 0.0
AsnHis: 0.0 ± 0.0
AsnIle: 2.594 ± 1.554
AsnLys: 0.0 ± 0.0
AsnLeu: 3.891 ± 3.012
AsnMet: 2.594 ± 2.662
AsnAsn: 0.0 ± 0.0
AsnPro: 2.594 ± 0.804
AsnGln: 1.297 ± 0.777
AsnArg: 2.594 ± 1.554
AsnSer: 2.594 ± 0.804
AsnThr: 0.0 ± 0.0
AsnVal: 0.0 ± 0.0
AsnTrp: 1.297 ± 0.777
AsnTyr: 1.297 ± 0.777
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 3.891 ± 2.331
ProAsp: 6.485 ± 1.461
ProGlu: 5.188 ± 4.162
ProPhe: 1.297 ± 0.777
ProGly: 6.485 ± 2.125
ProHis: 0.0 ± 0.0
ProIle: 1.297 ± 0.777
ProLys: 3.891 ± 2.331
ProLeu: 5.188 ± 1.607
ProMet: 0.0 ± 0.0
ProAsn: 7.782 ± 4.646
ProPro: 6.485 ± 2.825
ProGln: 2.594 ± 2.662
ProArg: 7.782 ± 2.867
ProSer: 6.485 ± 1.461
ProThr: 5.188 ± 3.109
ProVal: 6.485 ± 0.981
ProTrp: 2.594 ± 0.804
ProTyr: 3.891 ± 2.331
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.594 ± 1.554
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 3.891 ± 1.422
GlnPhe: 0.0 ± 0.0
GlnGly: 2.594 ± 0.804
GlnHis: 3.891 ± 1.422
GlnIle: 0.0 ± 0.0
GlnLys: 3.891 ± 2.331
GlnLeu: 5.188 ± 0.942
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 2.594 ± 0.804
GlnGln: 3.891 ± 2.331
GlnArg: 1.297 ± 0.777
GlnSer: 5.188 ± 0.942
GlnThr: 2.594 ± 0.804
GlnVal: 2.594 ± 1.554
GlnTrp: 1.297 ± 0.777
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.188 ± 3.109
ArgCys: 0.0 ± 0.0
ArgAsp: 3.891 ± 2.331
ArgGlu: 1.297 ± 0.777
ArgPhe: 3.891 ± 1.422
ArgGly: 5.188 ± 3.109
ArgHis: 0.0 ± 0.0
ArgIle: 2.594 ± 1.554
ArgLys: 3.891 ± 1.422
ArgLeu: 5.188 ± 1.607
ArgMet: 0.0 ± 0.0
ArgAsn: 0.0 ± 0.0
ArgPro: 9.079 ± 5.44
ArgGln: 6.485 ± 0.981
ArgArg: 22.049 ± 13.211
ArgSer: 2.594 ± 1.554
ArgThr: 5.188 ± 1.42
ArgVal: 5.188 ± 0.942
ArgTrp: 5.188 ± 3.109
ArgTyr: 2.594 ± 0.804
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.782 ± 2.843
SerCys: 3.891 ± 1.422
SerAsp: 9.079 ± 3.609
SerGlu: 2.594 ± 0.804
SerPhe: 2.594 ± 1.554
SerGly: 6.485 ± 1.461
SerHis: 2.594 ± 2.089
SerIle: 2.594 ± 2.089
SerLys: 2.594 ± 2.662
SerLeu: 3.891 ± 1.422
SerMet: 1.297 ± 0.777
SerAsn: 1.297 ± 1.331
SerPro: 6.485 ± 5.009
SerGln: 5.188 ± 0.942
SerArg: 10.376 ± 4.388
SerSer: 9.079 ± 7.349
SerThr: 3.891 ± 2.057
SerVal: 0.0 ± 0.0
SerTrp: 1.297 ± 0.777
SerTyr: 7.782 ± 0.693
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.188 ± 3.109
ThrCys: 0.0 ± 0.0
ThrAsp: 3.891 ± 2.057
ThrGlu: 1.297 ± 0.777
ThrPhe: 1.297 ± 0.777
ThrGly: 3.891 ± 2.331
ThrHis: 2.594 ± 1.554
ThrIle: 1.297 ± 0.777
ThrLys: 5.188 ± 0.942
ThrLeu: 1.297 ± 0.777
ThrMet: 1.297 ± 0.777
ThrAsn: 2.594 ± 1.554
ThrPro: 5.188 ± 1.42
ThrGln: 2.594 ± 2.089
ThrArg: 2.594 ± 0.804
ThrSer: 3.891 ± 0.853
ThrThr: 1.297 ± 0.777
ThrVal: 3.891 ± 0.853
ThrTrp: 5.188 ± 3.109
ThrTyr: 3.891 ± 3.012
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.594 ± 2.089
ValCys: 0.0 ± 0.0
ValAsp: 2.594 ± 1.554
ValGlu: 2.594 ± 1.554
ValPhe: 0.0 ± 0.0
ValGly: 6.485 ± 3.487
ValHis: 0.0 ± 0.0
ValIle: 0.0 ± 0.0
ValLys: 3.891 ± 1.422
ValLeu: 3.891 ± 2.331
ValMet: 3.891 ± 3.012
ValAsn: 0.0 ± 0.0
ValPro: 5.188 ± 3.109
ValGln: 0.0 ± 0.0
ValArg: 1.297 ± 0.777
ValSer: 5.188 ± 4.177
ValThr: 5.188 ± 0.942
ValVal: 3.891 ± 1.422
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.594 ± 1.554
TrpCys: 0.0 ± 0.0
TrpAsp: 2.594 ± 1.554
TrpGlu: 3.891 ± 0.853
TrpPhe: 3.891 ± 2.331
TrpGly: 3.891 ± 2.331
TrpHis: 1.297 ± 0.777
TrpIle: 1.297 ± 0.777
TrpLys: 2.594 ± 2.089
TrpLeu: 0.0 ± 0.0
TrpMet: 1.297 ± 1.295
TrpAsn: 1.297 ± 1.331
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 3.891 ± 2.331
TrpSer: 1.297 ± 0.777
TrpThr: 1.297 ± 0.777
TrpVal: 1.297 ± 0.777
TrpTrp: 3.891 ± 2.331
TrpTyr: 1.297 ± 0.777
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.297 ± 0.777
TyrCys: 0.0 ± 0.0
TyrAsp: 2.594 ± 1.554
TyrGlu: 1.297 ± 0.777
TyrPhe: 3.891 ± 2.057
TyrGly: 0.0 ± 0.0
TyrHis: 2.594 ± 2.089
TyrIle: 0.0 ± 0.0
TyrLys: 6.485 ± 0.981
TyrLeu: 3.891 ± 2.331
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 3.891 ± 2.057
TyrGln: 1.297 ± 0.777
TyrArg: 1.297 ± 0.777
TyrSer: 2.594 ± 0.804
TyrThr: 3.891 ± 1.422
TyrVal: 1.297 ± 0.777
TyrTrp: 1.297 ± 0.777
TyrTyr: 1.297 ± 0.777
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (772 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
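A protein-level bootstrap of this kind could look roughly like the sketch below. The page does not spell out the exact procedure, so the details here are assumptions: proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the standard deviation of the replicates is taken as the error. The function names and the fixed seed are hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide across the given sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        # Overlapping windows of length 2 within each protein.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation of replicates.
    return statistics.pstdev(estimates)
```

With only 3 proteins, each replicate is drawn from a very small pool, which would be consistent with the large errors relative to the values seen in the table. A dipeptide that never occurs yields 0.0 in every replicate, matching the `0.0 ± 0.0` entries.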

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski