Amino acid dipeptide frequency for Trichomonas vaginalis virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
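As a rough illustration of how such per mille values can be obtained, the sketch below counts overlapping residue pairs and compares each frequency with the uniform expectation of 2.5‰. It is only a sketch under stated assumptions: the function name `dipeptide_per_mille`, the one-letter sequence encoding, the choice not to count pairs that span two proteins, and the toy sequences are all illustrative and are not taken from the database's own pipeline.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs are counted within each protein and never
        # span the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for 400 possible dipeptides: 2.5 per mille (0.25%).
EXPECTED_PER_MILLE = 1000.0 / 400

# Made-up toy sequences in one-letter code, not the virus proteome.
toy_proteins = ["MKLLA", "ALLK"]
freqs = dipeptide_per_mille(toy_proteins)
for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_PER_MILLE else "under"
    print(f"{pair}: {value:.3f} per mille ({label}-represented)")
```

With only a handful of residues every observed pair is far above 2.5‰, so the comparison becomes meaningful only for realistically sized proteomes.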

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.188 ± 1.295
AlaCys: 1.797 ± 0.521
AlaAsp: 3.145 ± 0.272
AlaGlu: 2.246 ± 1.321
AlaPhe: 4.043 ± 0.778
AlaGly: 7.637 ± 1.031
AlaHis: 1.797 ± 0.521
AlaIle: 6.289 ± 0.543
AlaLys: 0.898 ± 0.528
AlaLeu: 10.332 ± 1.812
AlaMet: 2.246 ± 0.419
AlaAsn: 5.391 ± 0.774
AlaPro: 7.188 ± 0.506
AlaGln: 1.797 ± 0.268
AlaArg: 6.739 ± 0.77
AlaSer: 7.637 ± 1.031
AlaThr: 6.289 ± 0.543
AlaVal: 3.594 ± 1.324
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.145 ± 0.517
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.797 ± 0.268
CysCys: 0.0 ± 0.0
CysAsp: 2.246 ± 0.532
CysGlu: 0.898 ± 0.26
CysPhe: 0.449 ± 0.264
CysGly: 1.348 ± 0.792
CysHis: 0.898 ± 0.26
CysIle: 0.898 ± 0.528
CysLys: 0.449 ± 0.264
CysLeu: 0.898 ± 0.26
CysMet: 0.449 ± 0.264
CysAsn: 0.0 ± 0.0
CysPro: 2.695 ± 0.796
CysGln: 0.898 ± 0.26
CysArg: 0.449 ± 0.264
CysSer: 2.246 ± 0.257
CysThr: 1.348 ± 0.004
CysVal: 1.797 ± 0.521
CysTrp: 0.0 ± 0.0
CysTyr: 1.797 ± 0.521
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.594 ± 0.253
AspCys: 1.348 ± 0.004
AspAsp: 1.797 ± 0.268
AspGlu: 1.797 ± 0.521
AspPhe: 4.043 ± 0.011
AspGly: 4.942 ± 1.038
AspHis: 2.246 ± 0.257
AspIle: 4.043 ± 0.778
AspLys: 0.449 ± 0.264
AspLeu: 2.695 ± 0.796
AspMet: 0.449 ± 0.264
AspAsn: 3.594 ± 0.253
AspPro: 3.594 ± 0.536
AspGln: 0.898 ± 0.26
AspArg: 4.043 ± 0.011
AspSer: 2.246 ± 0.257
AspThr: 2.695 ± 0.007
AspVal: 2.246 ± 0.532
AspTrp: 0.449 ± 0.264
AspTyr: 2.695 ± 0.796
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.695 ± 0.007
GluCys: 0.449 ± 0.264
GluAsp: 0.898 ± 0.26
GluGlu: 1.348 ± 0.004
GluPhe: 2.695 ± 0.796
GluGly: 4.043 ± 0.778
GluHis: 2.695 ± 0.007
GluIle: 2.246 ± 0.257
GluLys: 0.898 ± 0.528
GluLeu: 1.797 ± 1.057
GluMet: 1.348 ± 0.004
GluAsn: 0.0 ± 0.0
GluPro: 0.898 ± 0.26
GluGln: 0.0 ± 0.0
GluArg: 0.898 ± 0.26
GluSer: 2.695 ± 0.007
GluThr: 1.797 ± 0.521
GluVal: 4.492 ± 0.275
GluTrp: 0.0 ± 0.0
GluTyr: 3.145 ± 0.272
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.898 ± 0.528
PheCys: 1.348 ± 0.004
PheAsp: 1.797 ± 0.268
PheGlu: 0.898 ± 0.528
PhePhe: 0.898 ± 0.26
PheGly: 3.594 ± 0.253
PheHis: 1.797 ± 0.521
PheIle: 1.348 ± 0.004
PheLys: 1.797 ± 0.521
PheLeu: 1.797 ± 1.057
PheMet: 0.898 ± 0.26
PheAsn: 2.695 ± 0.007
PhePro: 3.145 ± 0.272
PheGln: 1.797 ± 0.521
PheArg: 1.348 ± 0.004
PheSer: 0.0 ± 0.0
PheThr: 4.043 ± 0.778
PheVal: 3.145 ± 0.272
PheTrp: 1.348 ± 0.004
PheTyr: 2.246 ± 0.257
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.84 ± 1.298
GlyCys: 0.449 ± 0.264
GlyAsp: 3.594 ± 0.253
GlyGlu: 2.695 ± 0.007
GlyPhe: 2.695 ± 0.781
GlyGly: 3.145 ± 1.306
GlyHis: 3.145 ± 0.517
GlyIle: 5.391 ± 1.563
GlyLys: 0.449 ± 0.264
GlyLeu: 4.492 ± 0.513
GlyMet: 0.0 ± 0.0
GlyAsn: 3.594 ± 0.253
GlyPro: 8.086 ± 1.555
GlyGln: 2.246 ± 0.257
GlyArg: 1.348 ± 0.004
GlySer: 4.043 ± 0.011
GlyThr: 4.492 ± 0.513
GlyVal: 4.942 ± 0.249
GlyTrp: 0.898 ± 0.26
GlyTyr: 3.145 ± 0.517
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.246 ± 0.257
HisCys: 0.898 ± 0.528
HisAsp: 1.348 ± 0.004
HisGlu: 1.348 ± 0.004
HisPhe: 0.898 ± 0.26
HisGly: 1.797 ± 0.268
HisHis: 2.695 ± 0.796
HisIle: 2.246 ± 0.257
HisLys: 0.898 ± 0.26
HisLeu: 3.145 ± 1.06
HisMet: 0.0 ± 0.0
HisAsn: 0.898 ± 0.26
HisPro: 2.695 ± 0.007
HisGln: 1.348 ± 0.004
HisArg: 1.348 ± 0.792
HisSer: 4.492 ± 1.302
HisThr: 1.348 ± 0.004
HisVal: 3.594 ± 0.253
HisTrp: 0.0 ± 0.0
HisTyr: 1.348 ± 0.004
HisXaa: 0.0 ± 0.0
Ile
IleAla: 8.985 ± 1.027
IleCys: 0.449 ± 0.264
IleAsp: 6.289 ± 1.034
IleGlu: 0.0 ± 0.0
IlePhe: 2.695 ± 0.781
IleGly: 3.594 ± 0.253
IleHis: 1.348 ± 0.004
IleIle: 4.942 ± 1.038
IleLys: 0.898 ± 0.528
IleLeu: 4.043 ± 0.011
IleMet: 2.695 ± 0.007
IleAsn: 4.942 ± 0.249
IlePro: 1.797 ± 0.268
IleGln: 2.695 ± 0.781
IleArg: 4.492 ± 0.513
IleSer: 3.594 ± 0.536
IleThr: 3.145 ± 1.06
IleVal: 1.348 ± 0.004
IleTrp: 0.449 ± 0.264
IleTyr: 4.942 ± 0.249
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.043 ± 0.011
LysCys: 0.898 ± 0.26
LysAsp: 0.898 ± 0.528
LysGlu: 3.145 ± 0.517
LysPhe: 0.898 ± 0.528
LysGly: 2.246 ± 0.257
LysHis: 0.898 ± 0.26
LysIle: 0.898 ± 0.528
LysLys: 1.797 ± 0.268
LysLeu: 1.797 ± 0.268
LysMet: 1.348 ± 0.004
LysAsn: 0.449 ± 0.264
LysPro: 1.797 ± 1.057
LysGln: 2.246 ± 0.532
LysArg: 1.797 ± 0.268
LysSer: 1.348 ± 0.004
LysThr: 1.797 ± 0.268
LysVal: 2.246 ± 0.532
LysTrp: 0.449 ± 0.264
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.434 ± 0.763
LeuCys: 0.898 ± 0.528
LeuAsp: 5.84 ± 0.279
LeuGlu: 2.695 ± 0.796
LeuPhe: 1.797 ± 0.268
LeuGly: 5.391 ± 0.774
LeuHis: 2.695 ± 0.796
LeuIle: 4.942 ± 1.038
LeuLys: 3.145 ± 1.849
LeuLeu: 10.332 ± 2.132
LeuMet: 1.797 ± 0.793
LeuAsn: 6.739 ± 0.807
LeuPro: 4.942 ± 1.328
LeuGln: 6.289 ± 1.823
LeuArg: 4.492 ± 0.275
LeuSer: 6.739 ± 0.807
LeuThr: 4.043 ± 1.589
LeuVal: 6.289 ± 0.246
LeuTrp: 0.898 ± 0.26
LeuTyr: 3.145 ± 0.272
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.594 ± 0.253
MetCys: 0.898 ± 0.26
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 1.348 ± 0.004
MetGly: 0.898 ± 0.26
MetHis: 0.0 ± 0.0
MetIle: 0.898 ± 0.528
MetLys: 1.797 ± 0.268
MetLeu: 0.898 ± 0.26
MetMet: 0.0 ± 0.0
MetAsn: 0.449 ± 0.264
MetPro: 0.898 ± 0.26
MetGln: 0.898 ± 0.26
MetArg: 2.246 ± 0.257
MetSer: 0.898 ± 0.26
MetThr: 0.898 ± 0.528
MetVal: 1.348 ± 0.004
MetTrp: 0.0 ± 0.0
MetTyr: 1.797 ± 0.521
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.145 ± 0.272
AsnCys: 1.797 ± 1.057
AsnAsp: 3.594 ± 0.253
AsnGlu: 0.898 ± 0.26
AsnPhe: 1.797 ± 0.521
AsnGly: 3.594 ± 1.042
AsnHis: 1.797 ± 0.521
AsnIle: 4.492 ± 1.302
AsnLys: 2.246 ± 0.257
AsnLeu: 4.492 ± 0.513
AsnMet: 0.449 ± 0.264
AsnAsn: 1.348 ± 0.004
AsnPro: 1.797 ± 0.268
AsnGln: 3.594 ± 0.253
AsnArg: 2.695 ± 0.796
AsnSer: 1.797 ± 1.057
AsnThr: 1.797 ± 0.268
AsnVal: 1.797 ± 0.521
AsnTrp: 0.898 ± 0.528
AsnTyr: 2.695 ± 0.007
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.84 ± 0.279
ProCys: 0.898 ± 0.26
ProAsp: 3.594 ± 0.253
ProGlu: 4.043 ± 0.778
ProPhe: 2.246 ± 0.532
ProGly: 4.043 ± 0.778
ProHis: 2.246 ± 0.532
ProIle: 6.739 ± 0.77
ProLys: 5.84 ± 0.51
ProLeu: 7.188 ± 1.071
ProMet: 1.348 ± 0.004
ProAsn: 1.348 ± 0.004
ProPro: 2.695 ± 0.007
ProGln: 4.043 ± 0.8
ProArg: 2.246 ± 0.257
ProSer: 6.289 ± 1.332
ProThr: 3.145 ± 0.272
ProVal: 6.739 ± 0.77
ProTrp: 0.449 ± 0.264
ProTyr: 3.594 ± 0.536
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.492 ± 0.513
GlnCys: 1.348 ± 0.004
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.797 ± 0.268
GlnPhe: 0.449 ± 0.264
GlnGly: 0.898 ± 0.26
GlnHis: 1.797 ± 0.268
GlnIle: 2.246 ± 0.257
GlnLys: 2.246 ± 0.257
GlnLeu: 5.391 ± 1.592
GlnMet: 0.0 ± 0.0
GlnAsn: 0.898 ± 0.26
GlnPro: 5.84 ± 0.51
GlnGln: 1.797 ± 0.521
GlnArg: 0.898 ± 0.26
GlnSer: 2.246 ± 0.257
GlnThr: 4.942 ± 1.038
GlnVal: 2.695 ± 0.781
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.797 ± 0.521
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.289 ± 0.246
ArgCys: 2.246 ± 0.257
ArgAsp: 1.348 ± 0.792
ArgGlu: 0.898 ± 0.26
ArgPhe: 1.348 ± 0.004
ArgGly: 4.043 ± 0.778
ArgHis: 1.797 ± 0.268
ArgIle: 0.898 ± 0.528
ArgLys: 2.695 ± 0.007
ArgLeu: 6.289 ± 1.034
ArgMet: 1.348 ± 0.004
ArgAsn: 2.695 ± 0.007
ArgPro: 7.637 ± 1.031
ArgGln: 1.348 ± 0.004
ArgArg: 3.145 ± 0.272
ArgSer: 3.145 ± 1.06
ArgThr: 2.246 ± 0.257
ArgVal: 0.898 ± 0.528
ArgTrp: 0.898 ± 0.528
ArgTyr: 2.246 ± 1.321
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.188 ± 0.506
SerCys: 1.797 ± 0.521
SerAsp: 4.942 ± 0.539
SerGlu: 1.797 ± 0.268
SerPhe: 4.043 ± 0.011
SerGly: 4.942 ± 0.249
SerHis: 2.695 ± 0.796
SerIle: 4.942 ± 0.249
SerLys: 1.797 ± 0.268
SerLeu: 7.188 ± 0.506
SerMet: 2.246 ± 0.257
SerAsn: 1.348 ± 0.004
SerPro: 2.695 ± 0.796
SerGln: 2.695 ± 0.007
SerArg: 4.492 ± 0.275
SerSer: 4.492 ± 0.275
SerThr: 4.492 ± 0.513
SerVal: 2.695 ± 1.585
SerTrp: 0.449 ± 0.264
SerTyr: 3.594 ± 0.536
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.145 ± 0.272
ThrCys: 0.898 ± 0.26
ThrAsp: 3.594 ± 0.253
ThrGlu: 3.145 ± 0.272
ThrPhe: 1.348 ± 0.004
ThrGly: 3.594 ± 0.253
ThrHis: 1.797 ± 0.521
ThrIle: 3.594 ± 0.536
ThrLys: 0.898 ± 0.26
ThrLeu: 5.84 ± 0.279
ThrMet: 1.348 ± 0.004
ThrAsn: 2.246 ± 0.532
ThrPro: 8.535 ± 1.291
ThrGln: 1.797 ± 0.268
ThrArg: 4.942 ± 0.539
ThrSer: 4.043 ± 0.8
ThrThr: 6.739 ± 0.019
ThrVal: 2.695 ± 0.007
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.348 ± 0.004
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.594 ± 0.536
ValCys: 0.449 ± 0.264
ValAsp: 2.246 ± 0.257
ValGlu: 4.043 ± 0.011
ValPhe: 1.797 ± 0.268
ValGly: 1.348 ± 0.004
ValHis: 0.898 ± 0.26
ValIle: 2.246 ± 0.532
ValLys: 1.348 ± 0.792
ValLeu: 9.434 ± 0.815
ValMet: 0.449 ± 0.264
ValAsn: 4.492 ± 0.513
ValPro: 4.492 ± 0.275
ValGln: 2.695 ± 0.781
ValArg: 3.145 ± 0.517
ValSer: 5.84 ± 1.068
ValThr: 4.492 ± 0.513
ValVal: 2.246 ± 0.257
ValTrp: 0.898 ± 0.26
ValTyr: 0.449 ± 0.264
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.449 ± 0.264
TrpCys: 0.898 ± 0.528
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.898 ± 0.26
TrpHis: 0.0 ± 0.0
TrpIle: 0.898 ± 0.26
TrpLys: 0.0 ± 0.0
TrpLeu: 1.797 ± 0.268
TrpMet: 0.0 ± 0.0
TrpAsn: 2.246 ± 0.257
TrpPro: 0.449 ± 0.264
TrpGln: 0.449 ± 0.264
TrpArg: 0.449 ± 0.264
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.449 ± 0.264
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.942 ± 1.038
TyrCys: 1.797 ± 0.268
TyrAsp: 2.695 ± 0.781
TyrGlu: 1.348 ± 0.004
TyrPhe: 1.348 ± 0.004
TyrGly: 2.695 ± 0.781
TyrHis: 1.348 ± 0.792
TyrIle: 3.145 ± 1.06
TyrLys: 0.449 ± 0.264
TyrLeu: 3.145 ± 0.272
TyrMet: 0.898 ± 0.26
TyrAsn: 1.348 ± 0.004
TyrPro: 2.695 ± 0.796
TyrGln: 2.246 ± 0.532
TyrArg: 2.246 ± 0.532
TyrSer: 6.739 ± 0.77
TyrThr: 1.797 ± 0.268
TyrVal: 1.348 ± 0.004
TyrTrp: 0.898 ± 0.26
TyrTyr: 4.043 ± 0.778
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2227 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
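One possible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is an assumption-laden sketch, not the database's actual procedure; the helper names `pair_frequency` and `bootstrap_error`, the use of the sample standard deviation as the error measure, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Spread of the per mille frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(resample, pair))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Made-up toy sequences in one-letter code, not the virus proteome.
toy_proteins = ["MKLLA", "ALLK", "GGGG"]
print(f"LL: {pair_frequency(toy_proteins, 'LL'):.3f} "
      f"± {bootstrap_error(toy_proteins, 'LL'):.3f} per mille")
```

With only two proteins, as in this proteome, each resample can take just a few distinct compositions, so the resulting error estimate is necessarily coarse.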

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski