Amino acid dipepetide frequency for Trichomonas vaginalis virus 1 (TVV1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.978AlaAla: 6.978 ± 2.316
1.396AlaCys: 1.396 ± 0.964
0.698AlaAsp: 0.698 ± 0.537
0.698AlaGlu: 0.698 ± 0.537
4.187AlaPhe: 4.187 ± 1.186
6.281AlaGly: 6.281 ± 1.279
2.094AlaHis: 2.094 ± 0.593
6.978AlaIle: 6.978 ± 1.297
2.791AlaLys: 2.791 ± 0.111
8.374AlaLeu: 8.374 ± 0.333
2.791AlaMet: 2.791 ± 0.908
3.489AlaAsn: 3.489 ± 0.648
2.791AlaPro: 2.791 ± 1.13
3.489AlaGln: 3.489 ± 2.687
4.187AlaArg: 4.187 ± 1.186
4.885AlaSer: 4.885 ± 0.704
4.187AlaThr: 4.187 ± 0.853
4.885AlaVal: 4.885 ± 1.723
0.0AlaTrp: 0.0 ± 0.0
4.187AlaTyr: 4.187 ± 1.186
0.0AlaXaa: 0.0 ± 0.0
Cys
2.791CysAla: 2.791 ± 0.111
0.0CysCys: 0.0 ± 0.0
1.396CysAsp: 1.396 ± 0.964
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.698CysGly: 0.698 ± 0.482
0.0CysHis: 0.0 ± 0.0
0.698CysIle: 0.698 ± 0.482
2.094CysLys: 2.094 ± 0.426
1.396CysLeu: 1.396 ± 0.055
0.0CysMet: 0.0 ± 0.0
1.396CysAsn: 1.396 ± 0.964
1.396CysPro: 1.396 ± 1.075
0.0CysGln: 0.0 ± 0.0
0.698CysArg: 0.698 ± 0.482
1.396CysSer: 1.396 ± 0.055
0.698CysThr: 0.698 ± 0.482
0.698CysVal: 0.698 ± 0.482
0.0CysTrp: 0.0 ± 0.0
0.698CysTyr: 0.698 ± 0.482
0.0CysXaa: 0.0 ± 0.0
Asp
4.187AspAla: 4.187 ± 0.853
0.0AspCys: 0.0 ± 0.0
4.187AspAsp: 4.187 ± 0.853
0.698AspGlu: 0.698 ± 0.482
3.489AspPhe: 3.489 ± 0.648
2.791AspGly: 2.791 ± 1.13
0.0AspHis: 0.0 ± 0.0
2.791AspIle: 2.791 ± 1.13
1.396AspLys: 1.396 ± 0.055
4.885AspLeu: 4.885 ± 0.315
0.698AspMet: 0.698 ± 0.482
2.094AspAsn: 2.094 ± 0.426
2.791AspPro: 2.791 ± 1.13
0.698AspGln: 0.698 ± 0.482
2.094AspArg: 2.094 ± 0.593
2.791AspSer: 2.791 ± 0.111
2.791AspThr: 2.791 ± 0.111
4.885AspVal: 4.885 ± 0.704
1.396AspTrp: 1.396 ± 0.964
6.281AspTyr: 6.281 ± 2.298
0.0AspXaa: 0.0 ± 0.0
Glu
5.583GluAla: 5.583 ± 0.797
1.396GluCys: 1.396 ± 0.964
0.698GluAsp: 0.698 ± 0.279
1.396GluGlu: 1.396 ± 0.055
3.489GluPhe: 3.489 ± 0.371
2.791GluGly: 2.791 ± 1.13
2.094GluHis: 2.094 ± 0.426
0.0GluIle: 0.0 ± 0.0
2.791GluLys: 2.791 ± 1.13
3.489GluLeu: 3.489 ± 1.39
0.0GluMet: 0.0 ± 0.324
1.396GluAsn: 1.396 ± 0.055
0.698GluPro: 0.698 ± 0.482
0.698GluGln: 0.698 ± 0.482
3.489GluArg: 3.489 ± 0.371
4.187GluSer: 4.187 ± 1.186
5.583GluThr: 5.583 ± 1.241
2.791GluVal: 2.791 ± 0.111
0.0GluTrp: 0.0 ± 0.0
2.094GluTyr: 2.094 ± 0.426
0.0GluXaa: 0.0 ± 0.0
Phe
2.094PheAla: 2.094 ± 0.426
0.0PheCys: 0.0 ± 0.0
3.489PheAsp: 3.489 ± 0.371
3.489PheGlu: 3.489 ± 0.371
2.094PhePhe: 2.094 ± 1.612
2.791PheGly: 2.791 ± 2.149
0.698PheHis: 0.698 ± 0.537
6.281PheIle: 6.281 ± 0.759
2.791PheLys: 2.791 ± 0.908
4.885PheLeu: 4.885 ± 0.315
1.396PheMet: 1.396 ± 0.055
3.489PheAsn: 3.489 ± 0.371
1.396PhePro: 1.396 ± 0.964
0.0PheGln: 0.0 ± 0.0
2.791PheArg: 2.791 ± 0.111
3.489PheSer: 3.489 ± 2.687
3.489PheThr: 3.489 ± 0.648
4.885PheVal: 4.885 ± 1.723
0.0PheTrp: 0.0 ± 0.0
0.698PheTyr: 0.698 ± 0.482
0.0PheXaa: 0.0 ± 0.0
Gly
1.396GlyAla: 1.396 ± 0.055
0.0GlyCys: 0.0 ± 0.0
4.187GlyAsp: 4.187 ± 1.872
3.489GlyGlu: 3.489 ± 0.648
0.698GlyPhe: 0.698 ± 0.537
2.094GlyGly: 2.094 ± 1.612
0.698GlyHis: 0.698 ± 0.482
2.094GlyIle: 2.094 ± 0.593
2.094GlyLys: 2.094 ± 0.593
7.676GlyLeu: 7.676 ± 2.243
2.094GlyMet: 2.094 ± 1.612
2.094GlyAsn: 2.094 ± 0.426
4.187GlyPro: 4.187 ± 1.186
1.396GlyGln: 1.396 ± 0.055
3.489GlyArg: 3.489 ± 0.648
2.094GlySer: 2.094 ± 0.593
5.583GlyThr: 5.583 ± 2.26
4.187GlyVal: 4.187 ± 0.853
2.094GlyTrp: 2.094 ± 0.426
2.791GlyTyr: 2.791 ± 0.111
0.0GlyXaa: 0.0 ± 0.0
His
2.094HisAla: 2.094 ± 0.426
0.698HisCys: 0.698 ± 0.482
2.094HisAsp: 2.094 ± 0.593
1.396HisGlu: 1.396 ± 1.075
1.396HisPhe: 1.396 ± 0.964
2.791HisGly: 2.791 ± 0.111
1.396HisHis: 1.396 ± 0.964
1.396HisIle: 1.396 ± 1.075
0.698HisLys: 0.698 ± 0.482
2.094HisLeu: 2.094 ± 0.593
0.698HisMet: 0.698 ± 0.482
2.094HisAsn: 2.094 ± 0.593
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
2.094HisArg: 2.094 ± 1.446
2.094HisSer: 2.094 ± 0.593
2.094HisThr: 2.094 ± 0.593
1.396HisVal: 1.396 ± 0.055
0.698HisTrp: 0.698 ± 0.537
1.396HisTyr: 1.396 ± 0.964
0.0HisXaa: 0.0 ± 0.0
Ile
4.885IleAla: 4.885 ± 1.723
0.698IleCys: 0.698 ± 0.537
4.885IleAsp: 4.885 ± 0.704
2.791IleGlu: 2.791 ± 0.111
4.885IlePhe: 4.885 ± 1.723
1.396IleGly: 1.396 ± 1.075
0.698IleHis: 0.698 ± 0.537
2.791IleIle: 2.791 ± 0.111
1.396IleLys: 1.396 ± 0.964
5.583IleLeu: 5.583 ± 1.817
0.0IleMet: 0.0 ± 0.0
2.791IleAsn: 2.791 ± 0.908
6.281IlePro: 6.281 ± 1.778
2.791IleGln: 2.791 ± 0.111
1.396IleArg: 1.396 ± 0.055
8.374IleSer: 8.374 ± 1.706
5.583IleThr: 5.583 ± 1.241
2.094IleVal: 2.094 ± 0.593
0.698IleTrp: 0.698 ± 0.537
2.094IleTyr: 2.094 ± 0.426
0.0IleXaa: 0.0 ± 0.0
Lys
3.489LysAla: 3.489 ± 0.371
0.698LysCys: 0.698 ± 0.482
3.489LysAsp: 3.489 ± 0.648
4.885LysGlu: 4.885 ± 1.335
2.094LysPhe: 2.094 ± 0.426
0.698LysGly: 0.698 ± 0.537
4.187LysHis: 4.187 ± 1.872
3.489LysIle: 3.489 ± 2.409
2.791LysLys: 2.791 ± 0.908
4.187LysLeu: 4.187 ± 2.891
2.094LysMet: 2.094 ± 1.446
1.396LysAsn: 1.396 ± 0.055
2.791LysPro: 2.791 ± 0.908
3.489LysGln: 3.489 ± 1.39
2.094LysArg: 2.094 ± 0.593
2.791LysSer: 2.791 ± 0.111
0.698LysThr: 0.698 ± 0.482
2.791LysVal: 2.791 ± 1.13
0.698LysTrp: 0.698 ± 0.482
1.396LysTyr: 1.396 ± 0.055
0.0LysXaa: 0.0 ± 0.0
Leu
5.583LeuAla: 5.583 ± 2.26
0.698LeuCys: 0.698 ± 0.482
3.489LeuAsp: 3.489 ± 1.39
3.489LeuGlu: 3.489 ± 0.371
3.489LeuPhe: 3.489 ± 1.39
4.187LeuGly: 4.187 ± 1.872
3.489LeuHis: 3.489 ± 0.371
5.583LeuIle: 5.583 ± 2.836
6.978LeuLys: 6.978 ± 2.78
11.165LeuLeu: 11.165 ± 4.652
3.489LeuMet: 3.489 ± 1.39
4.187LeuAsn: 4.187 ± 0.166
7.676LeuPro: 7.676 ± 2.243
2.791LeuGln: 2.791 ± 0.908
6.978LeuArg: 6.978 ± 1.297
8.374LeuSer: 8.374 ± 1.706
6.281LeuThr: 6.281 ± 0.26
2.094LeuVal: 2.094 ± 1.446
2.094LeuTrp: 2.094 ± 0.426
2.791LeuTyr: 2.791 ± 1.13
0.0LeuXaa: 0.0 ± 0.0
Met
2.791MetAla: 2.791 ± 0.111
1.396MetCys: 1.396 ± 0.055
0.698MetAsp: 0.698 ± 0.537
1.396MetGlu: 1.396 ± 1.075
0.698MetPhe: 0.698 ± 0.537
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.396MetIle: 1.396 ± 1.075
2.094MetLys: 2.094 ± 0.426
2.094MetLeu: 2.094 ± 0.426
0.0MetMet: 0.0 ± 0.0
1.396MetAsn: 1.396 ± 1.075
0.698MetPro: 0.698 ± 0.537
1.396MetGln: 1.396 ± 0.964
0.698MetArg: 0.698 ± 0.482
3.489MetSer: 3.489 ± 0.648
1.396MetThr: 1.396 ± 0.055
2.094MetVal: 2.094 ± 0.426
1.396MetTrp: 1.396 ± 0.964
0.698MetTyr: 0.698 ± 0.482
0.0MetXaa: 0.0 ± 0.0
Asn
4.187AsnAla: 4.187 ± 1.186
1.396AsnCys: 1.396 ± 0.055
2.791AsnAsp: 2.791 ± 1.13
2.094AsnGlu: 2.094 ± 1.446
1.396AsnPhe: 1.396 ± 0.055
4.187AsnGly: 4.187 ± 1.186
0.0AsnHis: 0.0 ± 0.0
2.791AsnIle: 2.791 ± 1.13
2.791AsnLys: 2.791 ± 0.111
0.0AsnLeu: 0.0 ± 0.0
1.396AsnMet: 1.396 ± 0.055
1.396AsnAsn: 1.396 ± 0.055
2.791AsnPro: 2.791 ± 0.908
0.698AsnGln: 0.698 ± 0.482
0.698AsnArg: 0.698 ± 0.537
5.583AsnSer: 5.583 ± 1.241
3.489AsnThr: 3.489 ± 0.371
3.489AsnVal: 3.489 ± 2.687
1.396AsnTrp: 1.396 ± 0.964
1.396AsnTyr: 1.396 ± 0.964
0.0AsnXaa: 0.0 ± 0.0
Pro
0.698ProAla: 0.698 ± 0.537
0.698ProCys: 0.698 ± 0.482
1.396ProAsp: 1.396 ± 0.055
3.489ProGlu: 3.489 ± 0.648
2.094ProPhe: 2.094 ± 0.593
4.187ProGly: 4.187 ± 1.872
1.396ProHis: 1.396 ± 1.075
6.281ProIle: 6.281 ± 0.26
2.791ProLys: 2.791 ± 1.927
2.094ProLeu: 2.094 ± 0.426
1.396ProMet: 1.396 ± 1.075
3.489ProAsn: 3.489 ± 1.668
2.791ProPro: 2.791 ± 1.927
1.396ProGln: 1.396 ± 0.964
2.791ProArg: 2.791 ± 1.13
4.885ProSer: 4.885 ± 0.315
3.489ProThr: 3.489 ± 1.39
1.396ProVal: 1.396 ± 0.055
1.396ProTrp: 1.396 ± 0.964
2.094ProTyr: 2.094 ± 0.593
0.0ProXaa: 0.0 ± 0.0
Gln
1.396GlnAla: 1.396 ± 0.055
1.396GlnCys: 1.396 ± 0.964
3.489GlnAsp: 3.489 ± 1.39
2.094GlnGlu: 2.094 ± 0.593
0.698GlnPhe: 0.698 ± 0.482
2.791GlnGly: 2.791 ± 2.149
0.698GlnHis: 0.698 ± 0.482
2.094GlnIle: 2.094 ± 0.426
0.0GlnLys: 0.0 ± 0.0
5.583GlnLeu: 5.583 ± 1.817
1.396GlnMet: 1.396 ± 0.055
1.396GlnAsn: 1.396 ± 0.055
1.396GlnPro: 1.396 ± 0.055
1.396GlnGln: 1.396 ± 0.964
1.396GlnArg: 1.396 ± 1.075
2.791GlnSer: 2.791 ± 1.927
2.094GlnThr: 2.094 ± 0.426
0.698GlnVal: 0.698 ± 0.482
0.0GlnTrp: 0.0 ± 0.0
0.698GlnTyr: 0.698 ± 0.537
0.0GlnXaa: 0.0 ± 0.0
Arg
5.583ArgAla: 5.583 ± 0.222
1.396ArgCys: 1.396 ± 0.055
2.094ArgAsp: 2.094 ± 0.426
1.396ArgGlu: 1.396 ± 0.055
2.791ArgPhe: 2.791 ± 1.13
1.396ArgGly: 1.396 ± 0.964
3.489ArgHis: 3.489 ± 0.371
2.791ArgIle: 2.791 ± 1.13
4.187ArgLys: 4.187 ± 0.853
6.281ArgLeu: 6.281 ± 2.298
2.094ArgMet: 2.094 ± 1.612
2.791ArgAsn: 2.791 ± 1.13
0.698ArgPro: 0.698 ± 0.537
0.698ArgGln: 0.698 ± 0.482
0.698ArgArg: 0.698 ± 0.537
2.791ArgSer: 2.791 ± 0.111
4.187ArgThr: 4.187 ± 1.186
2.094ArgVal: 2.094 ± 1.612
0.0ArgTrp: 0.0 ± 0.0
1.396ArgTyr: 1.396 ± 1.075
0.0ArgXaa: 0.0 ± 0.0
Ser
5.583SerAla: 5.583 ± 1.241
1.396SerCys: 1.396 ± 1.075
5.583SerAsp: 5.583 ± 1.019
6.281SerGlu: 6.281 ± 1.279
6.978SerPhe: 6.978 ± 0.277
6.281SerGly: 6.281 ± 3.318
4.885SerHis: 4.885 ± 0.315
2.094SerIle: 2.094 ± 0.593
3.489SerLys: 3.489 ± 0.371
9.072SerLeu: 9.072 ± 1.168
2.094SerMet: 2.094 ± 0.531
2.094SerAsn: 2.094 ± 1.612
2.094SerPro: 2.094 ± 0.426
5.583SerGln: 5.583 ± 0.797
5.583SerArg: 5.583 ± 0.222
9.77SerSer: 9.77 ± 0.631
4.885SerThr: 4.885 ± 1.723
2.791SerVal: 2.791 ± 0.111
1.396SerTrp: 1.396 ± 1.075
0.698SerTyr: 0.698 ± 0.482
0.0SerXaa: 0.0 ± 0.0
Thr
6.978ThrAla: 6.978 ± 2.316
0.0ThrCys: 0.0 ± 0.0
2.094ThrAsp: 2.094 ± 0.593
2.791ThrGlu: 2.791 ± 0.908
2.791ThrPhe: 2.791 ± 0.111
2.791ThrGly: 2.791 ± 1.13
1.396ThrHis: 1.396 ± 1.075
6.978ThrIle: 6.978 ± 2.316
3.489ThrLys: 3.489 ± 1.39
6.281ThrLeu: 6.281 ± 0.759
1.396ThrMet: 1.396 ± 0.055
2.094ThrAsn: 2.094 ± 1.612
2.791ThrPro: 2.791 ± 0.908
2.791ThrGln: 2.791 ± 0.111
3.489ThrArg: 3.489 ± 1.39
8.374ThrSer: 8.374 ± 0.333
3.489ThrThr: 3.489 ± 0.648
9.072ThrVal: 9.072 ± 1.889
0.0ThrTrp: 0.0 ± 0.0
3.489ThrTyr: 3.489 ± 1.39
0.0ThrXaa: 0.0 ± 0.0
Val
4.885ValAla: 4.885 ± 2.742
0.698ValCys: 0.698 ± 0.482
2.791ValAsp: 2.791 ± 1.13
1.396ValGlu: 1.396 ± 1.075
4.885ValPhe: 4.885 ± 0.704
4.187ValGly: 4.187 ± 1.186
0.0ValHis: 0.0 ± 0.0
4.187ValIle: 4.187 ± 1.186
1.396ValLys: 1.396 ± 0.964
4.885ValLeu: 4.885 ± 1.335
0.698ValMet: 0.698 ± 0.537
0.698ValAsn: 0.698 ± 0.537
5.583ValPro: 5.583 ± 0.797
1.396ValGln: 1.396 ± 1.075
2.791ValArg: 2.791 ± 1.13
6.281ValSer: 6.281 ± 0.26
4.885ValThr: 4.885 ± 1.723
0.698ValVal: 0.698 ± 0.537
0.698ValTrp: 0.698 ± 0.482
2.094ValTyr: 2.094 ± 0.426
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
2.094TrpCys: 2.094 ± 0.426
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.698TrpHis: 0.698 ± 0.537
0.0TrpIle: 0.0 ± 0.0
1.396TrpLys: 1.396 ± 0.964
2.094TrpLeu: 2.094 ± 0.426
0.0TrpMet: 0.0 ± 0.0
1.396TrpAsn: 1.396 ± 0.055
0.0TrpPro: 0.0 ± 0.0
2.094TrpGln: 2.094 ± 1.446
0.0TrpArg: 0.0 ± 0.0
1.396TrpSer: 1.396 ± 0.964
1.396TrpThr: 1.396 ± 0.055
1.396TrpVal: 1.396 ± 0.055
1.396TrpTrp: 1.396 ± 0.055
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.187TyrAla: 4.187 ± 0.166
0.0TyrCys: 0.0 ± 0.0
1.396TyrAsp: 1.396 ± 0.964
1.396TyrGlu: 1.396 ± 0.055
2.094TyrPhe: 2.094 ± 0.593
2.094TyrGly: 2.094 ± 0.593
0.698TyrHis: 0.698 ± 0.537
1.396TyrIle: 1.396 ± 0.964
2.791TyrLys: 2.791 ± 0.908
2.791TyrLeu: 2.791 ± 0.111
1.396TyrMet: 1.396 ± 1.075
2.791TyrAsn: 2.791 ± 1.927
1.396TyrPro: 1.396 ± 0.964
0.698TyrGln: 0.698 ± 0.537
1.396TyrArg: 1.396 ± 0.055
3.489TyrSer: 3.489 ± 1.39
6.281TyrThr: 6.281 ± 0.26
0.698TyrVal: 0.698 ± 0.482
0.0TyrTrp: 0.0 ± 0.0
1.396TyrTyr: 1.396 ± 0.055
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1434 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski