Amino acid dipeptide frequency for Trichomonas vaginalis virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
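The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_per_mille` and `classify`, the use of one-letter codes, and the choice to count overlapping pairs within each protein and normalize by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter

# Uniform expectation for 400 possible dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein and
    normalized by the total number of pairs found.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Hypothetical toy sequences (one-letter codes), not the viral proteome.
toy = ["MKLLV", "LLA"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `classify(4.323)` labels AlaAla as overrepresented and `classify(0.72)` labels AlaPhe as underrepresented. Multiplying a per mille value by 10⁻³ gives the fraction, for example 4.323‰ corresponds to 0.004323.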

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.323 ± 0.062
AlaCys: 1.441 ± 1.038
AlaAsp: 4.323 ± 0.062
AlaGlu: 4.323 ± 0.997
AlaPhe: 0.72 ± 0.519
AlaGly: 3.602 ± 0.581
AlaHis: 0.72 ± 0.54
AlaIle: 6.484 ± 2.554
AlaLys: 2.882 ± 1.1
AlaLeu: 3.602 ± 2.595
AlaMet: 2.161 ± 1.619
AlaAsn: 4.323 ± 0.997
AlaPro: 7.205 ± 1.161
AlaGln: 2.161 ± 0.498
AlaArg: 2.882 ± 0.041
AlaSer: 4.323 ± 0.997
AlaThr: 4.323 ± 0.062
AlaVal: 5.764 ± 1.141
AlaTrp: 0.0 ± 0.0
AlaTyr: 4.323 ± 0.997
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.882 ± 0.041
CysCys: 0.0 ± 0.0
CysAsp: 0.72 ± 0.54
CysGlu: 0.72 ± 0.519
CysPhe: 0.0 ± 0.0
CysGly: 2.882 ± 2.158
CysHis: 0.72 ± 0.519
CysIle: 1.441 ± 1.079
CysLys: 0.72 ± 0.54
CysLeu: 1.441 ± 0.021
CysMet: 0.72 ± 0.54
CysAsn: 1.441 ± 1.079
CysPro: 2.882 ± 0.041
CysGln: 0.72 ± 0.519
CysArg: 0.0 ± 0.0
CysSer: 2.161 ± 0.498
CysThr: 1.441 ± 1.038
CysVal: 0.72 ± 0.54
CysTrp: 0.0 ± 0.0
CysTyr: 2.882 ± 0.041
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.882 ± 1.017
AspCys: 0.0 ± 0.0
AspAsp: 2.882 ± 1.1
AspGlu: 2.882 ± 2.076
AspPhe: 5.764 ± 0.082
AspGly: 0.72 ± 0.519
AspHis: 2.161 ± 0.56
AspIle: 4.323 ± 0.997
AspLys: 2.161 ± 0.56
AspLeu: 5.043 ± 0.601
AspMet: 0.72 ± 0.54
AspAsn: 0.0 ± 0.0
AspPro: 1.441 ± 1.079
AspGln: 0.72 ± 0.519
AspArg: 2.882 ± 1.017
AspSer: 4.323 ± 0.997
AspThr: 5.764 ± 0.082
AspVal: 5.043 ± 2.718
AspTrp: 0.72 ± 0.54
AspTyr: 3.602 ± 2.698
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.602 ± 0.478
GluCys: 2.882 ± 2.158
GluAsp: 1.441 ± 1.038
GluGlu: 0.0 ± 0.0
GluPhe: 3.602 ± 0.478
GluGly: 2.882 ± 1.017
GluHis: 2.882 ± 1.1
GluIle: 2.161 ± 0.498
GluLys: 2.161 ± 0.498
GluLeu: 7.205 ± 2.22
GluMet: 2.882 ± 1.1
GluAsn: 0.72 ± 0.519
GluPro: 2.882 ± 1.017
GluGln: 0.72 ± 0.519
GluArg: 0.72 ± 0.54
GluSer: 2.161 ± 0.56
GluThr: 2.882 ± 2.076
GluVal: 2.882 ± 2.076
GluTrp: 0.72 ± 0.54
GluTyr: 2.161 ± 0.498
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.882 ± 1.017
PheCys: 0.72 ± 0.519
PheAsp: 3.602 ± 1.639
PheGlu: 1.441 ± 0.021
PhePhe: 2.882 ± 1.017
PheGly: 2.161 ± 0.498
PheHis: 2.161 ± 0.498
PheIle: 3.602 ± 1.639
PheLys: 1.441 ± 0.021
PheLeu: 3.602 ± 1.639
PheMet: 0.72 ± 0.519
PheAsn: 2.882 ± 0.041
PhePro: 2.882 ± 1.017
PheGln: 1.441 ± 1.038
PheArg: 2.161 ± 1.557
PheSer: 4.323 ± 0.062
PheThr: 2.882 ± 1.017
PheVal: 2.161 ± 0.498
PheTrp: 0.72 ± 0.519
PheTyr: 2.161 ± 1.557
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.882 ± 1.017
GlyCys: 1.441 ± 0.021
GlyAsp: 2.882 ± 2.158
GlyGlu: 1.441 ± 1.038
GlyPhe: 3.602 ± 1.536
GlyGly: 1.441 ± 0.021
GlyHis: 2.161 ± 0.56
GlyIle: 4.323 ± 2.055
GlyLys: 1.441 ± 1.079
GlyLeu: 6.484 ± 2.554
GlyMet: 1.441 ± 0.348
GlyAsn: 2.161 ± 0.56
GlyPro: 2.882 ± 1.017
GlyGln: 2.161 ± 1.619
GlyArg: 3.602 ± 1.536
GlySer: 2.882 ± 0.041
GlyThr: 2.161 ± 1.557
GlyVal: 5.043 ± 1.516
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.161 ± 0.498
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.161 ± 1.557
HisCys: 0.72 ± 0.54
HisAsp: 0.72 ± 0.519
HisGlu: 2.161 ± 0.498
HisPhe: 0.72 ± 0.54
HisGly: 2.161 ± 0.56
HisHis: 1.441 ± 1.079
HisIle: 2.161 ± 0.56
HisLys: 0.0 ± 0.0
HisLeu: 0.72 ± 0.54
HisMet: 0.72 ± 0.54
HisAsn: 0.0 ± 0.0
HisPro: 2.882 ± 1.1
HisGln: 0.72 ± 0.519
HisArg: 3.602 ± 0.581
HisSer: 1.441 ± 0.021
HisThr: 0.72 ± 0.519
HisVal: 2.161 ± 0.56
HisTrp: 0.72 ± 0.54
HisTyr: 1.441 ± 1.079
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.925 ± 3.818
IleCys: 2.161 ± 0.56
IleAsp: 3.602 ± 1.639
IleGlu: 4.323 ± 2.055
IlePhe: 0.0 ± 0.0
IleGly: 5.043 ± 3.777
IleHis: 0.72 ± 0.519
IleIle: 5.043 ± 0.457
IleLys: 1.441 ± 0.021
IleLeu: 5.043 ± 0.601
IleMet: 0.72 ± 0.811
IleAsn: 2.161 ± 0.56
IlePro: 3.602 ± 0.581
IleGln: 2.161 ± 1.557
IleArg: 2.882 ± 1.017
IleSer: 3.602 ± 0.478
IleThr: 2.882 ± 1.1
IleVal: 2.161 ± 1.619
IleTrp: 0.0 ± 0.0
IleTyr: 3.602 ± 0.478
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.323 ± 0.062
LysCys: 0.0 ± 0.0
LysAsp: 2.161 ± 0.498
LysGlu: 2.161 ± 0.56
LysPhe: 1.441 ± 0.021
LysGly: 2.161 ± 1.557
LysHis: 0.0 ± 0.0
LysIle: 2.882 ± 2.158
LysLys: 2.882 ± 0.041
LysLeu: 6.484 ± 0.622
LysMet: 2.161 ± 0.56
LysAsn: 2.161 ± 0.56
LysPro: 1.441 ± 1.038
LysGln: 2.882 ± 0.041
LysArg: 0.72 ± 0.519
LysSer: 2.882 ± 1.1
LysThr: 1.441 ± 1.079
LysVal: 2.882 ± 0.041
LysTrp: 0.0 ± 0.0
LysTyr: 2.161 ± 1.619
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.484 ± 1.495
LeuCys: 2.882 ± 1.017
LeuAsp: 6.484 ± 1.495
LeuGlu: 5.043 ± 1.66
LeuPhe: 4.323 ± 0.997
LeuGly: 2.161 ± 1.557
LeuHis: 2.161 ± 1.619
LeuIle: 5.764 ± 2.199
LeuLys: 5.043 ± 0.601
LeuLeu: 9.366 ± 0.663
LeuMet: 0.0 ± 0.0
LeuAsn: 7.925 ± 1.701
LeuPro: 6.484 ± 1.68
LeuGln: 4.323 ± 3.114
LeuArg: 8.646 ± 2.241
LeuSer: 4.323 ± 0.062
LeuThr: 3.602 ± 0.581
LeuVal: 4.323 ± 0.062
LeuTrp: 2.161 ± 0.498
LeuTyr: 2.882 ± 1.017
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.441 ± 1.038
MetCys: 0.72 ± 0.519
MetAsp: 3.602 ± 0.581
MetGlu: 1.441 ± 1.079
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 1.441 ± 0.021
MetIle: 2.161 ± 0.56
MetLys: 0.72 ± 0.54
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.72 ± 0.519
MetPro: 2.882 ± 0.041
MetGln: 1.441 ± 0.021
MetArg: 1.441 ± 0.021
MetSer: 2.161 ± 0.56
MetThr: 0.72 ± 0.54
MetVal: 0.72 ± 0.519
MetTrp: 0.0 ± 0.0
MetTyr: 0.72 ± 0.519
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.441 ± 1.079
AsnCys: 1.441 ± 1.079
AsnAsp: 3.602 ± 0.478
AsnGlu: 3.602 ± 0.478
AsnPhe: 2.161 ± 0.498
AsnGly: 2.882 ± 1.017
AsnHis: 0.72 ± 0.519
AsnIle: 5.043 ± 0.457
AsnLys: 5.043 ± 2.718
AsnLeu: 2.161 ± 0.498
AsnMet: 1.441 ± 1.038
AsnAsn: 2.161 ± 0.498
AsnPro: 2.882 ± 0.041
AsnGln: 2.161 ± 0.498
AsnArg: 3.602 ± 0.478
AsnSer: 2.882 ± 1.1
AsnThr: 3.602 ± 1.536
AsnVal: 5.043 ± 0.601
AsnTrp: 0.72 ± 0.54
AsnTyr: 1.441 ± 1.038
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.764 ± 1.141
ProCys: 1.441 ± 0.021
ProAsp: 1.441 ± 1.038
ProGlu: 5.764 ± 0.082
ProPhe: 4.323 ± 0.062
ProGly: 7.205 ± 2.014
ProHis: 1.441 ± 1.079
ProIle: 2.161 ± 1.619
ProLys: 1.441 ± 1.038
ProLeu: 6.484 ± 2.554
ProMet: 0.72 ± 0.519
ProAsn: 2.882 ± 0.041
ProPro: 4.323 ± 1.12
ProGln: 2.882 ± 1.1
ProArg: 2.161 ± 0.56
ProSer: 6.484 ± 4.856
ProThr: 1.441 ± 1.038
ProVal: 4.323 ± 2.055
ProTrp: 0.0 ± 0.0
ProTyr: 4.323 ± 2.179
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.323 ± 1.12
GlnCys: 1.441 ± 0.021
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.441 ± 0.021
GlnPhe: 1.441 ± 0.021
GlnGly: 1.441 ± 1.038
GlnHis: 2.161 ± 0.56
GlnIle: 1.441 ± 1.038
GlnLys: 2.882 ± 0.041
GlnLeu: 3.602 ± 0.478
GlnMet: 0.0 ± 0.0
GlnAsn: 2.161 ± 1.557
GlnPro: 2.882 ± 1.017
GlnGln: 1.441 ± 0.021
GlnArg: 2.882 ± 2.076
GlnSer: 2.882 ± 0.041
GlnThr: 3.602 ± 0.581
GlnVal: 2.161 ± 0.498
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.441 ± 1.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.043 ± 0.457
ArgCys: 0.72 ± 0.519
ArgAsp: 2.882 ± 1.017
ArgGlu: 0.72 ± 0.54
ArgPhe: 2.882 ± 0.041
ArgGly: 2.161 ± 0.498
ArgHis: 2.882 ± 0.041
ArgIle: 1.441 ± 1.079
ArgLys: 3.602 ± 1.536
ArgLeu: 2.882 ± 0.041
ArgMet: 1.441 ± 1.038
ArgAsn: 4.323 ± 0.997
ArgPro: 4.323 ± 0.997
ArgGln: 0.72 ± 0.54
ArgArg: 1.441 ± 1.079
ArgSer: 5.043 ± 1.516
ArgThr: 2.882 ± 1.017
ArgVal: 3.602 ± 0.478
ArgTrp: 0.72 ± 0.54
ArgTyr: 1.441 ± 1.079
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.161 ± 0.498
SerCys: 2.161 ± 0.498
SerAsp: 5.764 ± 2.199
SerGlu: 4.323 ± 0.997
SerPhe: 3.602 ± 0.478
SerGly: 5.764 ± 0.976
SerHis: 1.441 ± 0.021
SerIle: 1.441 ± 0.021
SerLys: 4.323 ± 1.12
SerLeu: 5.764 ± 0.082
SerMet: 0.72 ± 0.54
SerAsn: 4.323 ± 0.997
SerPro: 2.161 ± 0.498
SerGln: 2.882 ± 2.158
SerArg: 3.602 ± 1.639
SerSer: 3.602 ± 0.581
SerThr: 5.043 ± 0.601
SerVal: 10.086 ± 1.203
SerTrp: 0.72 ± 0.54
SerTyr: 1.441 ± 0.021
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.441 ± 1.038
ThrCys: 0.0 ± 0.0
ThrAsp: 4.323 ± 0.062
ThrGlu: 2.161 ± 0.56
ThrPhe: 2.882 ± 1.017
ThrGly: 2.882 ± 2.076
ThrHis: 0.0 ± 0.0
ThrIle: 1.441 ± 0.021
ThrLys: 0.0 ± 0.0
ThrLeu: 7.925 ± 0.416
ThrMet: 2.161 ± 0.498
ThrAsn: 3.602 ± 0.478
ThrPro: 3.602 ± 0.478
ThrGln: 2.882 ± 1.017
ThrArg: 2.161 ± 0.498
ThrSer: 7.205 ± 0.103
ThrThr: 3.602 ± 2.698
ThrVal: 1.441 ± 0.021
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.161 ± 0.56
ThrXaa: 0.72 ± 0.54
Val
ValAla: 4.323 ± 0.997
ValCys: 2.161 ± 1.619
ValAsp: 1.441 ± 1.038
ValGlu: 1.441 ± 1.079
ValPhe: 5.043 ± 0.601
ValGly: 2.882 ± 1.017
ValHis: 1.441 ± 1.038
ValIle: 3.602 ± 1.639
ValLys: 2.882 ± 0.041
ValLeu: 10.086 ± 2.261
ValMet: 2.161 ± 0.56
ValAsn: 5.764 ± 0.082
ValPro: 5.764 ± 2.199
ValGln: 2.882 ± 2.076
ValArg: 2.161 ± 1.557
ValSer: 5.764 ± 0.976
ValThr: 2.882 ± 1.017
ValVal: 2.161 ± 1.557
ValTrp: 2.161 ± 0.56
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.72 ± 0.519
TrpCys: 1.441 ± 1.079
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.72 ± 0.54
TrpPhe: 0.72 ± 0.519
TrpGly: 0.72 ± 0.519
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.441 ± 1.079
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.72 ± 0.54
TrpArg: 2.161 ± 0.498
TrpSer: 0.72 ± 0.54
TrpThr: 0.0 ± 0.0
TrpVal: 0.72 ± 0.54
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.882 ± 1.017
TyrCys: 0.72 ± 0.54
TyrAsp: 2.161 ± 0.56
TyrGlu: 1.441 ± 0.021
TyrPhe: 0.72 ± 0.519
TyrGly: 1.441 ± 1.038
TyrHis: 0.72 ± 0.54
TyrIle: 2.882 ± 2.158
TyrLys: 2.161 ± 0.498
TyrLeu: 5.043 ± 0.601
TyrMet: 0.72 ± 0.519
TyrAsn: 4.323 ± 0.062
TyrPro: 3.602 ± 1.639
TyrGln: 3.602 ± 0.478
TyrArg: 1.441 ± 1.038
TyrSer: 1.441 ± 1.079
TyrThr: 0.72 ± 0.54
TyrVal: 3.602 ± 0.478
TyrTrp: 0.72 ± 0.519
TyrTyr: 4.323 ± 0.997
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.72 ± 0.54
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1389 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
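A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the procedure actually used by the database: the helper names, the fixed random seed, the use of the population standard deviation of the resampled estimates as the error, and the normalization by the total number of overlapping pairs are all choices made for the example.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency by resampling proteins.

    Each round draws len(proteins) whole proteins with replacement,
    recomputes the frequency, and the spread of the n_rounds estimates
    is reported (assumption: population standard deviation).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample).get(dipeptide, 0.0))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences (one-letter codes), not the viral proteome.
toy = ["MKLLV", "LLA"]
error_ll = bootstrap_error(toy, "LL")
```

With only two proteins, as in this proteome, each resample can contain just three distinct combinations, so the resulting error estimates are necessarily coarse.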

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski