Amino acid dipepetide frequency for Golden shiner totivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.67AlaAla: 1.67 ± 0.175
0.418AlaCys: 0.418 ± 0.234
1.67AlaAsp: 1.67 ± 0.175
4.593AlaGlu: 4.593 ± 1.053
3.34AlaPhe: 3.34 ± 1.111
2.088AlaGly: 2.088 ± 0.409
0.418AlaHis: 0.418 ± 0.234
6.681AlaIle: 6.681 ± 0.061
4.175AlaLys: 4.175 ± 0.057
3.758AlaLeu: 3.758 ± 1.346
2.505AlaMet: 2.505 ± 1.526
5.846AlaAsn: 5.846 ± 1.755
0.835AlaPro: 0.835 ± 0.468
1.67AlaGln: 1.67 ± 0.936
1.67AlaArg: 1.67 ± 0.175
3.758AlaSer: 3.758 ± 0.584
4.175AlaThr: 4.175 ± 0.818
3.34AlaVal: 3.34 ± 0.35
1.253AlaTrp: 1.253 ± 0.702
1.67AlaTyr: 1.67 ± 0.175
0.0AlaXaa: 0.0 ± 0.0
Cys
0.418CysAla: 0.418 ± 0.234
0.0CysCys: 0.0 ± 0.0
1.253CysAsp: 1.253 ± 0.702
0.418CysGlu: 0.418 ± 0.234
0.835CysPhe: 0.835 ± 0.468
0.418CysGly: 0.418 ± 0.234
0.418CysHis: 0.418 ± 0.234
1.253CysIle: 1.253 ± 0.059
1.67CysLys: 1.67 ± 0.175
1.253CysLeu: 1.253 ± 0.059
0.0CysMet: 0.0 ± 0.0
0.835CysAsn: 0.835 ± 0.293
0.418CysPro: 0.418 ± 0.234
0.0CysGln: 0.0 ± 0.0
0.418CysArg: 0.418 ± 0.234
0.418CysSer: 0.418 ± 0.234
0.418CysThr: 0.418 ± 0.234
0.418CysVal: 0.418 ± 0.234
0.0CysTrp: 0.0 ± 0.0
0.835CysTyr: 0.835 ± 0.293
0.0CysXaa: 0.0 ± 0.0
Asp
3.34AspAla: 3.34 ± 1.111
0.835AspCys: 0.835 ± 0.468
2.088AspAsp: 2.088 ± 0.352
3.34AspGlu: 3.34 ± 1.111
1.253AspPhe: 1.253 ± 0.059
3.34AspGly: 3.34 ± 0.411
2.088AspHis: 2.088 ± 0.352
5.846AspIle: 5.846 ± 0.994
3.34AspLys: 3.34 ± 0.35
5.01AspLeu: 5.01 ± 0.525
0.418AspMet: 0.418 ± 0.234
5.428AspAsn: 5.428 ± 1.521
0.0AspPro: 0.0 ± 0.0
1.67AspGln: 1.67 ± 0.175
0.835AspArg: 0.835 ± 0.293
2.923AspSer: 2.923 ± 0.645
0.835AspThr: 0.835 ± 0.468
2.088AspVal: 2.088 ± 1.17
2.505AspTrp: 2.505 ± 0.879
3.34AspTyr: 3.34 ± 1.933
0.0AspXaa: 0.0 ± 0.0
Glu
4.593GluAla: 4.593 ± 1.053
0.835GluCys: 0.835 ± 0.293
2.505GluAsp: 2.505 ± 1.405
3.34GluGlu: 3.34 ± 1.111
3.34GluPhe: 3.34 ± 0.411
1.67GluGly: 1.67 ± 0.175
2.505GluHis: 2.505 ± 1.405
6.263GluIle: 6.263 ± 2.578
3.758GluLys: 3.758 ± 0.938
4.175GluLeu: 4.175 ± 0.704
2.088GluMet: 2.088 ± 0.352
4.593GluAsn: 4.593 ± 1.992
2.088GluPro: 2.088 ± 0.409
2.505GluGln: 2.505 ± 1.405
1.253GluArg: 1.253 ± 0.059
3.758GluSer: 3.758 ± 0.177
4.175GluThr: 4.175 ± 1.58
2.088GluVal: 2.088 ± 1.17
0.835GluTrp: 0.835 ± 0.468
2.505GluTyr: 2.505 ± 0.118
0.0GluXaa: 0.0 ± 0.0
Phe
2.923PheAla: 2.923 ± 0.645
1.253PheCys: 1.253 ± 0.702
2.923PheAsp: 2.923 ± 1.406
2.923PheGlu: 2.923 ± 0.116
0.418PhePhe: 0.418 ± 0.527
3.758PheGly: 3.758 ± 0.177
0.418PheHis: 0.418 ± 0.234
1.253PheIle: 1.253 ± 0.059
2.505PheLys: 2.505 ± 1.64
2.923PheLeu: 2.923 ± 0.645
1.253PheMet: 1.253 ± 0.059
1.253PheAsn: 1.253 ± 0.82
3.34PhePro: 3.34 ± 1.111
1.67PheGln: 1.67 ± 0.175
0.835PheArg: 0.835 ± 0.293
2.505PheSer: 2.505 ± 0.643
2.505PheThr: 2.505 ± 0.643
2.505PheVal: 2.505 ± 0.879
0.835PheTrp: 0.835 ± 0.293
2.088PheTyr: 2.088 ± 0.352
0.0PheXaa: 0.0 ± 0.0
Gly
1.253GlyAla: 1.253 ± 0.059
0.0GlyCys: 0.0 ± 0.0
2.505GlyAsp: 2.505 ± 0.879
3.34GlyGlu: 3.34 ± 0.411
2.923GlyPhe: 2.923 ± 1.406
3.34GlyGly: 3.34 ± 0.411
1.67GlyHis: 1.67 ± 2.109
3.758GlyIle: 3.758 ± 0.938
2.923GlyLys: 2.923 ± 0.645
6.263GlyLeu: 6.263 ± 0.295
0.835GlyMet: 0.835 ± 0.468
4.175GlyAsn: 4.175 ± 0.057
2.923GlyPro: 2.923 ± 1.639
2.088GlyGln: 2.088 ± 1.17
1.67GlyArg: 1.67 ± 0.175
4.593GlySer: 4.593 ± 1.053
1.67GlyThr: 1.67 ± 0.936
2.088GlyVal: 2.088 ± 0.409
0.835GlyTrp: 0.835 ± 1.054
2.088GlyTyr: 2.088 ± 0.352
0.0GlyXaa: 0.0 ± 0.0
His
0.835HisAla: 0.835 ± 0.293
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.253HisGlu: 1.253 ± 0.059
0.835HisPhe: 0.835 ± 0.468
1.67HisGly: 1.67 ± 0.586
0.418HisHis: 0.418 ± 0.234
0.418HisIle: 0.418 ± 0.234
0.835HisLys: 0.835 ± 0.293
2.088HisLeu: 2.088 ± 0.409
0.835HisMet: 0.835 ± 0.468
0.835HisAsn: 0.835 ± 0.468
1.253HisPro: 1.253 ± 0.702
0.835HisGln: 0.835 ± 1.054
2.088HisArg: 2.088 ± 1.113
2.088HisSer: 2.088 ± 0.409
1.67HisThr: 1.67 ± 0.936
1.253HisVal: 1.253 ± 0.82
0.0HisTrp: 0.0 ± 0.0
0.418HisTyr: 0.418 ± 0.527
0.0HisXaa: 0.0 ± 0.0
Ile
4.593IleAla: 4.593 ± 1.053
1.253IleCys: 1.253 ± 0.059
5.428IleAsp: 5.428 ± 0.759
5.428IleGlu: 5.428 ± 1.521
1.67IlePhe: 1.67 ± 0.175
3.758IleGly: 3.758 ± 0.177
0.418IleHis: 0.418 ± 0.527
3.34IleIle: 3.34 ± 1.933
5.846IleLys: 5.846 ± 2.051
4.175IleLeu: 4.175 ± 0.704
1.67IleMet: 1.67 ± 0.936
7.933IleAsn: 7.933 ± 1.642
4.593IlePro: 4.593 ± 0.291
2.923IleGln: 2.923 ± 0.116
4.175IleArg: 4.175 ± 1.465
5.846IleSer: 5.846 ± 0.232
7.516IleThr: 7.516 ± 4.921
4.175IleVal: 4.175 ± 0.704
3.34IleTrp: 3.34 ± 1.172
1.253IleTyr: 1.253 ± 1.581
0.0IleXaa: 0.0 ± 0.0
Lys
2.088LysAla: 2.088 ± 0.409
0.0LysCys: 0.0 ± 0.0
4.593LysAsp: 4.593 ± 1.992
5.01LysGlu: 5.01 ± 2.52
4.175LysPhe: 4.175 ± 2.988
2.923LysGly: 2.923 ± 1.406
0.835LysHis: 0.835 ± 0.293
7.516LysIle: 7.516 ± 4.16
5.428LysLys: 5.428 ± 0.759
6.681LysLeu: 6.681 ± 3.867
0.418LysMet: 0.418 ± 0.527
4.593LysAsn: 4.593 ± 1.231
2.923LysPro: 2.923 ± 1.406
4.175LysGln: 4.175 ± 1.58
4.175LysArg: 4.175 ± 0.057
3.34LysSer: 3.34 ± 0.35
2.088LysThr: 2.088 ± 1.113
6.263LysVal: 6.263 ± 1.228
2.088LysTrp: 2.088 ± 0.352
2.505LysTyr: 2.505 ± 1.64
0.0LysXaa: 0.0 ± 0.0
Leu
5.846LeuAla: 5.846 ± 0.994
1.67LeuCys: 1.67 ± 0.175
2.923LeuAsp: 2.923 ± 1.406
2.923LeuGlu: 2.923 ± 0.645
2.088LeuPhe: 2.088 ± 1.17
7.098LeuGly: 7.098 ± 2.872
2.088LeuHis: 2.088 ± 0.352
7.516LeuIle: 7.516 ± 3.399
5.428LeuLys: 5.428 ± 2.285
5.01LeuLeu: 5.01 ± 1.287
3.34LeuMet: 3.34 ± 0.35
6.263LeuAsn: 6.263 ± 1.056
6.263LeuPro: 6.263 ± 0.295
2.923LeuGln: 2.923 ± 0.877
6.681LeuArg: 6.681 ± 0.701
7.516LeuSer: 7.516 ± 1.115
9.186LeuThr: 9.186 ± 0.583
2.505LeuVal: 2.505 ± 0.118
0.835LeuTrp: 0.835 ± 0.468
2.923LeuTyr: 2.923 ± 0.116
0.0LeuXaa: 0.0 ± 0.0
Met
1.67MetAla: 1.67 ± 0.936
0.835MetCys: 0.835 ± 0.468
0.835MetAsp: 0.835 ± 0.293
0.835MetGlu: 0.835 ± 0.468
0.418MetPhe: 0.418 ± 0.234
0.835MetGly: 0.835 ± 0.468
0.418MetHis: 0.418 ± 0.234
0.835MetIle: 0.835 ± 0.468
2.505MetLys: 2.505 ± 0.118
3.34MetLeu: 3.34 ± 1.111
0.418MetMet: 0.418 ± 0.234
1.253MetAsn: 1.253 ± 0.059
0.835MetPro: 0.835 ± 0.468
0.418MetGln: 0.418 ± 0.234
0.835MetArg: 0.835 ± 0.293
2.088MetSer: 2.088 ± 1.113
2.088MetThr: 2.088 ± 0.409
1.253MetVal: 1.253 ± 0.702
0.418MetTrp: 0.418 ± 0.234
1.67MetTyr: 1.67 ± 0.586
0.0MetXaa: 0.0 ± 0.0
Asn
5.846AsnAla: 5.846 ± 0.994
0.418AsnCys: 0.418 ± 0.527
2.923AsnAsp: 2.923 ± 1.639
6.681AsnGlu: 6.681 ± 0.061
2.505AsnPhe: 2.505 ± 0.118
2.505AsnGly: 2.505 ± 1.405
0.835AsnHis: 0.835 ± 0.293
5.846AsnIle: 5.846 ± 2.051
5.428AsnLys: 5.428 ± 1.524
7.516AsnLeu: 7.516 ± 3.399
2.505AsnMet: 2.505 ± 0.879
2.505AsnAsn: 2.505 ± 0.643
5.01AsnPro: 5.01 ± 0.236
2.088AsnGln: 2.088 ± 1.17
4.175AsnArg: 4.175 ± 1.58
5.846AsnSer: 5.846 ± 0.994
4.175AsnThr: 4.175 ± 1.58
3.758AsnVal: 3.758 ± 1.699
2.088AsnTrp: 2.088 ± 0.409
2.505AsnTyr: 2.505 ± 2.402
0.0AsnXaa: 0.0 ± 0.0
Pro
2.505ProAla: 2.505 ± 1.405
0.835ProCys: 0.835 ± 0.468
1.253ProAsp: 1.253 ± 0.702
2.505ProGlu: 2.505 ± 0.643
2.505ProPhe: 2.505 ± 0.118
2.923ProGly: 2.923 ± 0.877
0.835ProHis: 0.835 ± 0.293
4.175ProIle: 4.175 ± 0.818
3.758ProLys: 3.758 ± 0.177
1.67ProLeu: 1.67 ± 0.175
0.0ProMet: 0.0 ± 0.0
2.923ProAsn: 2.923 ± 1.639
2.505ProPro: 2.505 ± 1.405
3.758ProGln: 3.758 ± 0.177
2.088ProArg: 2.088 ± 1.17
3.34ProSer: 3.34 ± 0.35
6.263ProThr: 6.263 ± 1.989
1.67ProVal: 1.67 ± 0.936
1.253ProTrp: 1.253 ± 0.82
0.418ProTyr: 0.418 ± 0.234
0.0ProXaa: 0.0 ± 0.0
Gln
2.923GlnAla: 2.923 ± 0.877
0.0GlnCys: 0.0 ± 0.0
2.505GlnAsp: 2.505 ± 1.405
0.835GlnGlu: 0.835 ± 0.293
2.923GlnPhe: 2.923 ± 0.645
1.253GlnGly: 1.253 ± 0.059
0.418GlnHis: 0.418 ± 0.234
2.088GlnIle: 2.088 ± 1.17
1.67GlnLys: 1.67 ± 0.175
3.34GlnLeu: 3.34 ± 0.411
1.67GlnMet: 1.67 ± 0.175
2.923GlnAsn: 2.923 ± 1.639
2.505GlnPro: 2.505 ± 0.643
0.835GlnGln: 0.835 ± 1.054
3.758GlnArg: 3.758 ± 1.346
1.253GlnSer: 1.253 ± 0.059
3.758GlnThr: 3.758 ± 0.584
3.758GlnVal: 3.758 ± 1.346
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.088ArgAla: 2.088 ± 0.352
0.418ArgCys: 0.418 ± 0.234
3.34ArgAsp: 3.34 ± 1.111
0.835ArgGlu: 0.835 ± 0.293
1.67ArgPhe: 1.67 ± 0.175
0.418ArgGly: 0.418 ± 0.527
0.835ArgHis: 0.835 ± 0.293
4.175ArgIle: 4.175 ± 2.341
3.758ArgLys: 3.758 ± 0.938
5.01ArgLeu: 5.01 ± 0.236
0.418ArgMet: 0.418 ± 0.234
3.34ArgAsn: 3.34 ± 0.411
2.923ArgPro: 2.923 ± 1.639
1.253ArgGln: 1.253 ± 0.702
1.253ArgArg: 1.253 ± 0.059
1.67ArgSer: 1.67 ± 0.586
2.923ArgThr: 2.923 ± 0.645
4.175ArgVal: 4.175 ± 0.818
0.0ArgTrp: 0.0 ± 0.0
1.67ArgTyr: 1.67 ± 0.586
0.0ArgXaa: 0.0 ± 0.0
Ser
3.34SerAla: 3.34 ± 1.873
0.835SerCys: 0.835 ± 0.468
3.34SerAsp: 3.34 ± 1.873
5.01SerGlu: 5.01 ± 1.287
2.505SerPhe: 2.505 ± 1.405
5.01SerGly: 5.01 ± 0.997
0.835SerHis: 0.835 ± 0.468
5.846SerIle: 5.846 ± 0.529
3.758SerLys: 3.758 ± 1.699
7.933SerLeu: 7.933 ± 1.642
0.0SerMet: 0.0 ± 0.0
5.428SerAsn: 5.428 ± 2.285
1.67SerPro: 1.67 ± 0.936
2.505SerGln: 2.505 ± 0.118
0.835SerArg: 0.835 ± 1.054
3.758SerSer: 3.758 ± 0.938
5.428SerThr: 5.428 ± 0.002
2.923SerVal: 2.923 ± 0.116
2.088SerTrp: 2.088 ± 0.352
3.34SerTyr: 3.34 ± 1.111
0.0SerXaa: 0.0 ± 0.0
Thr
4.593ThrAla: 4.593 ± 1.814
0.835ThrCys: 0.835 ± 0.293
5.01ThrAsp: 5.01 ± 0.236
1.67ThrGlu: 1.67 ± 0.175
2.505ThrPhe: 2.505 ± 0.879
3.758ThrGly: 3.758 ± 0.584
0.835ThrHis: 0.835 ± 0.468
5.01ThrIle: 5.01 ± 0.525
8.351ThrLys: 8.351 ± 3.692
6.681ThrLeu: 6.681 ± 2.223
2.088ThrMet: 2.088 ± 1.17
6.263ThrAsn: 6.263 ± 0.466
1.67ThrPro: 1.67 ± 0.936
2.923ThrGln: 2.923 ± 0.116
2.505ThrArg: 2.505 ± 0.643
5.846ThrSer: 5.846 ± 0.994
6.263ThrThr: 6.263 ± 1.989
2.505ThrVal: 2.505 ± 0.643
1.253ThrTrp: 1.253 ± 0.702
0.835ThrTyr: 0.835 ± 0.293
0.0ThrXaa: 0.0 ± 0.0
Val
2.505ValAla: 2.505 ± 0.118
0.835ValCys: 0.835 ± 0.468
2.923ValAsp: 2.923 ± 0.116
3.758ValGlu: 3.758 ± 0.584
1.253ValPhe: 1.253 ± 0.702
2.505ValGly: 2.505 ± 0.643
1.67ValHis: 1.67 ± 0.936
3.34ValIle: 3.34 ± 0.411
2.923ValLys: 2.923 ± 2.168
7.098ValLeu: 7.098 ± 0.173
1.253ValMet: 1.253 ± 0.702
4.175ValAsn: 4.175 ± 0.818
2.088ValPro: 2.088 ± 0.409
2.088ValGln: 2.088 ± 1.17
2.505ValArg: 2.505 ± 0.643
3.758ValSer: 3.758 ± 0.938
2.923ValThr: 2.923 ± 1.639
3.34ValVal: 3.34 ± 1.111
0.418ValTrp: 0.418 ± 0.527
0.835ValTyr: 0.835 ± 0.293
0.0ValXaa: 0.0 ± 0.0
Trp
1.67TrpAla: 1.67 ± 0.936
0.0TrpCys: 0.0 ± 0.0
1.253TrpAsp: 1.253 ± 0.82
2.923TrpGlu: 2.923 ± 1.406
0.835TrpPhe: 0.835 ± 1.054
0.418TrpGly: 0.418 ± 0.234
0.835TrpHis: 0.835 ± 0.293
2.088TrpIle: 2.088 ± 0.352
0.835TrpLys: 0.835 ± 0.468
1.253TrpLeu: 1.253 ± 0.702
0.0TrpMet: 0.0 ± 0.0
2.088TrpAsn: 2.088 ± 1.113
2.088TrpPro: 2.088 ± 1.17
0.418TrpGln: 0.418 ± 0.527
0.418TrpArg: 0.418 ± 0.234
0.418TrpSer: 0.418 ± 0.527
1.253TrpThr: 1.253 ± 0.059
1.253TrpVal: 1.253 ± 0.059
0.418TrpTrp: 0.418 ± 0.234
0.835TrpTyr: 0.835 ± 0.468
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.253TyrAla: 1.253 ± 0.059
0.418TyrCys: 0.418 ± 0.234
1.253TyrAsp: 1.253 ± 0.702
1.253TyrGlu: 1.253 ± 0.059
2.088TyrPhe: 2.088 ± 1.874
1.253TyrGly: 1.253 ± 0.059
0.835TyrHis: 0.835 ± 0.293
2.088TyrIle: 2.088 ± 1.874
2.923TyrLys: 2.923 ± 2.929
5.846TyrLeu: 5.846 ± 2.813
1.67TyrMet: 1.67 ± 0.349
2.505TyrAsn: 2.505 ± 0.879
1.253TyrPro: 1.253 ± 0.702
1.67TyrGln: 1.67 ± 0.175
0.0TyrArg: 0.0 ± 0.0
1.67TyrSer: 1.67 ± 0.175
2.088TyrThr: 2.088 ± 0.409
0.835TyrVal: 0.835 ± 0.293
0.835TyrTrp: 0.835 ± 0.468
0.835TyrTyr: 0.835 ± 1.054
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2396 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski