Amino acid dipeptide frequency for Polygala garcinii associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
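As a rough illustration of how such frequencies can be obtained, the following Python sketch counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter input format, the mapping of non-standard residues to `X`, and the choice not to count pairs across protein boundaries are assumptions made for this example, not a description of the Proteome-pI pipeline.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted inside each protein only, and any
    non-standard or unknown residue is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not the proteome described on this page.
    for pair, value in sorted(dipeptide_per_mille(["MKKLA", "MKU"]).items()):
        print(pair, round(value, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary; a full 21 x 21 table would fill those cells with 0.0.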

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions (for example, 4.036‰ = 0.004036).
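The uniform expectation and the unit conversion can be checked with a few lines of Python. This is only a sketch: the helper names are hypothetical, and the assumption that the red/blue marking compares each value with the uniform expectation is a guess, since the page does not state the exact threshold.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation.

    Assumption: 'over' corresponds to red and 'under' to blue.
    """
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    print(expected_per_mille())      # 20 amino acids
    print(expected_per_mille(21))    # with Xaa as a 21st symbol
    print(classify(11.1))            # SerSer value from the table
```

With 20 amino acids the expectation is 2.5‰; counting Xaa as a 21st symbol lowers it to about 2.27‰.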

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.036AlaAla: 4.036 ± 2.072
2.018AlaCys: 2.018 ± 0.753
1.009AlaAsp: 1.009 ± 0.735
1.009AlaGlu: 1.009 ± 0.735
1.009AlaPhe: 1.009 ± 0.991
0.0AlaGly: 0.0 ± 0.0
1.009AlaHis: 1.009 ± 0.991
2.018AlaIle: 2.018 ± 1.471
4.036AlaLys: 4.036 ± 0.953
7.064AlaLeu: 7.064 ± 2.507
1.009AlaMet: 1.009 ± 0.86
2.018AlaAsn: 2.018 ± 1.471
2.018AlaPro: 2.018 ± 1.265
3.027AlaGln: 3.027 ± 1.451
5.045AlaArg: 5.045 ± 1.441
2.018AlaSer: 2.018 ± 1.17
1.009AlaThr: 1.009 ± 0.86
4.036AlaVal: 4.036 ± 2.362
1.009AlaTrp: 1.009 ± 0.735
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
1.009CysAla: 1.009 ± 1.075
4.036CysCys: 4.036 ± 2.476
1.009CysAsp: 1.009 ± 0.735
1.009CysGlu: 1.009 ± 0.86
0.0CysPhe: 0.0 ± 0.0
2.018CysGly: 2.018 ± 1.002
2.018CysHis: 2.018 ± 1.17
2.018CysIle: 2.018 ± 1.238
2.018CysLys: 2.018 ± 1.238
1.009CysLeu: 1.009 ± 0.86
1.009CysMet: 1.009 ± 1.075
1.009CysAsn: 1.009 ± 0.735
3.027CysPro: 3.027 ± 2.101
1.009CysGln: 1.009 ± 0.735
1.009CysArg: 1.009 ± 0.735
1.009CysSer: 1.009 ± 0.735
5.045CysThr: 5.045 ± 2.808
1.009CysVal: 1.009 ± 1.075
0.0CysTrp: 0.0 ± 0.0
1.009CysTyr: 1.009 ± 1.045
0.0CysXaa: 0.0 ± 0.0
Asp
2.018AspAla: 2.018 ± 1.002
3.027AspCys: 3.027 ± 2.169
2.018AspAsp: 2.018 ± 0.753
2.018AspGlu: 2.018 ± 1.15
2.018AspPhe: 2.018 ± 1.15
7.064AspGly: 7.064 ± 1.862
3.027AspHis: 3.027 ± 0.823
4.036AspIle: 4.036 ± 3.398
1.009AspLys: 1.009 ± 0.735
3.027AspLeu: 3.027 ± 1.451
1.009AspMet: 1.009 ± 0.86
3.027AspAsn: 3.027 ± 0.822
4.036AspPro: 4.036 ± 1.525
2.018AspGln: 2.018 ± 1.72
6.054AspArg: 6.054 ± 1.965
5.045AspSer: 5.045 ± 1.544
2.018AspThr: 2.018 ± 0.94
5.045AspVal: 5.045 ± 1.593
0.0AspTrp: 0.0 ± 0.0
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.018GluAla: 2.018 ± 1.471
0.0GluCys: 0.0 ± 0.0
1.009GluAsp: 1.009 ± 0.735
9.082GluGlu: 9.082 ± 4.465
1.009GluPhe: 1.009 ± 0.735
6.054GluGly: 6.054 ± 1.758
1.009GluHis: 1.009 ± 1.045
0.0GluIle: 0.0 ± 0.0
3.027GluLys: 3.027 ± 2.206
3.027GluLeu: 3.027 ± 2.101
1.009GluMet: 1.009 ± 0.735
2.018GluAsn: 2.018 ± 0.94
4.036GluPro: 4.036 ± 2.362
0.0GluGln: 0.0 ± 0.0
0.0GluArg: 0.0 ± 0.0
3.027GluSer: 3.027 ± 1.405
3.027GluThr: 3.027 ± 2.101
2.018GluVal: 2.018 ± 1.265
3.027GluTrp: 3.027 ± 0.822
1.009GluTyr: 1.009 ± 0.735
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
5.045PheAsp: 5.045 ± 2.13
2.018PheGlu: 2.018 ± 1.15
2.018PhePhe: 2.018 ± 0.753
2.018PheGly: 2.018 ± 1.72
0.0PheHis: 0.0 ± 0.0
1.009PheIle: 1.009 ± 0.735
2.018PheLys: 2.018 ± 0.94
7.064PheLeu: 7.064 ± 2.534
1.009PheMet: 1.009 ± 0.735
1.009PheAsn: 1.009 ± 1.045
2.018PhePro: 2.018 ± 1.15
6.054PheGln: 6.054 ± 1.85
4.036PheArg: 4.036 ± 2.068
3.027PheSer: 3.027 ± 0.822
3.027PheThr: 3.027 ± 1.262
5.045PheVal: 5.045 ± 2.13
0.0PheTrp: 0.0 ± 0.0
2.018PheTyr: 2.018 ± 1.17
0.0PheXaa: 0.0 ± 0.0
Gly
0.0GlyAla: 0.0 ± 0.0
3.027GlyCys: 3.027 ± 0.822
4.036GlyAsp: 4.036 ± 2.072
3.027GlyGlu: 3.027 ± 1.561
3.027GlyPhe: 3.027 ± 2.286
3.027GlyGly: 3.027 ± 1.214
2.018GlyHis: 2.018 ± 1.002
1.009GlyIle: 1.009 ± 0.735
5.045GlyLys: 5.045 ± 1.882
2.018GlyLeu: 2.018 ± 1.17
1.009GlyMet: 1.009 ± 1.045
3.027GlyAsn: 3.027 ± 2.581
5.045GlyPro: 5.045 ± 2.13
2.018GlyGln: 2.018 ± 0.753
4.036GlyArg: 4.036 ± 1.233
5.045GlySer: 5.045 ± 1.397
3.027GlyThr: 3.027 ± 1.99
4.036GlyVal: 4.036 ± 1.725
0.0GlyTrp: 0.0 ± 0.0
1.009GlyTyr: 1.009 ± 0.991
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
3.027HisCys: 3.027 ± 2.169
4.036HisAsp: 4.036 ± 2.35
1.009HisGlu: 1.009 ± 0.735
1.009HisPhe: 1.009 ± 0.735
4.036HisGly: 4.036 ± 1.958
1.009HisHis: 1.009 ± 0.991
3.027HisIle: 3.027 ± 1.264
2.018HisLys: 2.018 ± 1.523
3.027HisLeu: 3.027 ± 1.262
0.0HisMet: 0.0 ± 0.0
4.036HisAsn: 4.036 ± 1.395
1.009HisPro: 1.009 ± 0.735
1.009HisGln: 1.009 ± 0.991
3.027HisArg: 3.027 ± 1.799
3.027HisSer: 3.027 ± 1.214
3.027HisThr: 3.027 ± 1.799
2.018HisVal: 2.018 ± 0.94
0.0HisTrp: 0.0 ± 0.0
2.018HisTyr: 2.018 ± 1.471
0.0HisXaa: 0.0 ± 0.0
Ile
2.018IleAla: 2.018 ± 1.981
2.018IleCys: 2.018 ± 1.471
6.054IleAsp: 6.054 ± 2.903
0.0IleGlu: 0.0 ± 0.0
3.027IlePhe: 3.027 ± 1.214
2.018IleGly: 2.018 ± 1.238
1.009IleHis: 1.009 ± 1.045
2.018IleIle: 2.018 ± 1.523
2.018IleLys: 2.018 ± 1.265
0.0IleLeu: 0.0 ± 0.0
0.0IleMet: 0.0 ± 0.0
2.018IleAsn: 2.018 ± 0.753
2.018IlePro: 2.018 ± 1.471
2.018IleGln: 2.018 ± 1.471
6.054IleArg: 6.054 ± 2.86
7.064IleSer: 7.064 ± 1.674
4.036IleThr: 4.036 ± 1.725
2.018IleVal: 2.018 ± 0.753
0.0IleTrp: 0.0 ± 0.0
2.018IleTyr: 2.018 ± 1.17
0.0IleXaa: 0.0 ± 0.0
Lys
2.018LysAla: 2.018 ± 1.17
1.009LysCys: 1.009 ± 1.045
3.027LysAsp: 3.027 ± 1.214
6.054LysGlu: 6.054 ± 2.652
4.036LysPhe: 4.036 ± 0.915
3.027LysGly: 3.027 ± 1.44
2.018LysHis: 2.018 ± 0.753
3.027LysIle: 3.027 ± 2.347
5.045LysLys: 5.045 ± 2.605
1.009LysLeu: 1.009 ± 1.045
0.0LysMet: 0.0 ± 0.0
3.027LysAsn: 3.027 ± 2.206
1.009LysPro: 1.009 ± 0.735
1.009LysGln: 1.009 ± 0.86
4.036LysArg: 4.036 ± 2.34
7.064LysSer: 7.064 ± 2.597
2.018LysThr: 2.018 ± 1.471
4.036LysVal: 4.036 ± 2.594
0.0LysTrp: 0.0 ± 0.0
4.036LysTyr: 4.036 ± 2.2
0.0LysXaa: 0.0 ± 0.0
Leu
3.027LeuAla: 3.027 ± 1.307
4.036LeuCys: 4.036 ± 3.133
7.064LeuAsp: 7.064 ± 2.497
2.018LeuGlu: 2.018 ± 1.523
3.027LeuPhe: 3.027 ± 0.822
5.045LeuGly: 5.045 ± 2.004
4.036LeuHis: 4.036 ± 2.3
1.009LeuIle: 1.009 ± 0.735
7.064LeuLys: 7.064 ± 1.879
6.054LeuLeu: 6.054 ± 3.029
1.009LeuMet: 1.009 ± 0.926
3.027LeuAsn: 3.027 ± 2.156
1.009LeuPro: 1.009 ± 0.991
4.036LeuGln: 4.036 ± 1.88
3.027LeuArg: 3.027 ± 2.368
4.036LeuSer: 4.036 ± 0.915
5.045LeuThr: 5.045 ± 2.884
1.009LeuVal: 1.009 ± 0.86
1.009LeuTrp: 1.009 ± 0.735
7.064LeuTyr: 7.064 ± 2.907
0.0LeuXaa: 0.0 ± 0.0
Met
1.009MetAla: 1.009 ± 0.86
0.0MetCys: 0.0 ± 0.0
1.009MetAsp: 1.009 ± 1.045
2.018MetGlu: 2.018 ± 1.471
1.009MetPhe: 1.009 ± 0.86
1.009MetGly: 1.009 ± 0.735
1.009MetHis: 1.009 ± 1.045
0.0MetIle: 0.0 ± 0.0
1.009MetLys: 1.009 ± 0.86
1.009MetLeu: 1.009 ± 1.075
0.0MetMet: 0.0 ± 0.0
1.009MetAsn: 1.009 ± 0.86
1.009MetPro: 1.009 ± 0.735
1.009MetGln: 1.009 ± 0.991
0.0MetArg: 0.0 ± 0.0
0.0MetSer: 0.0 ± 0.0
1.009MetThr: 1.009 ± 0.86
1.009MetVal: 1.009 ± 0.735
1.009MetTrp: 1.009 ± 1.075
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.027AsnAla: 3.027 ± 0.823
0.0AsnCys: 0.0 ± 0.0
1.009AsnAsp: 1.009 ± 0.735
1.009AsnGlu: 1.009 ± 1.045
2.018AsnPhe: 2.018 ± 1.265
2.018AsnGly: 2.018 ± 1.265
7.064AsnHis: 7.064 ± 2.443
2.018AsnIle: 2.018 ± 1.471
1.009AsnLys: 1.009 ± 0.735
3.027AsnLeu: 3.027 ± 0.823
2.018AsnMet: 2.018 ± 0.772
1.009AsnAsn: 1.009 ± 0.735
4.036AsnPro: 4.036 ± 1.032
2.018AsnGln: 2.018 ± 1.17
3.027AsnArg: 3.027 ± 1.44
2.018AsnSer: 2.018 ± 0.753
4.036AsnThr: 4.036 ± 1.88
3.027AsnVal: 3.027 ± 1.214
3.027AsnTrp: 3.027 ± 1.214
2.018AsnTyr: 2.018 ± 1.471
0.0AsnXaa: 0.0 ± 0.0
Pro
2.018ProAla: 2.018 ± 1.265
1.009ProCys: 1.009 ± 0.86
0.0ProAsp: 0.0 ± 0.0
4.036ProGlu: 4.036 ± 2.121
3.027ProPhe: 3.027 ± 1.44
1.009ProGly: 1.009 ± 0.735
4.036ProHis: 4.036 ± 2.214
5.045ProIle: 5.045 ± 1.441
2.018ProLys: 2.018 ± 0.753
5.045ProLeu: 5.045 ± 2.274
0.0ProMet: 0.0 ± 0.928
3.027ProAsn: 3.027 ± 1.852
5.045ProPro: 5.045 ± 2.383
5.045ProGln: 5.045 ± 2.024
6.054ProArg: 6.054 ± 2.976
1.009ProSer: 1.009 ± 0.86
8.073ProThr: 8.073 ± 4.143
4.036ProVal: 4.036 ± 0.953
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.018GlnAla: 2.018 ± 1.471
2.018GlnCys: 2.018 ± 1.471
4.036GlnAsp: 4.036 ± 1.658
1.009GlnGlu: 1.009 ± 0.735
3.027GlnPhe: 3.027 ± 1.451
1.009GlnGly: 1.009 ± 0.735
1.009GlnHis: 1.009 ± 1.045
3.027GlnIle: 3.027 ± 0.823
1.009GlnLys: 1.009 ± 0.86
1.009GlnLeu: 1.009 ± 0.735
0.0GlnMet: 0.0 ± 0.0
3.027GlnAsn: 3.027 ± 0.823
5.045GlnPro: 5.045 ± 2.024
0.0GlnGln: 0.0 ± 0.0
1.009GlnArg: 1.009 ± 0.86
3.027GlnSer: 3.027 ± 1.264
4.036GlnThr: 4.036 ± 1.767
5.045GlnVal: 5.045 ± 2.568
0.0GlnTrp: 0.0 ± 0.0
2.018GlnTyr: 2.018 ± 0.753
0.0GlnXaa: 0.0 ± 0.0
Arg
2.018ArgAla: 2.018 ± 2.09
1.009ArgCys: 1.009 ± 1.075
6.054ArgAsp: 6.054 ± 2.6
1.009ArgGlu: 1.009 ± 0.735
7.064ArgPhe: 7.064 ± 1.562
4.036ArgGly: 4.036 ± 1.352
2.018ArgHis: 2.018 ± 1.396
6.054ArgIle: 6.054 ± 1.85
2.018ArgLys: 2.018 ± 1.981
4.036ArgLeu: 4.036 ± 1.397
0.0ArgMet: 0.0 ± 0.0
4.036ArgAsn: 4.036 ± 1.397
6.054ArgPro: 6.054 ± 0.938
1.009ArgGln: 1.009 ± 0.735
8.073ArgArg: 8.073 ± 2.395
4.036ArgSer: 4.036 ± 2.3
7.064ArgThr: 7.064 ± 2.226
5.045ArgVal: 5.045 ± 1.432
0.0ArgTrp: 0.0 ± 0.0
3.027ArgTyr: 3.027 ± 1.894
0.0ArgXaa: 0.0 ± 0.0
Ser
7.064SerAla: 7.064 ± 2.115
1.009SerCys: 1.009 ± 1.075
5.045SerAsp: 5.045 ± 1.398
2.018SerGlu: 2.018 ± 1.15
3.027SerPhe: 3.027 ± 1.214
1.009SerGly: 1.009 ± 0.86
1.009SerHis: 1.009 ± 0.991
3.027SerIle: 3.027 ± 1.44
1.009SerLys: 1.009 ± 0.86
6.054SerLeu: 6.054 ± 2.508
1.009SerMet: 1.009 ± 0.86
5.045SerAsn: 5.045 ± 1.882
4.036SerPro: 4.036 ± 1.032
4.036SerGln: 4.036 ± 1.861
6.054SerArg: 6.054 ± 2.065
11.1SerSer: 11.1 ± 2.045
5.045SerThr: 5.045 ± 1.771
6.054SerVal: 6.054 ± 2.673
0.0SerTrp: 0.0 ± 0.0
3.027SerTyr: 3.027 ± 1.264
0.0SerXaa: 0.0 ± 0.0
Thr
6.054ThrAla: 6.054 ± 0.938
1.009ThrCys: 1.009 ± 0.735
1.009ThrAsp: 1.009 ± 1.045
2.018ThrGlu: 2.018 ± 1.238
0.0ThrPhe: 0.0 ± 0.0
5.045ThrGly: 5.045 ± 1.441
3.027ThrHis: 3.027 ± 1.852
4.036ThrIle: 4.036 ± 1.016
5.045ThrLys: 5.045 ± 0.961
4.036ThrLeu: 4.036 ± 2.941
1.009ThrMet: 1.009 ± 0.735
4.036ThrAsn: 4.036 ± 1.505
4.036ThrPro: 4.036 ± 0.953
3.027ThrGln: 3.027 ± 1.824
6.054ThrArg: 6.054 ± 2.186
6.054ThrSer: 6.054 ± 2.042
2.018ThrThr: 2.018 ± 1.471
5.045ThrVal: 5.045 ± 2.004
2.018ThrTrp: 2.018 ± 2.09
3.027ThrTyr: 3.027 ± 2.206
0.0ThrXaa: 0.0 ± 0.0
Val
1.009ValAla: 1.009 ± 0.86
3.027ValCys: 3.027 ± 1.44
2.018ValAsp: 2.018 ± 0.94
3.027ValGlu: 3.027 ± 0.993
6.054ValPhe: 6.054 ± 1.647
3.027ValGly: 3.027 ± 2.169
2.018ValHis: 2.018 ± 1.002
4.036ValIle: 4.036 ± 0.915
6.054ValLys: 6.054 ± 1.0
7.064ValLeu: 7.064 ± 2.538
0.0ValMet: 0.0 ± 0.0
2.018ValAsn: 2.018 ± 1.265
3.027ValPro: 3.027 ± 1.44
3.027ValGln: 3.027 ± 1.824
5.045ValArg: 5.045 ± 2.121
4.036ValSer: 4.036 ± 2.594
2.018ValThr: 2.018 ± 1.72
6.054ValVal: 6.054 ± 2.879
1.009ValTrp: 1.009 ± 0.86
4.036ValTyr: 4.036 ± 2.657
0.0ValXaa: 0.0 ± 0.0
Trp
3.027TrpAla: 3.027 ± 2.206
0.0TrpCys: 0.0 ± 0.0
1.009TrpAsp: 1.009 ± 1.075
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
2.018TrpLys: 2.018 ± 0.753
2.018TrpLeu: 2.018 ± 1.72
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.009TrpArg: 1.009 ± 0.991
0.0TrpSer: 0.0 ± 0.0
2.018TrpThr: 2.018 ± 2.09
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.009TrpTyr: 1.009 ± 0.735
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.009TyrAla: 1.009 ± 0.735
0.0TyrCys: 0.0 ± 0.0
2.018TyrAsp: 2.018 ± 1.238
2.018TyrGlu: 2.018 ± 1.265
3.027TyrPhe: 3.027 ± 0.823
2.018TyrGly: 2.018 ± 0.753
3.027TyrHis: 3.027 ± 1.451
1.009TyrIle: 1.009 ± 0.735
2.018TyrLys: 2.018 ± 0.94
6.054TyrLeu: 6.054 ± 0.757
3.027TyrMet: 3.027 ± 0.86
1.009TyrAsn: 1.009 ± 0.735
3.027TyrPro: 3.027 ± 1.264
1.009TyrGln: 1.009 ± 1.045
1.009TyrArg: 1.009 ± 0.86
4.036TyrSer: 4.036 ± 2.015
1.009TyrThr: 1.009 ± 0.735
2.018TyrVal: 2.018 ± 1.17
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (992 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
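A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names, the fixed random seed, and the use of the sample standard deviation as the error are assumptions of this sketch; the page does not say which spread measure Proteome-pI reports.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation of the resampled
    estimates; whole proteins (not residues) are the resampling unit.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not the five proteins behind the table above.
    toy = ["MSSKLA", "MKSSR", "MALLSS", "MTTRK", "MSLSA"]
    print(round(bootstrap_error(toy, "SS"), 3))
```

With only a handful of proteins, each resample can differ a lot from the original set, which is consistent with errors that are large relative to the values themselves.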

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski