Amino acid dipepetide frequency for East African cassava mosaic virus-KE2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.352AlaAla: 2.352 ± 0.846
1.764AlaCys: 1.764 ± 0.721
1.176AlaAsp: 1.176 ± 0.913
1.764AlaGlu: 1.764 ± 2.095
1.176AlaPhe: 1.176 ± 0.539
0.588AlaGly: 0.588 ± 0.457
2.352AlaHis: 2.352 ± 0.948
2.352AlaIle: 2.352 ± 1.595
4.115AlaLys: 4.115 ± 0.809
6.467AlaLeu: 6.467 ± 1.329
0.0AlaMet: 0.0 ± 0.0
1.176AlaAsn: 1.176 ± 0.631
2.939AlaPro: 2.939 ± 0.864
2.939AlaGln: 2.939 ± 1.042
3.527AlaArg: 3.527 ± 1.827
4.115AlaSer: 4.115 ± 1.771
5.291AlaThr: 5.291 ± 1.656
1.176AlaVal: 1.176 ± 0.848
1.176AlaTrp: 1.176 ± 0.913
1.764AlaTyr: 1.764 ± 0.934
0.0AlaXaa: 0.0 ± 0.0
Cys
0.588CysAla: 0.588 ± 0.675
1.176CysCys: 1.176 ± 1.397
0.588CysAsp: 0.588 ± 0.518
0.588CysGlu: 0.588 ± 0.558
0.588CysPhe: 0.588 ± 0.687
1.176CysGly: 1.176 ± 0.792
0.0CysHis: 0.0 ± 0.0
1.764CysIle: 1.764 ± 1.023
1.764CysLys: 1.764 ± 0.905
2.939CysLeu: 2.939 ± 1.363
0.588CysMet: 0.588 ± 0.46
1.176CysAsn: 1.176 ± 0.539
1.176CysPro: 1.176 ± 1.397
0.0CysGln: 0.0 ± 0.0
1.176CysArg: 1.176 ± 0.601
2.352CysSer: 2.352 ± 1.524
1.176CysThr: 1.176 ± 0.631
1.764CysVal: 1.764 ± 1.158
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.939AspAla: 2.939 ± 1.26
1.176AspCys: 1.176 ± 0.735
2.939AspAsp: 2.939 ± 0.964
1.764AspGlu: 1.764 ± 0.675
2.352AspPhe: 2.352 ± 1.116
2.352AspGly: 2.352 ± 1.203
2.939AspHis: 2.939 ± 1.586
2.939AspIle: 2.939 ± 0.953
1.176AspLys: 1.176 ± 0.845
4.703AspLeu: 4.703 ± 1.347
0.588AspMet: 0.588 ± 0.687
4.115AspAsn: 4.115 ± 1.845
3.527AspPro: 3.527 ± 1.336
1.176AspGln: 1.176 ± 0.8
2.352AspArg: 2.352 ± 1.164
5.291AspSer: 5.291 ± 1.538
0.588AspThr: 0.588 ± 0.675
5.879AspVal: 5.879 ± 1.279
1.176AspTrp: 1.176 ± 0.792
0.588AspTyr: 0.588 ± 0.46
0.0AspXaa: 0.0 ± 0.0
Glu
4.115GluAla: 4.115 ± 2.031
0.588GluCys: 0.588 ± 0.675
2.939GluAsp: 2.939 ± 1.082
2.352GluGlu: 2.352 ± 1.349
2.352GluPhe: 2.352 ± 1.477
4.115GluGly: 4.115 ± 1.326
1.176GluHis: 1.176 ± 0.735
1.176GluIle: 1.176 ± 0.731
2.352GluLys: 2.352 ± 1.304
3.527GluLeu: 3.527 ± 0.923
0.0GluMet: 0.0 ± 0.0
2.939GluAsn: 2.939 ± 1.585
2.352GluPro: 2.352 ± 0.942
3.527GluGln: 3.527 ± 1.645
0.588GluArg: 0.588 ± 0.518
1.764GluSer: 1.764 ± 1.024
2.352GluThr: 2.352 ± 1.444
1.176GluVal: 1.176 ± 0.8
0.588GluTrp: 0.588 ± 0.675
2.939GluTyr: 2.939 ± 1.51
0.0GluXaa: 0.0 ± 0.0
Phe
1.764PheAla: 1.764 ± 0.732
1.176PheCys: 1.176 ± 0.682
2.352PheAsp: 2.352 ± 1.085
1.176PheGlu: 1.176 ± 0.539
3.527PhePhe: 3.527 ± 1.077
1.764PheGly: 1.764 ± 1.092
1.176PheHis: 1.176 ± 0.913
0.588PheIle: 0.588 ± 0.457
2.352PheLys: 2.352 ± 0.738
6.467PheLeu: 6.467 ± 2.235
1.176PheMet: 1.176 ± 0.601
2.939PheAsn: 2.939 ± 1.273
2.352PhePro: 2.352 ± 1.259
2.352PheGln: 2.352 ± 1.826
3.527PheArg: 3.527 ± 1.322
4.703PheSer: 4.703 ± 1.747
2.352PheThr: 2.352 ± 0.944
2.352PheVal: 2.352 ± 1.203
0.588PheTrp: 0.588 ± 0.46
2.939PheTyr: 2.939 ± 1.475
0.0PheXaa: 0.0 ± 0.0
Gly
3.527GlyAla: 3.527 ± 1.543
1.764GlyCys: 1.764 ± 1.014
4.115GlyAsp: 4.115 ± 1.049
4.115GlyGlu: 4.115 ± 1.127
1.176GlyPhe: 1.176 ± 0.986
4.703GlyGly: 4.703 ± 1.942
1.176GlyHis: 1.176 ± 0.913
4.115GlyIle: 4.115 ± 1.177
3.527GlyLys: 3.527 ± 1.457
2.352GlyLeu: 2.352 ± 0.961
2.352GlyMet: 2.352 ± 1.346
2.939GlyAsn: 2.939 ± 1.473
4.115GlyPro: 4.115 ± 2.136
1.764GlyGln: 1.764 ± 0.789
2.352GlyArg: 2.352 ± 0.735
1.176GlySer: 1.176 ± 0.539
2.939GlyThr: 2.939 ± 1.682
3.527GlyVal: 3.527 ± 1.395
0.0GlyTrp: 0.0 ± 0.0
1.176GlyTyr: 1.176 ± 0.602
0.0GlyXaa: 0.0 ± 0.0
His
1.764HisAla: 1.764 ± 0.755
2.352HisCys: 2.352 ± 1.491
1.176HisAsp: 1.176 ± 1.375
1.764HisGlu: 1.764 ± 0.981
1.764HisPhe: 1.764 ± 0.887
1.176HisGly: 1.176 ± 0.986
1.764HisHis: 1.764 ± 2.024
2.939HisIle: 2.939 ± 1.168
0.588HisLys: 0.588 ± 0.687
2.352HisLeu: 2.352 ± 1.349
0.0HisMet: 0.0 ± 0.0
2.939HisAsn: 2.939 ± 1.215
1.764HisPro: 1.764 ± 1.046
3.527HisGln: 3.527 ± 1.369
4.703HisArg: 4.703 ± 1.534
1.764HisSer: 1.764 ± 0.892
2.939HisThr: 2.939 ± 1.645
2.939HisVal: 2.939 ± 1.168
0.0HisTrp: 0.0 ± 0.0
1.176HisTyr: 1.176 ± 0.539
0.0HisXaa: 0.0 ± 0.0
Ile
1.176IleAla: 1.176 ± 0.92
0.0IleCys: 0.0 ± 0.0
2.939IleAsp: 2.939 ± 0.876
1.176IleGlu: 1.176 ± 0.539
2.352IlePhe: 2.352 ± 1.342
2.939IleGly: 2.939 ± 1.149
2.352IleHis: 2.352 ± 1.013
3.527IleIle: 3.527 ± 1.106
5.879IleLys: 5.879 ± 1.06
4.115IleLeu: 4.115 ± 1.845
1.764IleMet: 1.764 ± 0.868
5.291IleAsn: 5.291 ± 1.355
1.176IlePro: 1.176 ± 0.539
3.527IleGln: 3.527 ± 1.39
4.703IleArg: 4.703 ± 1.703
5.291IleSer: 5.291 ± 2.056
4.703IleThr: 4.703 ± 2.455
1.176IleVal: 1.176 ± 0.539
1.764IleTrp: 1.764 ± 1.151
2.939IleTyr: 2.939 ± 1.674
0.0IleXaa: 0.0 ± 0.0
Lys
3.527LysAla: 3.527 ± 1.43
1.176LysCys: 1.176 ± 0.731
2.352LysAsp: 2.352 ± 0.846
3.527LysGlu: 3.527 ± 1.694
2.939LysPhe: 2.939 ± 0.862
4.115LysGly: 4.115 ± 1.203
1.764LysHis: 1.764 ± 0.65
3.527LysIle: 3.527 ± 2.046
1.176LysLys: 1.176 ± 0.696
2.939LysLeu: 2.939 ± 2.008
1.176LysMet: 1.176 ± 0.696
3.527LysAsn: 3.527 ± 1.419
3.527LysPro: 3.527 ± 0.905
1.764LysGln: 1.764 ± 0.905
3.527LysArg: 3.527 ± 1.643
6.467LysSer: 6.467 ± 1.291
2.352LysThr: 2.352 ± 1.115
2.939LysVal: 2.939 ± 1.126
0.0LysTrp: 0.0 ± 0.0
2.939LysTyr: 2.939 ± 0.9
0.0LysXaa: 0.0 ± 0.0
Leu
2.352LeuAla: 2.352 ± 0.678
1.176LeuCys: 1.176 ± 0.913
3.527LeuAsp: 3.527 ± 1.433
5.291LeuGlu: 5.291 ± 1.48
3.527LeuPhe: 3.527 ± 0.94
2.352LeuGly: 2.352 ± 0.926
5.291LeuHis: 5.291 ± 1.765
4.703LeuIle: 4.703 ± 1.409
6.467LeuLys: 6.467 ± 1.422
5.291LeuLeu: 5.291 ± 1.752
0.588LeuMet: 0.588 ± 0.444
5.291LeuAsn: 5.291 ± 1.805
3.527LeuPro: 3.527 ± 1.806
4.115LeuGln: 4.115 ± 1.538
5.879LeuArg: 5.879 ± 1.881
5.291LeuSer: 5.291 ± 1.716
3.527LeuThr: 3.527 ± 1.553
2.939LeuVal: 2.939 ± 1.341
0.0LeuTrp: 0.0 ± 0.0
3.527LeuTyr: 3.527 ± 2.151
0.0LeuXaa: 0.0 ± 0.0
Met
1.176MetAla: 1.176 ± 0.696
0.0MetCys: 0.0 ± 0.0
2.352MetAsp: 2.352 ± 0.998
0.588MetGlu: 0.588 ± 0.703
1.764MetPhe: 1.764 ± 1.253
2.939MetGly: 2.939 ± 1.179
0.0MetHis: 0.0 ± 0.0
1.176MetIle: 1.176 ± 0.731
1.176MetLys: 1.176 ± 0.682
1.176MetLeu: 1.176 ± 0.773
0.0MetMet: 0.0 ± 0.0
0.588MetAsn: 0.588 ± 0.687
2.352MetPro: 2.352 ± 1.017
0.0MetGln: 0.0 ± 0.0
1.764MetArg: 1.764 ± 0.938
1.764MetSer: 1.764 ± 0.89
0.0MetThr: 0.0 ± 0.0
0.588MetVal: 0.588 ± 0.558
1.176MetTrp: 1.176 ± 0.717
2.939MetTyr: 2.939 ± 1.645
0.0MetXaa: 0.0 ± 0.0
Asn
4.115AsnAla: 4.115 ± 1.547
0.0AsnCys: 0.0 ± 0.0
2.939AsnAsp: 2.939 ± 0.944
1.764AsnGlu: 1.764 ± 0.802
1.764AsnPhe: 1.764 ± 1.022
1.764AsnGly: 1.764 ± 1.221
6.467AsnHis: 6.467 ± 3.065
4.115AsnIle: 4.115 ± 1.067
1.764AsnLys: 1.764 ± 0.76
1.764AsnLeu: 1.764 ± 0.987
2.939AsnMet: 2.939 ± 1.267
2.352AsnAsn: 2.352 ± 0.972
3.527AsnPro: 3.527 ± 0.929
1.764AsnGln: 1.764 ± 0.892
2.352AsnArg: 2.352 ± 1.141
2.352AsnSer: 2.352 ± 0.735
2.352AsnThr: 2.352 ± 1.534
5.291AsnVal: 5.291 ± 1.234
0.0AsnTrp: 0.0 ± 0.0
2.939AsnTyr: 2.939 ± 1.08
0.0AsnXaa: 0.0 ± 0.0
Pro
0.588ProAla: 0.588 ± 0.518
2.352ProCys: 2.352 ± 0.868
2.939ProAsp: 2.939 ± 1.524
2.939ProGlu: 2.939 ± 1.168
2.352ProPhe: 2.352 ± 0.948
4.115ProGly: 4.115 ± 1.147
4.115ProHis: 4.115 ± 1.735
4.703ProIle: 4.703 ± 1.554
2.939ProLys: 2.939 ± 1.199
2.939ProLeu: 2.939 ± 1.079
1.176ProMet: 1.176 ± 0.849
1.176ProAsn: 1.176 ± 0.913
1.764ProPro: 1.764 ± 0.763
3.527ProGln: 3.527 ± 1.384
5.879ProArg: 5.879 ± 1.713
5.291ProSer: 5.291 ± 1.9
7.055ProThr: 7.055 ± 1.762
1.764ProVal: 1.764 ± 1.151
1.176ProTrp: 1.176 ± 0.539
3.527ProTyr: 3.527 ± 1.682
0.0ProXaa: 0.0 ± 0.0
Gln
4.703GlnAla: 4.703 ± 1.334
1.176GlnCys: 1.176 ± 0.601
2.352GlnAsp: 2.352 ± 0.739
1.764GlnGlu: 1.764 ± 1.01
2.939GlnPhe: 2.939 ± 0.984
3.527GlnGly: 3.527 ± 0.964
2.939GlnHis: 2.939 ± 1.739
2.939GlnIle: 2.939 ± 1.337
1.176GlnLys: 1.176 ± 0.827
1.176GlnLeu: 1.176 ± 0.792
0.0GlnMet: 0.0 ± 0.0
1.764GlnAsn: 1.764 ± 1.449
2.352GlnPro: 2.352 ± 1.529
1.176GlnGln: 1.176 ± 0.827
1.176GlnArg: 1.176 ± 0.539
4.703GlnSer: 4.703 ± 1.031
1.764GlnThr: 1.764 ± 1.072
5.291GlnVal: 5.291 ± 1.647
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.352ArgAla: 2.352 ± 1.597
2.939ArgCys: 2.939 ± 1.283
2.939ArgAsp: 2.939 ± 1.105
1.764ArgGlu: 1.764 ± 1.046
4.115ArgPhe: 4.115 ± 1.046
4.703ArgGly: 4.703 ± 1.895
1.764ArgHis: 1.764 ± 1.027
2.352ArgIle: 2.352 ± 1.164
3.527ArgLys: 3.527 ± 1.277
6.467ArgLeu: 6.467 ± 2.981
1.764ArgMet: 1.764 ± 1.02
1.176ArgAsn: 1.176 ± 0.735
6.467ArgPro: 6.467 ± 1.665
1.764ArgGln: 1.764 ± 1.027
7.643ArgArg: 7.643 ± 2.382
6.467ArgSer: 6.467 ± 1.279
4.115ArgThr: 4.115 ± 1.193
6.467ArgVal: 6.467 ± 1.44
0.0ArgTrp: 0.0 ± 0.0
1.764ArgTyr: 1.764 ± 0.905
0.0ArgXaa: 0.0 ± 0.0
Ser
2.939SerAla: 2.939 ± 1.741
0.588SerCys: 0.588 ± 0.46
2.939SerAsp: 2.939 ± 0.922
2.939SerGlu: 2.939 ± 1.242
4.115SerPhe: 4.115 ± 1.458
2.352SerGly: 2.352 ± 1.054
1.176SerHis: 1.176 ± 1.35
5.879SerIle: 5.879 ± 1.943
5.879SerLys: 5.879 ± 1.834
5.291SerLeu: 5.291 ± 1.981
2.352SerMet: 2.352 ± 2.051
4.115SerAsn: 4.115 ± 1.502
8.23SerPro: 8.23 ± 1.688
5.879SerGln: 5.879 ± 2.017
4.703SerArg: 4.703 ± 1.504
11.17SerSer: 11.17 ± 3.136
5.879SerThr: 5.879 ± 2.367
5.291SerVal: 5.291 ± 2.359
0.0SerTrp: 0.0 ± 0.0
2.939SerTyr: 2.939 ± 1.063
0.0SerXaa: 0.0 ± 0.0
Thr
2.352ThrAla: 2.352 ± 1.456
0.588ThrCys: 0.588 ± 0.518
2.939ThrAsp: 2.939 ± 1.784
1.764ThrGlu: 1.764 ± 0.755
2.352ThrPhe: 2.352 ± 0.735
3.527ThrGly: 3.527 ± 1.142
2.352ThrHis: 2.352 ± 1.123
3.527ThrIle: 3.527 ± 1.694
2.352ThrLys: 2.352 ± 1.079
5.291ThrLeu: 5.291 ± 1.35
2.939ThrMet: 2.939 ± 0.931
5.879ThrAsn: 5.879 ± 2.087
4.115ThrPro: 4.115 ± 0.98
0.0ThrGln: 0.0 ± 0.0
4.703ThrArg: 4.703 ± 1.6
2.939ThrSer: 2.939 ± 1.535
2.939ThrThr: 2.939 ± 1.616
5.291ThrVal: 5.291 ± 1.829
1.176ThrTrp: 1.176 ± 0.731
1.764ThrTyr: 1.764 ± 0.887
0.0ThrXaa: 0.0 ± 0.0
Val
1.176ValAla: 1.176 ± 0.682
0.588ValCys: 0.588 ± 0.457
4.703ValAsp: 4.703 ± 1.486
2.939ValGlu: 2.939 ± 1.861
3.527ValPhe: 3.527 ± 1.323
2.352ValGly: 2.352 ± 1.256
0.588ValHis: 0.588 ± 0.698
2.352ValIle: 2.352 ± 0.738
4.703ValLys: 4.703 ± 2.005
5.879ValLeu: 5.879 ± 2.461
1.176ValMet: 1.176 ± 0.631
1.176ValAsn: 1.176 ± 1.035
4.703ValPro: 4.703 ± 1.33
2.352ValGln: 2.352 ± 1.494
4.115ValArg: 4.115 ± 2.187
8.23ValSer: 8.23 ± 2.187
4.115ValThr: 4.115 ± 1.366
2.352ValVal: 2.352 ± 1.686
1.764ValTrp: 1.764 ± 0.853
4.115ValTyr: 4.115 ± 1.656
0.0ValXaa: 0.0 ± 0.0
Trp
2.352TrpAla: 2.352 ± 0.892
0.0TrpCys: 0.0 ± 0.0
0.588TrpAsp: 0.588 ± 0.698
0.588TrpGlu: 0.588 ± 0.687
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.588TrpLys: 0.588 ± 0.46
0.0TrpLeu: 0.0 ± 0.0
0.588TrpMet: 0.588 ± 0.558
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.588TrpGln: 0.588 ± 0.457
1.176TrpArg: 1.176 ± 0.792
1.764TrpSer: 1.764 ± 0.76
0.588TrpThr: 0.588 ± 0.687
1.176TrpVal: 1.176 ± 0.601
0.0TrpTrp: 0.0 ± 0.0
0.588TrpTyr: 0.588 ± 0.457
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.352TyrAla: 2.352 ± 1.085
0.0TyrCys: 0.0 ± 0.0
1.764TyrAsp: 1.764 ± 1.013
2.352TyrGlu: 2.352 ± 1.392
2.939TyrPhe: 2.939 ± 0.864
2.352TyrGly: 2.352 ± 1.024
0.0TyrHis: 0.0 ± 0.0
4.115TyrIle: 4.115 ± 1.138
1.764TyrLys: 1.764 ± 1.025
3.527TyrLeu: 3.527 ± 1.777
1.764TyrMet: 1.764 ± 0.776
1.764TyrAsn: 1.764 ± 0.789
2.352TyrPro: 2.352 ± 0.842
1.176TyrGln: 1.176 ± 0.682
4.703TyrArg: 4.703 ± 2.038
2.352TyrSer: 2.352 ± 0.948
1.764TyrThr: 1.764 ± 0.721
3.527TyrVal: 3.527 ± 2.013
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1702 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski