Amino acid dipeptide frequency for Pepper yellow leaf curl Aceh virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
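The counting and the comparison against the uniform expectation can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names are hypothetical, and counting overlapping pairs within each protein separately (never across protein boundaries) is an assumption, since the page does not state how the pairs are counted.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2 inside a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25 %).
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2

def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"    # shown in red in the table
    if freq_permille < expected:
        return "under"   # shown in blue in the table
    return "expected"
```

For example, `classify(12.766)` (the SerSer value in the table) gives `"over"`, while `classify(0.608)` gives `"under"`.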

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
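As a quick check of the unit conversion, here is a minimal sketch. The helper names are illustrative only and are not part of Proteome-pI.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0

# SerSer is listed as 12.766 per mille in the table.
serser_fraction = permille_to_fraction(12.766)  # about 0.012766
serser_percent = permille_to_percent(12.766)    # about 1.2766 %
```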

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.863AlaAla: 4.863 ± 1.995
1.824AlaCys: 1.824 ± 0.632
0.0AlaAsp: 0.0 ± 0.0
3.647AlaGlu: 3.647 ± 1.141
2.432AlaPhe: 2.432 ± 0.825
1.216AlaGly: 1.216 ± 0.7
0.608AlaHis: 0.608 ± 0.525
2.432AlaIle: 2.432 ± 1.269
4.255AlaLys: 4.255 ± 0.982
5.471AlaLeu: 5.471 ± 2.217
0.608AlaMet: 0.608 ± 0.525
1.216AlaAsn: 1.216 ± 0.782
3.04AlaPro: 3.04 ± 1.434
3.04AlaGln: 3.04 ± 1.568
3.647AlaArg: 3.647 ± 1.775
3.647AlaSer: 3.647 ± 2.159
3.647AlaThr: 3.647 ± 1.599
1.216AlaVal: 1.216 ± 0.724
1.824AlaTrp: 1.824 ± 1.089
1.824AlaTyr: 1.824 ± 0.631
0.0AlaXaa: 0.0 ± 0.0
Cys
0.608CysAla: 0.608 ± 0.511
1.216CysCys: 1.216 ± 1.449
0.608CysAsp: 0.608 ± 0.525
1.824CysGlu: 1.824 ± 0.632
0.608CysPhe: 0.608 ± 0.637
1.216CysGly: 1.216 ± 0.7
0.608CysHis: 0.608 ± 0.553
0.0CysIle: 0.0 ± 0.0
1.824CysLys: 1.824 ± 0.869
1.824CysLeu: 1.824 ± 1.155
1.216CysMet: 1.216 ± 0.866
2.432CysAsn: 2.432 ± 1.047
1.824CysPro: 1.824 ± 1.418
0.608CysGln: 0.608 ± 0.511
2.432CysArg: 2.432 ± 1.047
2.432CysSer: 2.432 ± 1.189
0.608CysThr: 0.608 ± 0.56
1.824CysVal: 1.824 ± 1.076
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.432AspAla: 2.432 ± 1.443
0.608AspCys: 0.608 ± 0.553
2.432AspAsp: 2.432 ± 1.599
1.216AspGlu: 1.216 ± 0.699
1.824AspPhe: 1.824 ± 0.908
3.04AspGly: 3.04 ± 1.539
1.216AspHis: 1.216 ± 0.732
1.216AspIle: 1.216 ± 0.635
2.432AspLys: 2.432 ± 1.193
4.863AspLeu: 4.863 ± 0.983
0.608AspMet: 0.608 ± 0.637
1.216AspAsn: 1.216 ± 1.119
3.04AspPro: 3.04 ± 1.084
1.824AspGln: 1.824 ± 1.717
3.647AspArg: 3.647 ± 1.039
3.647AspSer: 3.647 ± 1.541
4.255AspThr: 4.255 ± 0.892
4.255AspVal: 4.255 ± 0.911
0.608AspTrp: 0.608 ± 0.511
2.432AspTyr: 2.432 ± 1.128
0.0AspXaa: 0.0 ± 0.0
Glu
4.863GluAla: 4.863 ± 1.94
0.0GluCys: 0.0 ± 0.0
2.432GluAsp: 2.432 ± 1.423
4.255GluGlu: 4.255 ± 1.936
3.04GluPhe: 3.04 ± 1.084
3.647GluGly: 3.647 ± 1.081
0.0GluHis: 0.0 ± 0.0
1.824GluIle: 1.824 ± 1.204
1.216GluLys: 1.216 ± 0.597
3.04GluLeu: 3.04 ± 1.048
0.608GluMet: 0.608 ± 0.572
3.647GluAsn: 3.647 ± 1.696
1.824GluPro: 1.824 ± 0.632
2.432GluGln: 2.432 ± 1.317
1.824GluArg: 1.824 ± 0.889
2.432GluSer: 2.432 ± 1.482
3.647GluThr: 3.647 ± 2.357
3.647GluVal: 3.647 ± 1.14
0.608GluTrp: 0.608 ± 0.553
1.824GluTyr: 1.824 ± 0.717
0.0GluXaa: 0.0 ± 0.0
Phe
1.216PheAla: 1.216 ± 0.597
1.216PheCys: 1.216 ± 0.72
1.216PheAsp: 1.216 ± 1.022
1.216PheGlu: 1.216 ± 0.699
1.824PhePhe: 1.824 ± 1.089
2.432PheGly: 2.432 ± 1.495
3.04PheHis: 3.04 ± 1.518
0.608PheIle: 0.608 ± 0.511
3.04PheLys: 3.04 ± 1.226
3.647PheLeu: 3.647 ± 1.785
0.608PheMet: 0.608 ± 0.511
4.255PheAsn: 4.255 ± 1.545
1.216PhePro: 1.216 ± 0.866
2.432PheGln: 2.432 ± 0.971
4.255PheArg: 4.255 ± 1.157
4.255PheSer: 4.255 ± 1.993
1.216PheThr: 1.216 ± 0.698
3.647PheVal: 3.647 ± 1.556
1.216PheTrp: 1.216 ± 0.72
0.608PheTyr: 0.608 ± 0.56
0.0PheXaa: 0.0 ± 0.0
Gly
3.647GlyAla: 3.647 ± 1.684
2.432GlyCys: 2.432 ± 0.698
2.432GlyAsp: 2.432 ± 0.971
3.04GlyGlu: 3.04 ± 1.179
2.432GlyPhe: 2.432 ± 0.992
3.647GlyGly: 3.647 ± 1.076
2.432GlyHis: 2.432 ± 0.689
3.04GlyIle: 3.04 ± 1.002
5.471GlyLys: 5.471 ± 1.792
3.647GlyLeu: 3.647 ± 1.086
1.824GlyMet: 1.824 ± 0.987
1.824GlyAsn: 1.824 ± 0.907
1.824GlyPro: 1.824 ± 0.69
1.824GlyGln: 1.824 ± 0.907
2.432GlyArg: 2.432 ± 0.927
7.295GlySer: 7.295 ± 2.91
1.824GlyThr: 1.824 ± 1.022
2.432GlyVal: 2.432 ± 1.398
0.0GlyTrp: 0.0 ± 0.0
0.608GlyTyr: 0.608 ± 0.637
0.0GlyXaa: 0.0 ± 0.0
His
0.608HisAla: 0.608 ± 0.56
1.216HisCys: 1.216 ± 0.899
1.824HisAsp: 1.824 ± 0.735
1.824HisGlu: 1.824 ± 0.631
3.04HisPhe: 3.04 ± 1.372
3.647HisGly: 3.647 ± 2.348
0.608HisHis: 0.608 ± 0.553
1.824HisIle: 1.824 ± 1.488
0.608HisLys: 0.608 ± 0.637
1.824HisLeu: 1.824 ± 1.094
0.608HisMet: 0.608 ± 0.612
2.432HisAsn: 2.432 ± 0.968
1.216HisPro: 1.216 ± 0.782
1.824HisGln: 1.824 ± 0.794
4.255HisArg: 4.255 ± 1.393
1.824HisSer: 1.824 ± 0.902
1.824HisThr: 1.824 ± 1.679
3.647HisVal: 3.647 ± 1.273
0.0HisTrp: 0.0 ± 0.0
1.216HisTyr: 1.216 ± 0.609
0.0HisXaa: 0.0 ± 0.0
Ile
0.608IleAla: 0.608 ± 0.553
1.216IleCys: 1.216 ± 0.782
4.863IleAsp: 4.863 ± 1.763
2.432IleGlu: 2.432 ± 1.047
4.255IlePhe: 4.255 ± 1.787
2.432IleGly: 2.432 ± 1.154
3.04IleHis: 3.04 ± 1.361
3.647IleIle: 3.647 ± 0.959
4.863IleLys: 4.863 ± 1.142
2.432IleLeu: 2.432 ± 1.678
0.0IleMet: 0.0 ± 0.529
3.647IleAsn: 3.647 ± 1.126
2.432IlePro: 2.432 ± 0.988
4.255IleGln: 4.255 ± 1.276
4.255IleArg: 4.255 ± 1.422
1.824IleSer: 1.824 ± 1.353
3.04IleThr: 3.04 ± 1.645
1.216IleVal: 1.216 ± 0.699
1.824IleTrp: 1.824 ± 1.911
0.608IleTyr: 0.608 ± 0.56
0.0IleXaa: 0.0 ± 0.0
Lys
1.824LysAla: 1.824 ± 0.717
1.824LysCys: 1.824 ± 0.908
2.432LysAsp: 2.432 ± 0.885
3.647LysGlu: 3.647 ± 1.972
1.824LysPhe: 1.824 ± 1.079
2.432LysGly: 2.432 ± 1.041
0.608LysHis: 0.608 ± 0.511
3.647LysIle: 3.647 ± 1.207
3.647LysLys: 3.647 ± 1.314
3.04LysLeu: 3.04 ± 1.612
1.216LysMet: 1.216 ± 1.123
5.471LysAsn: 5.471 ± 1.756
3.04LysPro: 3.04 ± 0.945
1.824LysGln: 1.824 ± 1.214
3.04LysArg: 3.04 ± 2.132
6.687LysSer: 6.687 ± 1.646
1.824LysThr: 1.824 ± 0.631
4.863LysVal: 4.863 ± 1.543
0.0LysTrp: 0.0 ± 0.0
3.04LysTyr: 3.04 ± 0.971
0.0LysXaa: 0.0 ± 0.0
Leu
1.824LeuAla: 1.824 ± 0.942
3.647LeuCys: 3.647 ± 0.911
4.863LeuAsp: 4.863 ± 1.456
4.255LeuGlu: 4.255 ± 1.238
3.04LeuPhe: 3.04 ± 0.901
2.432LeuGly: 2.432 ± 1.305
6.079LeuHis: 6.079 ± 1.24
4.255LeuIle: 4.255 ± 1.794
3.04LeuLys: 3.04 ± 0.944
4.255LeuLeu: 4.255 ± 1.329
1.216LeuMet: 1.216 ± 0.856
3.647LeuAsn: 3.647 ± 1.034
4.255LeuPro: 4.255 ± 1.05
3.04LeuGln: 3.04 ± 1.46
6.079LeuArg: 6.079 ± 2.512
5.471LeuSer: 5.471 ± 1.896
0.0LeuThr: 0.0 ± 0.0
1.824LeuVal: 1.824 ± 0.632
1.216LeuTrp: 1.216 ± 0.828
4.863LeuTyr: 4.863 ± 1.215
0.0LeuXaa: 0.0 ± 0.0
Met
0.608MetAla: 0.608 ± 0.56
0.608MetCys: 0.608 ± 0.525
1.824MetAsp: 1.824 ± 1.297
1.824MetGlu: 1.824 ± 1.214
0.608MetPhe: 0.608 ± 0.56
2.432MetGly: 2.432 ± 0.757
0.608MetHis: 0.608 ± 0.572
0.0MetIle: 0.0 ± 0.0
0.608MetLys: 0.608 ± 0.572
1.216MetLeu: 1.216 ± 1.08
0.0MetMet: 0.0 ± 0.0
0.608MetAsn: 0.608 ± 0.56
1.824MetPro: 1.824 ± 0.632
0.608MetGln: 0.608 ± 0.572
1.216MetArg: 1.216 ± 0.758
1.824MetSer: 1.824 ± 1.155
1.216MetThr: 1.216 ± 0.856
0.0MetVal: 0.0 ± 0.0
1.216MetTrp: 1.216 ± 0.782
0.608MetTyr: 0.608 ± 0.56
0.0MetXaa: 0.0 ± 0.0
Asn
6.079AsnAla: 6.079 ± 1.99
1.824AsnCys: 1.824 ± 0.674
2.432AsnAsp: 2.432 ± 0.978
2.432AsnGlu: 2.432 ± 1.058
0.608AsnPhe: 0.608 ± 0.511
1.824AsnGly: 1.824 ± 0.822
3.04AsnHis: 3.04 ± 1.638
3.04AsnIle: 3.04 ± 1.555
1.216AsnLys: 1.216 ± 0.609
4.255AsnLeu: 4.255 ± 2.011
2.432AsnMet: 2.432 ± 1.424
7.903AsnAsn: 7.903 ± 1.669
3.647AsnPro: 3.647 ± 1.518
1.216AsnGln: 1.216 ± 0.856
2.432AsnArg: 2.432 ± 1.27
3.04AsnSer: 3.04 ± 1.447
3.647AsnThr: 3.647 ± 1.104
7.295AsnVal: 7.295 ± 1.193
0.608AsnTrp: 0.608 ± 0.511
3.04AsnTyr: 3.04 ± 1.228
0.0AsnXaa: 0.0 ± 0.0
Pro
2.432ProAla: 2.432 ± 0.834
1.824ProCys: 1.824 ± 0.882
0.608ProAsp: 0.608 ± 0.724
2.432ProGlu: 2.432 ± 1.482
2.432ProPhe: 2.432 ± 1.106
3.647ProGly: 3.647 ± 1.339
3.04ProHis: 3.04 ± 0.998
2.432ProIle: 2.432 ± 1.047
3.647ProLys: 3.647 ± 1.472
4.255ProLeu: 4.255 ± 1.318
1.824ProMet: 1.824 ± 1.076
1.824ProAsn: 1.824 ± 1.079
1.824ProPro: 1.824 ± 1.533
3.647ProGln: 3.647 ± 1.614
3.04ProArg: 3.04 ± 1.088
5.471ProSer: 5.471 ± 1.479
4.255ProThr: 4.255 ± 2.27
3.04ProVal: 3.04 ± 1.125
1.216ProTrp: 1.216 ± 0.609
1.216ProTyr: 1.216 ± 0.72
0.0ProXaa: 0.0 ± 0.0
Gln
3.647GlnAla: 3.647 ± 1.018
1.216GlnCys: 1.216 ± 1.022
2.432GlnAsp: 2.432 ± 1.284
1.824GlnGlu: 1.824 ± 0.632
1.824GlnPhe: 1.824 ± 0.717
3.04GlnGly: 3.04 ± 1.094
1.216GlnHis: 1.216 ± 0.898
3.04GlnIle: 3.04 ± 1.529
2.432GlnLys: 2.432 ± 1.2
1.824GlnLeu: 1.824 ± 0.717
0.608GlnMet: 0.608 ± 0.684
1.216GlnAsn: 1.216 ± 0.724
1.824GlnPro: 1.824 ± 1.326
2.432GlnGln: 2.432 ± 1.516
2.432GlnArg: 2.432 ± 0.982
3.647GlnSer: 3.647 ± 1.512
3.04GlnThr: 3.04 ± 1.764
3.647GlnVal: 3.647 ± 1.346
0.0GlnTrp: 0.0 ± 0.0
1.216GlnTyr: 1.216 ± 1.119
0.0GlnXaa: 0.0 ± 0.0
Arg
1.216ArgAla: 1.216 ± 0.769
1.216ArgCys: 1.216 ± 1.449
3.647ArgAsp: 3.647 ± 1.667
2.432ArgGlu: 2.432 ± 1.304
4.255ArgPhe: 4.255 ± 1.759
4.863ArgGly: 4.863 ± 1.305
3.04ArgHis: 3.04 ± 1.379
4.863ArgIle: 4.863 ± 1.114
2.432ArgLys: 2.432 ± 0.838
3.647ArgLeu: 3.647 ± 1.375
1.216ArgMet: 1.216 ± 0.856
2.432ArgAsn: 2.432 ± 0.988
4.255ArgPro: 4.255 ± 1.44
1.216ArgGln: 1.216 ± 1.049
7.903ArgArg: 7.903 ± 3.412
5.471ArgSer: 5.471 ± 1.654
4.255ArgThr: 4.255 ± 1.551
7.903ArgVal: 7.903 ± 2.703
0.608ArgTrp: 0.608 ± 0.525
3.04ArgTyr: 3.04 ± 1.361
0.0ArgXaa: 0.0 ± 0.0
Ser
4.863SerAla: 4.863 ± 2.007
0.608SerCys: 0.608 ± 0.572
4.255SerAsp: 4.255 ± 1.578
1.824SerGlu: 1.824 ± 0.786
3.04SerPhe: 3.04 ± 1.001
2.432SerGly: 2.432 ± 1.662
1.216SerHis: 1.216 ± 1.105
5.471SerIle: 5.471 ± 1.489
7.903SerLys: 7.903 ± 0.91
3.04SerLeu: 3.04 ± 1.401
0.0SerMet: 0.0 ± 0.0
6.687SerAsn: 6.687 ± 1.875
6.079SerPro: 6.079 ± 1.891
3.647SerGln: 3.647 ± 0.866
4.863SerArg: 4.863 ± 1.75
12.766SerSer: 12.766 ± 1.651
10.334SerThr: 10.334 ± 3.288
5.471SerVal: 5.471 ± 3.125
0.608SerTrp: 0.608 ± 0.511
4.863SerTyr: 4.863 ± 0.927
0.0SerXaa: 0.0 ± 0.0
Thr
3.04ThrAla: 3.04 ± 1.04
0.608ThrCys: 0.608 ± 0.681
2.432ThrAsp: 2.432 ± 1.128
3.647ThrGlu: 3.647 ± 1.589
1.216ThrPhe: 1.216 ± 0.802
4.255ThrGly: 4.255 ± 0.891
2.432ThrHis: 2.432 ± 1.681
3.04ThrIle: 3.04 ± 1.4
1.824ThrLys: 1.824 ± 1.418
4.255ThrLeu: 4.255 ± 2.201
0.608ThrMet: 0.608 ± 0.511
3.647ThrAsn: 3.647 ± 1.232
4.255ThrPro: 4.255 ± 1.36
1.216ThrGln: 1.216 ± 0.899
4.255ThrArg: 4.255 ± 0.891
6.079ThrSer: 6.079 ± 2.228
1.824ThrThr: 1.824 ± 0.735
5.471ThrVal: 5.471 ± 1.827
0.608ThrTrp: 0.608 ± 0.572
1.824ThrTyr: 1.824 ± 1.105
0.0ThrXaa: 0.0 ± 0.0
Val
1.216ValAla: 1.216 ± 0.858
0.608ValCys: 0.608 ± 0.511
3.04ValAsp: 3.04 ± 1.451
1.824ValGlu: 1.824 ± 0.889
2.432ValPhe: 2.432 ± 0.943
3.04ValGly: 3.04 ± 1.36
2.432ValHis: 2.432 ± 1.2
4.255ValIle: 4.255 ± 1.737
3.04ValLys: 3.04 ± 1.225
6.687ValLeu: 6.687 ± 1.805
0.608ValMet: 0.608 ± 0.56
4.255ValAsn: 4.255 ± 1.72
4.255ValPro: 4.255 ± 0.832
4.863ValGln: 4.863 ± 1.872
4.255ValArg: 4.255 ± 2.527
6.687ValSer: 6.687 ± 2.063
4.255ValThr: 4.255 ± 1.538
6.687ValVal: 6.687 ± 2.533
1.216ValTrp: 1.216 ± 0.635
6.079ValTyr: 6.079 ± 1.883
0.0ValXaa: 0.0 ± 0.0
Trp
1.824TrpAla: 1.824 ± 0.968
0.0TrpCys: 0.0 ± 0.0
1.216TrpAsp: 1.216 ± 0.951
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.608TrpGly: 0.608 ± 0.511
0.608TrpHis: 0.608 ± 0.637
0.0TrpIle: 0.0 ± 0.0
1.216TrpLys: 1.216 ± 0.783
1.216TrpLeu: 1.216 ± 0.635
0.608TrpMet: 0.608 ± 0.56
0.608TrpAsn: 0.608 ± 0.525
0.0TrpPro: 0.0 ± 0.0
0.608TrpGln: 0.608 ± 0.511
1.216TrpArg: 1.216 ± 0.7
0.0TrpSer: 0.0 ± 0.0
1.216TrpThr: 1.216 ± 0.783
1.824TrpVal: 1.824 ± 0.632
0.0TrpTrp: 0.0 ± 0.0
0.608TrpTyr: 0.608 ± 0.511
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.04TyrAla: 3.04 ± 1.757
0.0TyrCys: 0.0 ± 0.0
1.824TyrAsp: 1.824 ± 1.076
1.216TyrGlu: 1.216 ± 0.868
2.432TyrPhe: 2.432 ± 0.838
1.824TyrGly: 1.824 ± 0.632
0.0TyrHis: 0.0 ± 0.0
4.863TyrIle: 4.863 ± 1.608
1.216TyrLys: 1.216 ± 0.699
4.863TyrLeu: 4.863 ± 1.665
2.432TyrMet: 2.432 ± 1.259
3.04TyrAsn: 3.04 ± 1.117
2.432TyrPro: 2.432 ± 1.041
0.0TyrGln: 0.0 ± 0.0
2.432TyrArg: 2.432 ± 1.701
5.471TyrSer: 5.471 ± 1.695
0.608TyrThr: 0.608 ± 0.572
1.824TyrVal: 1.824 ± 0.828
0.0TyrTrp: 0.0 ± 0.0
1.824TyrTyr: 1.824 ± 0.829
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1646 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
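A protein-level bootstrap of this kind could look roughly like the sketch below. It rests on several assumptions that the page does not confirm: whole proteins are resampled with replacement, 100 replicates are drawn, and the reported ± value is the standard deviation across replicates. All function names are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside each protein (assumed counting scheme).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

def bootstrap_sd(proteins, dipeptide, n_rep=100, seed=0):
    """Estimate the spread of one dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_rep):
        # Draw as many proteins as in the proteome, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        values.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    # Assumption: the error is the standard deviation over the replicates.
    return statistics.pstdev(values)
```

With only 8 proteins, as here, individual replicates can differ considerably, which is consistent with the relatively large errors next to many of the values.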

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski