Amino acid dipeptide frequency for African oil palm ringspot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
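As a rough illustration of what such a calculation involves, the following minimal Python sketch counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice to pool counts over all proteins are assumptions made for this example; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille, pooled over all sequences.

    Assumption: overlapping windows of length 2, never spanning two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical, not taken from the virus proteome).
freqs = dipeptide_frequencies(["MKLLV", "MKA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In this toy input there are six dipeptide positions in total, and "MK" occurs twice, so it accounts for one third of them.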

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
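The arithmetic behind the expected value can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the comparison against the uniform 20-letter expectation is an assumption: the page does not state which threshold is used for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide under a uniform model."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected):
    """Label a frequency relative to the expected value."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected = expected_per_mille(20)       # 400 possible dipeptides
expected_xaa = expected_per_mille(21)   # 441 if Xaa is included

# Two values taken from the table: SerSer and TrpTrp.
print(expected, round(expected_xaa, 3))
print("SerSer", classify(10.289, expected))
print("TrpTrp", classify(0.0, expected))
```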

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
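For readers who prefer fractions or percentages, the unit conversion is a one-liner. The function names below are chosen for this example only.

```python
import math


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


# SerSer is listed in the table at 10.289 per mille.
ser_ser_fraction = per_mille_to_fraction(10.289)
ser_ser_percent = per_mille_to_percent(10.289)
print(f"{ser_ser_fraction:.6f} {ser_ser_percent:.4f}%")
```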

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.519 ± 3.876
AlaCys: 1.583 ± 0.826
AlaAsp: 6.332 ± 1.399
AlaGlu: 5.54 ± 2.09
AlaPhe: 5.54 ± 1.806
AlaGly: 3.957 ± 0.805
AlaHis: 1.979 ± 2.19
AlaIle: 4.749 ± 1.898
AlaLys: 6.727 ± 1.505
AlaLeu: 6.727 ± 2.461
AlaMet: 1.187 ± 0.619
AlaAsn: 1.979 ± 0.62
AlaPro: 2.374 ± 0.713
AlaGln: 1.583 ± 0.826
AlaArg: 1.979 ± 2.007
AlaSer: 2.77 ± 1.615
AlaThr: 3.957 ± 0.705
AlaVal: 3.562 ± 2.848
AlaTrp: 0.791 ± 0.725
AlaTyr: 1.979 ± 1.032
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.187 ± 0.627
CysCys: 0.396 ± 0.79
CysAsp: 1.979 ± 0.806
CysGlu: 1.583 ± 1.131
CysPhe: 1.187 ± 0.853
CysGly: 1.979 ± 1.376
CysHis: 1.187 ± 0.853
CysIle: 0.791 ± 0.939
CysLys: 2.77 ± 0.508
CysLeu: 1.979 ± 0.806
CysMet: 0.0 ± 0.0
CysAsn: 0.396 ± 0.206
CysPro: 0.791 ± 0.413
CysGln: 0.0 ± 0.0
CysArg: 0.396 ± 0.206
CysSer: 2.77 ± 0.847
CysThr: 2.374 ± 0.546
CysVal: 2.374 ± 0.713
CysTrp: 0.0 ± 0.0
CysTyr: 0.791 ± 1.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.77 ± 0.847
AspCys: 2.77 ± 0.847
AspAsp: 2.374 ± 0.713
AspGlu: 5.144 ± 1.23
AspPhe: 3.957 ± 1.05
AspGly: 1.979 ± 0.822
AspHis: 0.791 ± 0.413
AspIle: 5.144 ± 0.953
AspLys: 2.77 ± 0.847
AspLeu: 5.936 ± 1.487
AspMet: 1.583 ± 0.589
AspAsn: 1.583 ± 1.213
AspPro: 1.979 ± 1.376
AspGln: 1.187 ± 0.627
AspArg: 1.583 ± 1.45
AspSer: 4.353 ± 2.271
AspThr: 1.187 ± 0.619
AspVal: 1.583 ± 0.718
AspTrp: 1.583 ± 0.589
AspTyr: 1.583 ± 0.826
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.749 ± 2.478
GluCys: 0.396 ± 0.206
GluAsp: 2.374 ± 0.713
GluGlu: 3.957 ± 1.613
GluPhe: 2.374 ± 0.932
GluGly: 4.353 ± 0.974
GluHis: 0.396 ± 0.862
GluIle: 3.957 ± 0.705
GluLys: 3.957 ± 0.805
GluLeu: 7.519 ± 4.813
GluMet: 0.396 ± 0.206
GluAsn: 0.791 ± 0.413
GluPro: 4.749 ± 1.807
GluGln: 2.77 ± 1.197
GluArg: 6.727 ± 1.993
GluSer: 6.727 ± 2.677
GluThr: 2.77 ± 2.061
GluVal: 5.936 ± 0.957
GluTrp: 0.791 ± 0.413
GluTyr: 0.791 ± 0.709
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.166 ± 0.817
PheCys: 3.166 ± 0.55
PheAsp: 5.936 ± 1.954
PheGlu: 5.144 ± 2.011
PhePhe: 2.374 ± 0.713
PheGly: 4.353 ± 1.06
PheHis: 0.791 ± 0.413
PheIle: 1.979 ± 0.806
PheLys: 3.166 ± 2.054
PheLeu: 5.144 ± 2.684
PheMet: 1.583 ± 0.822
PheAsn: 4.353 ± 1.731
PhePro: 0.791 ± 0.413
PheGln: 2.374 ± 1.239
PheArg: 0.0 ± 0.0
PheSer: 6.332 ± 1.473
PheThr: 3.957 ± 1.861
PheVal: 3.166 ± 1.112
PheTrp: 0.396 ± 0.206
PheTyr: 1.583 ± 0.826
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.957 ± 1.25
GlyCys: 1.979 ± 2.142
GlyAsp: 5.144 ± 1.475
GlyGlu: 3.166 ± 1.215
GlyPhe: 2.77 ± 0.871
GlyGly: 1.979 ± 3.674
GlyHis: 0.791 ± 0.725
GlyIle: 2.77 ± 0.871
GlyLys: 5.144 ± 1.23
GlyLeu: 3.957 ± 2.288
GlyMet: 0.396 ± 0.206
GlyAsn: 1.979 ± 0.62
GlyPro: 0.791 ± 0.725
GlyGln: 1.187 ± 0.853
GlyArg: 3.562 ± 0.824
GlySer: 6.332 ± 1.563
GlyThr: 2.77 ± 3.432
GlyVal: 3.957 ± 1.338
GlyTrp: 1.187 ± 0.619
GlyTyr: 1.583 ± 1.096
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.979 ± 0.806
HisCys: 0.396 ± 0.206
HisAsp: 1.583 ± 0.826
HisGlu: 0.396 ± 0.206
HisPhe: 1.583 ± 1.963
HisGly: 1.187 ± 1.486
HisHis: 0.791 ± 0.939
HisIle: 1.187 ± 0.853
HisLys: 0.791 ± 0.709
HisLeu: 1.979 ± 0.822
HisMet: 0.791 ± 0.658
HisAsn: 1.187 ± 0.627
HisPro: 1.187 ± 1.486
HisGln: 0.396 ± 0.206
HisArg: 1.187 ± 0.853
HisSer: 3.166 ± 1.624
HisThr: 1.187 ± 2.321
HisVal: 0.396 ± 0.206
HisTrp: 0.0 ± 0.0
HisTyr: 1.187 ± 0.619
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.562 ± 0.99
IleCys: 1.187 ± 0.683
IleAsp: 1.979 ± 1.032
IleGlu: 5.54 ± 1.058
IlePhe: 5.54 ± 2.179
IleGly: 1.187 ± 0.853
IleHis: 1.979 ± 3.207
IleIle: 2.77 ± 2.08
IleLys: 3.957 ± 1.362
IleLeu: 5.144 ± 1.66
IleMet: 1.979 ± 1.34
IleAsn: 1.583 ± 0.812
IlePro: 2.374 ± 1.365
IleGln: 1.583 ± 0.826
IleArg: 4.749 ± 1.093
IleSer: 5.936 ± 1.039
IleThr: 1.187 ± 1.486
IleVal: 3.166 ± 1.555
IleTrp: 0.791 ± 0.939
IleTyr: 1.979 ± 0.822
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.957 ± 0.805
LysCys: 0.791 ± 0.939
LysAsp: 2.77 ± 1.208
LysGlu: 6.332 ± 2.502
LysPhe: 4.749 ± 1.136
LysGly: 3.166 ± 1.215
LysHis: 2.374 ± 0.964
LysIle: 3.166 ± 1.652
LysLys: 4.749 ± 1.155
LysLeu: 7.915 ± 1.611
LysMet: 1.187 ± 0.619
LysAsn: 4.353 ± 1.781
LysPro: 5.144 ± 0.775
LysGln: 1.187 ± 0.619
LysArg: 4.353 ± 2.271
LysSer: 3.957 ± 1.613
LysThr: 3.166 ± 3.011
LysVal: 3.562 ± 0.958
LysTrp: 0.0 ± 0.0
LysTyr: 1.187 ± 0.627
LysXaa: 0.396 ± 0.862
Leu
LeuAla: 5.54 ± 1.646
LeuCys: 3.166 ± 1.251
LeuAsp: 2.374 ± 0.932
LeuGlu: 3.957 ± 2.033
LeuPhe: 5.54 ± 1.716
LeuGly: 5.936 ± 1.98
LeuHis: 1.583 ± 0.826
LeuIle: 5.54 ± 1.025
LeuLys: 6.727 ± 2.067
LeuLeu: 9.102 ± 6.926
LeuMet: 1.979 ± 1.032
LeuAsn: 4.353 ± 0.669
LeuPro: 5.144 ± 1.564
LeuGln: 3.957 ± 0.805
LeuArg: 7.519 ± 1.961
LeuSer: 7.123 ± 2.332
LeuThr: 3.562 ± 2.216
LeuVal: 8.706 ± 2.389
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.187 ± 1.232
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.353 ± 1.781
MetCys: 0.396 ± 0.206
MetAsp: 0.396 ± 1.059
MetGlu: 0.396 ± 0.79
MetPhe: 0.0 ± 0.0
MetGly: 1.583 ± 0.826
MetHis: 0.0 ± 0.0
MetIle: 0.791 ± 0.725
MetLys: 3.166 ± 1.177
MetLeu: 0.396 ± 0.206
MetMet: 0.791 ± 0.413
MetAsn: 0.791 ± 0.725
MetPro: 0.791 ± 0.725
MetGln: 0.396 ± 0.206
MetArg: 0.791 ± 0.413
MetSer: 2.77 ± 1.397
MetThr: 0.396 ± 0.206
MetVal: 0.791 ± 0.413
MetTrp: 0.0 ± 0.0
MetTyr: 0.396 ± 0.862
MetXaa: 0.396 ± 0.862
Asn
AsnAla: 2.77 ± 2.563
AsnCys: 1.583 ± 0.812
AsnAsp: 1.187 ± 0.619
AsnGlu: 1.979 ± 1.069
AsnPhe: 1.979 ± 1.069
AsnGly: 1.187 ± 0.627
AsnHis: 0.396 ± 0.206
AsnIle: 2.374 ± 0.906
AsnLys: 3.166 ± 1.555
AsnLeu: 4.353 ± 1.802
AsnMet: 0.396 ± 0.206
AsnAsn: 1.187 ± 0.627
AsnPro: 3.562 ± 1.858
AsnGln: 1.583 ± 0.589
AsnArg: 1.979 ± 0.822
AsnSer: 2.374 ± 0.948
AsnThr: 1.187 ± 0.619
AsnVal: 3.562 ± 0.658
AsnTrp: 0.791 ± 0.725
AsnTyr: 3.166 ± 2.426
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.936 ± 3.197
ProCys: 0.791 ± 0.709
ProAsp: 2.77 ± 0.847
ProGlu: 3.166 ± 1.425
ProPhe: 0.791 ± 0.413
ProGly: 2.374 ± 0.546
ProHis: 1.187 ± 1.232
ProIle: 0.791 ± 0.709
ProLys: 3.957 ± 1.303
ProLeu: 2.77 ± 0.847
ProMet: 0.396 ± 0.206
ProAsn: 1.187 ± 0.619
ProPro: 1.979 ± 1.069
ProGln: 0.791 ± 0.413
ProArg: 1.979 ± 1.032
ProSer: 3.562 ± 1.066
ProThr: 2.77 ± 1.615
ProVal: 3.957 ± 2.065
ProTrp: 0.791 ± 0.413
ProTyr: 2.77 ± 0.871
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.583 ± 0.589
GlnCys: 0.0 ± 0.0
GlnAsp: 1.187 ± 0.619
GlnGlu: 1.583 ± 0.718
GlnPhe: 2.374 ± 0.713
GlnGly: 0.396 ± 0.206
GlnHis: 0.396 ± 0.206
GlnIle: 1.979 ± 1.032
GlnLys: 0.791 ± 1.013
GlnLeu: 3.562 ± 1.191
GlnMet: 0.396 ± 0.862
GlnAsn: 1.187 ± 0.619
GlnPro: 1.979 ± 0.62
GlnGln: 0.791 ± 0.413
GlnArg: 1.583 ± 0.826
GlnSer: 1.979 ± 1.376
GlnThr: 1.979 ± 0.806
GlnVal: 2.374 ± 0.546
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.583 ± 0.812
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.749 ± 2.197
ArgCys: 0.396 ± 0.206
ArgAsp: 1.583 ± 0.589
ArgGlu: 3.957 ± 1.24
ArgPhe: 3.957 ± 1.612
ArgGly: 6.332 ± 3.178
ArgHis: 1.583 ± 0.913
ArgIle: 2.77 ± 0.847
ArgLys: 2.374 ± 1.239
ArgLeu: 5.144 ± 2.684
ArgMet: 1.187 ± 0.627
ArgAsn: 1.979 ± 2.189
ArgPro: 1.583 ± 0.812
ArgGln: 0.791 ± 0.725
ArgArg: 5.54 ± 3.563
ArgSer: 3.562 ± 1.371
ArgThr: 1.979 ± 0.822
ArgVal: 3.562 ± 1.512
ArgTrp: 0.396 ± 0.206
ArgTyr: 2.77 ± 1.445
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.54 ± 2.891
SerCys: 1.583 ± 1.213
SerAsp: 6.332 ± 2.051
SerGlu: 4.353 ± 1.32
SerPhe: 5.936 ± 1.755
SerGly: 5.54 ± 1.949
SerHis: 2.77 ± 1.445
SerIle: 5.54 ± 1.646
SerLys: 6.727 ± 2.182
SerLeu: 7.519 ± 2.851
SerMet: 1.187 ± 0.908
SerAsn: 5.54 ± 1.622
SerPro: 3.562 ± 0.796
SerGln: 2.77 ± 0.94
SerArg: 5.144 ± 2.048
SerSer: 10.289 ± 2.826
SerThr: 3.166 ± 1.005
SerVal: 4.353 ± 1.802
SerTrp: 0.396 ± 0.206
SerTyr: 2.374 ± 0.932
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.374 ± 0.546
ThrCys: 0.396 ± 0.206
ThrAsp: 0.396 ± 0.206
ThrGlu: 5.144 ± 3.075
ThrPhe: 5.936 ± 2.331
ThrGly: 3.957 ± 1.983
ThrHis: 1.187 ± 0.627
ThrIle: 2.374 ± 0.992
ThrLys: 4.749 ± 0.909
ThrLeu: 2.374 ± 1.246
ThrMet: 1.583 ± 0.589
ThrAsn: 3.166 ± 1.7
ThrPro: 1.583 ± 1.45
ThrGln: 0.791 ± 0.725
ThrArg: 1.979 ± 2.073
ThrSer: 4.749 ± 4.912
ThrThr: 1.979 ± 1.069
ThrVal: 1.979 ± 2.007
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.396 ± 0.206
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.957 ± 2.121
ValCys: 2.374 ± 1.365
ValAsp: 2.374 ± 1.239
ValGlu: 2.77 ± 1.084
ValPhe: 2.374 ± 1.239
ValGly: 2.77 ± 1.615
ValHis: 1.187 ± 0.965
ValIle: 4.353 ± 2.118
ValLys: 1.187 ± 0.619
ValLeu: 6.332 ± 2.534
ValMet: 1.187 ± 1.28
ValAsn: 1.979 ± 1.032
ValPro: 2.374 ± 0.546
ValGln: 2.374 ± 0.932
ValArg: 3.957 ± 1.42
ValSer: 7.915 ± 2.481
ValThr: 5.936 ± 3.053
ValVal: 3.166 ± 2.261
ValTrp: 0.791 ± 0.413
ValTyr: 1.583 ± 0.826
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.583 ± 1.45
TrpCys: 0.791 ± 0.413
TrpAsp: 0.396 ± 0.206
TrpGlu: 0.791 ± 0.413
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.396 ± 0.206
TrpIle: 0.396 ± 0.206
TrpLys: 0.396 ± 0.206
TrpLeu: 1.583 ± 0.812
TrpMet: 0.0 ± 0.0
TrpAsn: 0.396 ± 0.862
TrpPro: 0.0 ± 0.0
TrpGln: 0.396 ± 0.206
TrpArg: 0.396 ± 0.206
TrpSer: 0.791 ± 0.413
TrpThr: 0.791 ± 0.413
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.187 ± 0.627
TyrCys: 0.791 ± 1.724
TyrAsp: 2.374 ± 1.239
TyrGlu: 1.583 ± 0.826
TyrPhe: 1.187 ± 0.619
TyrGly: 1.187 ± 0.853
TyrHis: 0.791 ± 0.939
TyrIle: 4.749 ± 1.545
TyrLys: 1.187 ± 0.619
TyrLeu: 3.562 ± 1.428
TyrMet: 1.187 ± 1.373
TyrAsn: 0.791 ± 0.939
TyrPro: 1.583 ± 0.718
TyrGln: 0.791 ± 0.709
TyrArg: 0.791 ± 0.413
TyrSer: 3.166 ± 1.652
TyrThr: 0.791 ± 0.939
TyrVal: 0.791 ± 0.939
TyrTrp: 0.396 ± 0.206
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.396 ± 0.862
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.396 ± 0.862
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2528 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
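One way such a protein-level bootstrap could look is sketched below in Python. This is an illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, dipeptide frequencies are recomputed for each resample, and the error is taken as the standard deviation across resamples. The function names, the fixed seed, and the use of the population standard deviation are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Pooled dipeptide frequencies in per mille (overlapping pairs)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_errors(proteins, n_resamples=100, seed=0):
    """Estimate per-dipeptide errors by resampling whole proteins.

    Assumption: error = standard deviation over the resampled frequencies.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        replicates.append(dipeptide_per_mille(sample))
    pairs = set().union(*replicates)
    return {
        pair: statistics.pstdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in pairs
    }


# Toy proteins (hypothetical); identical proteins give zero spread.
identical = bootstrap_errors(["ACAC", "ACAC", "ACAC"])
varied = bootstrap_errors(["AAAA", "CCCC"])
print(identical)
print(varied)
```

With only a handful of proteins, as in this proteome, each resample can differ substantially from the original set, which would be consistent with some errors in the table being as large as the frequencies themselves.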

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski