Amino acid dipeptide frequency for Apis mellifera associated microvirus 30

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
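The calculation described above can be sketched in Python. This is only a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, one-letter amino acid codes are used instead of the three-letter codes of the table, and dipeptides are counted as overlapping pairs within each protein (the exact counting convention of the database is not stated here).

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Hypothetical helper: counts overlapping residue pairs within each
    protein and normalises by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along the protein sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy "proteome" (one-letter codes), not data from this page.
toy_proteins = ["MKRSS", "ASSLK"]
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation for 400 possible dipeptides: 1/400 = 0.25 % = 2.5 per mille.
uniform_per_mille = 1000.0 / 400
```

In this toy example the eight counted pairs include "SS" twice, so its frequency is far above the uniform expectation; with real proteomes the differences are much smaller.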

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
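As a small worked example of the unit conversion, the following sketch turns a per mille value from the table into a fraction or a percentage and compares it with the uniform expectation of 2.5‰ (1/400). The function names are hypothetical, and the simple above/below rule is an assumption about how over- and underrepresentation could be decided, not a description of the exact rule used for the colouring.

```python
# Uniform expectation for 400 dipeptides, in per mille (equals 0.25 %).
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def representation(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the expected value (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values taken from the table: LysArg (9.448) and AlaCys (0.727).
lys_arg = representation(9.448)
ala_cys = representation(0.727)
```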

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.541 ± 2.635
AlaCys: 0.727 ± 1.15
AlaAsp: 4.36 ± 1.487
AlaGlu: 6.541 ± 2.089
AlaPhe: 1.453 ± 1.013
AlaGly: 3.634 ± 1.662
AlaHis: 2.18 ± 1.374
AlaIle: 4.36 ± 1.118
AlaLys: 7.267 ± 3.213
AlaLeu: 7.994 ± 2.326
AlaMet: 0.727 ± 0.598
AlaAsn: 2.907 ± 1.212
AlaPro: 4.36 ± 1.118
AlaGln: 5.087 ± 1.542
AlaArg: 4.36 ± 1.773
AlaSer: 5.814 ± 2.747
AlaThr: 7.994 ± 2.039
AlaVal: 5.087 ± 2.251
AlaTrp: 1.453 ± 1.052
AlaTyr: 1.453 ± 0.606
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.453 ± 0.605
CysCys: 0.0 ± 0.0
CysAsp: 0.727 ± 0.732
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.727 ± 0.688
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.727 ± 1.15
CysLeu: 0.727 ± 0.688
CysMet: 0.0 ± 0.0
CysAsn: 0.727 ± 1.15
CysPro: 0.727 ± 0.507
CysGln: 1.453 ± 0.605
CysArg: 0.727 ± 0.688
CysSer: 0.727 ± 0.688
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.727 ± 0.688
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.087 ± 1.951
AspCys: 0.727 ± 0.688
AspAsp: 1.453 ± 1.376
AspGlu: 2.18 ± 0.701
AspPhe: 3.634 ± 1.171
AspGly: 2.907 ± 1.211
AspHis: 0.727 ± 0.688
AspIle: 5.087 ± 1.169
AspLys: 2.907 ± 1.01
AspLeu: 7.994 ± 2.133
AspMet: 0.727 ± 0.55
AspAsn: 2.907 ± 1.487
AspPro: 1.453 ± 1.463
AspGln: 2.18 ± 0.943
AspArg: 2.18 ± 0.701
AspSer: 5.814 ± 0.983
AspThr: 2.907 ± 1.387
AspVal: 2.907 ± 1.851
AspTrp: 0.727 ± 0.507
AspTyr: 4.36 ± 0.797
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.18 ± 0.943
GluCys: 0.727 ± 0.732
GluAsp: 2.18 ± 1.765
GluGlu: 2.18 ± 0.879
GluPhe: 1.453 ± 0.958
GluGly: 2.18 ± 1.093
GluHis: 2.907 ± 0.616
GluIle: 4.36 ± 2.347
GluLys: 1.453 ± 1.376
GluLeu: 1.453 ± 1.463
GluMet: 1.453 ± 0.958
GluAsn: 2.18 ± 0.973
GluPro: 0.727 ± 0.507
GluGln: 2.907 ± 1.211
GluArg: 5.814 ± 2.298
GluSer: 5.087 ± 1.61
GluThr: 1.453 ± 1.052
GluVal: 4.36 ± 2.03
GluTrp: 2.18 ± 1.013
GluTyr: 2.18 ± 0.879
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.18 ± 1.318
PheCys: 0.727 ± 0.507
PheAsp: 1.453 ± 0.754
PheGlu: 2.18 ± 1.184
PhePhe: 2.18 ± 1.184
PheGly: 2.907 ± 1.301
PheHis: 0.727 ± 0.507
PheIle: 3.634 ± 1.422
PheLys: 3.634 ± 0.908
PheLeu: 2.907 ± 1.056
PheMet: 0.727 ± 0.507
PheAsn: 1.453 ± 0.606
PhePro: 0.727 ± 0.507
PheGln: 2.18 ± 1.093
PheArg: 1.453 ± 0.606
PheSer: 2.18 ± 1.841
PheThr: 2.18 ± 0.701
PheVal: 1.453 ± 1.013
PheTrp: 0.0 ± 0.0
PheTyr: 1.453 ± 1.376
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.814 ± 2.747
GlyCys: 0.0 ± 0.0
GlyAsp: 4.36 ± 1.607
GlyGlu: 5.814 ± 0.696
GlyPhe: 1.453 ± 0.758
GlyGly: 4.36 ± 1.319
GlyHis: 0.727 ± 0.688
GlyIle: 2.18 ± 0.469
GlyLys: 1.453 ± 1.267
GlyLeu: 4.36 ± 1.487
GlyMet: 0.727 ± 0.507
GlyAsn: 6.541 ± 1.343
GlyPro: 2.907 ± 1.002
GlyGln: 0.727 ± 0.598
GlyArg: 0.727 ± 0.507
GlySer: 3.634 ± 0.864
GlyThr: 7.267 ± 2.621
GlyVal: 5.087 ± 1.395
GlyTrp: 0.0 ± 0.0
GlyTyr: 4.36 ± 1.06
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.907 ± 1.692
HisCys: 0.727 ± 0.688
HisAsp: 4.36 ± 2.543
HisGlu: 1.453 ± 0.906
HisPhe: 2.907 ± 1.206
HisGly: 1.453 ± 0.605
HisHis: 1.453 ± 1.376
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.453 ± 0.605
HisMet: 0.0 ± 0.0
HisAsn: 0.727 ± 0.507
HisPro: 1.453 ± 0.605
HisGln: 0.727 ± 0.598
HisArg: 0.727 ± 0.688
HisSer: 2.907 ± 1.851
HisThr: 0.0 ± 0.0
HisVal: 0.727 ± 0.688
HisTrp: 0.0 ± 0.0
HisTyr: 4.36 ± 0.745
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.18 ± 1.18
IleCys: 0.0 ± 0.0
IleAsp: 3.634 ± 1.805
IleGlu: 0.727 ± 1.15
IlePhe: 0.727 ± 0.598
IleGly: 4.36 ± 1.94
IleHis: 2.18 ± 1.013
IleIle: 0.727 ± 0.507
IleLys: 4.36 ± 2.109
IleLeu: 5.814 ± 1.217
IleMet: 0.727 ± 0.507
IleAsn: 1.453 ± 0.754
IlePro: 5.814 ± 1.929
IleGln: 2.18 ± 1.193
IleArg: 4.36 ± 1.68
IleSer: 2.18 ± 2.321
IleThr: 5.814 ± 1.698
IleVal: 2.907 ± 1.299
IleTrp: 1.453 ± 1.013
IleTyr: 2.18 ± 1.52
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.087 ± 3.044
LysCys: 0.727 ± 0.688
LysAsp: 3.634 ± 1.083
LysGlu: 4.36 ± 1.142
LysPhe: 2.18 ± 1.056
LysGly: 2.907 ± 1.851
LysHis: 2.18 ± 0.973
LysIle: 2.907 ± 1.491
LysLys: 4.36 ± 2.043
LysLeu: 4.36 ± 2.44
LysMet: 0.727 ± 1.022
LysAsn: 2.907 ± 1.01
LysPro: 2.18 ± 1.385
LysGln: 2.18 ± 0.973
LysArg: 9.448 ± 2.419
LysSer: 3.634 ± 0.838
LysThr: 2.907 ± 1.203
LysVal: 1.453 ± 1.267
LysTrp: 0.727 ± 0.598
LysTyr: 3.634 ± 2.853
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.36 ± 1.715
LeuCys: 0.0 ± 0.0
LeuAsp: 2.907 ± 0.608
LeuGlu: 4.36 ± 1.999
LeuPhe: 2.18 ± 1.013
LeuGly: 6.541 ± 2.137
LeuHis: 3.634 ± 3.44
LeuIle: 3.634 ± 1.261
LeuLys: 4.36 ± 1.711
LeuLeu: 5.087 ± 2.537
LeuMet: 1.453 ± 1.115
LeuAsn: 5.087 ± 1.571
LeuPro: 5.814 ± 1.883
LeuGln: 4.36 ± 1.607
LeuArg: 5.814 ± 0.696
LeuSer: 3.634 ± 1.025
LeuThr: 5.814 ± 4.115
LeuVal: 2.907 ± 1.387
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.907 ± 0.616
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.453 ± 0.606
MetCys: 0.0 ± 0.0
MetAsp: 2.18 ± 1.093
MetGlu: 0.0 ± 0.0
MetPhe: 0.727 ± 1.15
MetGly: 2.18 ± 0.943
MetHis: 1.453 ± 0.605
MetIle: 1.453 ± 0.754
MetLys: 3.634 ± 1.339
MetLeu: 0.727 ± 0.507
MetMet: 0.727 ± 0.507
MetAsn: 0.0 ± 0.0
MetPro: 1.453 ± 1.013
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 1.453 ± 1.197
MetThr: 0.727 ± 1.15
MetVal: 0.727 ± 0.688
MetTrp: 0.0 ± 0.0
MetTyr: 1.453 ± 1.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 7.267 ± 3.325
AsnCys: 0.0 ± 0.0
AsnAsp: 3.634 ± 1.261
AsnGlu: 2.18 ± 0.701
AsnPhe: 2.907 ± 1.211
AsnGly: 6.541 ± 2.077
AsnHis: 0.727 ± 0.507
AsnIle: 5.087 ± 2.075
AsnLys: 2.907 ± 1.303
AsnLeu: 7.994 ± 0.482
AsnMet: 0.727 ± 0.507
AsnAsn: 2.907 ± 1.914
AsnPro: 2.907 ± 1.303
AsnGln: 2.907 ± 1.03
AsnArg: 0.727 ± 0.598
AsnSer: 3.634 ± 1.864
AsnThr: 2.907 ± 1.303
AsnVal: 0.727 ± 0.688
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.727 ± 0.688
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.994 ± 2.942
ProCys: 0.727 ± 0.688
ProAsp: 0.727 ± 0.507
ProGlu: 3.634 ± 2.107
ProPhe: 0.727 ± 1.15
ProGly: 2.18 ± 1.52
ProHis: 2.18 ± 2.064
ProIle: 3.634 ± 0.864
ProLys: 2.18 ± 0.839
ProLeu: 2.18 ± 0.879
ProMet: 2.18 ± 1.182
ProAsn: 2.907 ± 1.299
ProPro: 1.453 ± 0.906
ProGln: 2.907 ± 1.301
ProArg: 3.634 ± 1.025
ProSer: 2.907 ± 1.509
ProThr: 3.634 ± 1.171
ProVal: 6.541 ± 3.74
ProTrp: 0.727 ± 0.598
ProTyr: 0.727 ± 0.507
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.267 ± 3.28
GlnCys: 0.0 ± 0.0
GlnAsp: 1.453 ± 0.605
GlnGlu: 2.907 ± 1.338
GlnPhe: 2.18 ± 0.701
GlnGly: 2.907 ± 2.026
GlnHis: 1.453 ± 1.349
GlnIle: 0.727 ± 0.507
GlnLys: 2.907 ± 1.167
GlnLeu: 0.727 ± 0.598
GlnMet: 3.634 ± 0.838
GlnAsn: 1.453 ± 0.605
GlnPro: 2.18 ± 0.879
GlnGln: 3.634 ± 1.268
GlnArg: 4.36 ± 1.061
GlnSer: 2.907 ± 1.01
GlnThr: 2.18 ± 0.943
GlnVal: 1.453 ± 1.013
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.453 ± 0.906
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.18 ± 1.544
ArgCys: 0.0 ± 0.0
ArgAsp: 5.087 ± 1.614
ArgGlu: 2.907 ± 1.699
ArgPhe: 1.453 ± 0.906
ArgGly: 2.18 ± 0.943
ArgHis: 0.727 ± 0.732
ArgIle: 5.814 ± 1.973
ArgLys: 2.907 ± 1.692
ArgLeu: 6.541 ± 1.03
ArgMet: 2.18 ± 0.879
ArgAsn: 1.453 ± 0.958
ArgPro: 4.36 ± 1.758
ArgGln: 2.18 ± 0.943
ArgArg: 1.453 ± 0.958
ArgSer: 4.36 ± 1.06
ArgThr: 2.907 ± 0.616
ArgVal: 2.907 ± 1.301
ArgTrp: 1.453 ± 1.013
ArgTyr: 3.634 ± 0.864
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.814 ± 2.434
SerCys: 0.727 ± 0.688
SerAsp: 5.087 ± 0.925
SerGlu: 3.634 ± 0.982
SerPhe: 5.087 ± 1.464
SerGly: 1.453 ± 0.606
SerHis: 0.727 ± 0.688
SerIle: 3.634 ± 2.156
SerLys: 7.994 ± 3.023
SerLeu: 3.634 ± 1.171
SerMet: 0.0 ± 0.0
SerAsn: 5.814 ± 1.286
SerPro: 1.453 ± 0.605
SerGln: 2.18 ± 1.093
SerArg: 2.18 ± 0.879
SerSer: 10.174 ± 2.268
SerThr: 7.994 ± 0.482
SerVal: 2.907 ± 2.026
SerTrp: 0.727 ± 0.688
SerTyr: 1.453 ± 1.052
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.541 ± 2.32
ThrCys: 0.727 ± 0.688
ThrAsp: 2.907 ± 1.301
ThrGlu: 2.907 ± 1.299
ThrPhe: 2.18 ± 0.943
ThrGly: 6.541 ± 2.558
ThrHis: 0.727 ± 0.507
ThrIle: 2.18 ± 1.013
ThrLys: 2.18 ± 0.469
ThrLeu: 4.36 ± 1.707
ThrMet: 0.727 ± 0.72
ThrAsn: 5.814 ± 2.123
ThrPro: 5.814 ± 2.165
ThrGln: 3.634 ± 1.662
ThrArg: 3.634 ± 1.88
ThrSer: 5.087 ± 2.904
ThrThr: 2.18 ± 0.879
ThrVal: 2.18 ± 1.52
ThrTrp: 1.453 ± 1.376
ThrTyr: 4.36 ± 2.178
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.453 ± 0.606
ValCys: 0.727 ± 0.507
ValAsp: 2.907 ± 1.301
ValGlu: 0.727 ± 0.598
ValPhe: 0.727 ± 0.507
ValGly: 3.634 ± 1.441
ValHis: 2.18 ± 0.879
ValIle: 1.453 ± 0.605
ValLys: 5.087 ± 2.361
ValLeu: 4.36 ± 1.495
ValMet: 2.18 ± 0.879
ValAsn: 2.907 ± 1.211
ValPro: 6.541 ± 3.895
ValGln: 1.453 ± 0.754
ValArg: 2.907 ± 1.301
ValSer: 3.634 ± 1.171
ValThr: 2.907 ± 0.616
ValVal: 2.18 ± 0.879
ValTrp: 1.453 ± 0.605
ValTyr: 0.727 ± 0.688
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.18 ± 1.374
TrpCys: 0.0 ± 0.0
TrpAsp: 2.18 ± 1.52
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.453 ± 1.013
TrpGly: 0.727 ± 0.688
TrpHis: 0.727 ± 0.507
TrpIle: 0.727 ± 0.688
TrpLys: 0.0 ± 0.0
TrpLeu: 1.453 ± 1.376
TrpMet: 0.0 ± 0.0
TrpAsn: 2.18 ± 1.182
TrpPro: 0.0 ± 0.0
TrpGln: 0.727 ± 0.688
TrpArg: 0.0 ± 0.0
TrpSer: 0.727 ± 0.688
TrpThr: 0.727 ± 0.507
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.36 ± 1.231
TyrCys: 2.18 ± 1.013
TyrAsp: 3.634 ± 2.187
TyrGlu: 0.727 ± 1.15
TyrPhe: 0.727 ± 0.507
TyrGly: 2.18 ± 0.701
TyrHis: 0.727 ± 0.688
TyrIle: 2.18 ± 0.879
TyrLys: 2.18 ± 2.064
TyrLeu: 1.453 ± 0.606
TyrMet: 0.0 ± 0.0
TyrAsn: 5.087 ± 1.287
TyrPro: 1.453 ± 0.758
TyrGln: 2.18 ± 1.056
TyrArg: 2.18 ± 1.193
TyrSer: 2.18 ± 0.943
TyrThr: 3.634 ± 1.083
TyrVal: 3.634 ± 0.903
TyrTrp: 0.727 ± 0.688
TyrTyr: 1.453 ± 1.376
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1377 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
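One way such a protein-level bootstrap could look is sketched below. This is an illustration under stated assumptions, not the procedure actually used by Proteome-pI: the function names, the toy sequences, the fixed random seed, and the use of the sample standard deviation as the error are all hypothetical choices. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Per mille frequency of each overlapping residue pair (hypothetical helper)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of a dipeptide frequency over bootstrap replicates.

    Each replicate draws len(sequences) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Made-up toy sequences (one-letter codes), not data from this proteome.
toy_proteins = ["MKRSS", "ASSLK", "GGGGG"]
ss_error = bootstrap_error(toy_proteins, "SS")
```

With only a handful of proteins, as in this proteome, each replicate can differ strongly from the others, which is consistent with the relatively large errors in the table.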

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski