Amino acid dipeptide frequency for Apis mellifera associated microvirus 56

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.
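
As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs of adjacent residues within each protein and never across protein boundaries; the page does not state the exact counting procedure used by the database, so this is an assumption. The function name `dipeptide_frequencies` and the toy sequences (in one-letter codes) are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for overlapping adjacent residue pairs.

    Assumption: pairs are counted inside each sequence only,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter amino acid codes)
toy = ["MKAAL", "AAG"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 4))
```

The returned values are fractions that sum to 1; multiplying by 1000 gives per mille values like those in the table.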

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 1/400 = 0.25%. As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
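
The arithmetic behind the expected frequency can be sketched as follows. This assumes the simplest possible rule, a direct comparison of the observed frequency with the uniform expectation; the page does not state the exact threshold used for the red and blue marking, and the function names are hypothetical.

```python
def expected_uniform_frequency(alphabet_size=20):
    """Expected fraction of each dipeptide if all pairs were equally likely."""
    return 1.0 / (alphabet_size ** 2)


def classify(observed, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


n_pairs_20 = 20 ** 2   # 400 possible dipeptides
n_pairs_21 = 21 ** 2   # 441 if Xaa is included
expected = expected_uniform_frequency(20)  # 0.0025, i.e. 0.25%

# Observed fractions taken from the table (8.081 per mille and 0.673 per mille)
print(classify(0.008081, expected))
print(classify(0.000673, expected))
```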

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
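
A short sketch of the unit conversion, using the AlaAla value from the table as the worked example. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 8.081  # AlaAla frequency in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)   # about 0.008081
percent = per_mille_to_percent(ala_ala)     # about 0.8081

# The uniform expectation of 0.25% corresponds to 2.5 per mille
expected_per_mille = 0.25 * 10.0
```

On this scale, AlaAla at 8.081‰ lies above the uniform expectation of 2.5‰, while AlaCys at 0.673‰ lies below it.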

For more information, see the sequence space article on Wikipedia.

Residues (first and second position): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide: frequency (‰) ± error
Ala
AlaAla: 8.081 ± 3.859
AlaCys: 0.673 ± 0.6
AlaAsp: 4.714 ± 2.725
AlaGlu: 2.694 ± 1.575
AlaPhe: 3.367 ± 1.495
AlaGly: 4.714 ± 2.23
AlaHis: 2.694 ± 2.113
AlaIle: 7.407 ± 1.604
AlaLys: 3.367 ± 0.549
AlaLeu: 7.407 ± 2.466
AlaMet: 2.02 ± 0.9
AlaAsn: 8.081 ± 3.145
AlaPro: 4.04 ± 1.257
AlaGln: 3.367 ± 1.46
AlaArg: 4.04 ± 1.236
AlaSer: 7.407 ± 1.863
AlaThr: 5.387 ± 1.25
AlaVal: 6.061 ± 1.545
AlaTrp: 2.02 ± 0.97
AlaTyr: 5.387 ± 1.557
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.347 ± 1.042
CysPhe: 0.673 ± 0.521
CysGly: 0.0 ± 0.0
CysHis: 0.673 ± 0.6
CysIle: 0.0 ± 0.0
CysLys: 0.673 ± 0.6
CysLeu: 0.673 ± 0.817
CysMet: 0.673 ± 0.6
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.673 ± 0.6
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.673 ± 0.817
CysTrp: 0.673 ± 0.521
CysTyr: 0.673 ± 0.6
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.714 ± 1.406
AspCys: 0.673 ± 0.6
AspAsp: 4.04 ± 2.274
AspGlu: 1.347 ± 0.594
AspPhe: 3.367 ± 1.743
AspGly: 4.04 ± 1.941
AspHis: 0.0 ± 0.0
AspIle: 2.694 ± 1.533
AspLys: 4.714 ± 1.398
AspLeu: 4.714 ± 0.948
AspMet: 0.673 ± 0.661
AspAsn: 0.0 ± 0.0
AspPro: 4.04 ± 1.554
AspGln: 4.714 ± 1.811
AspArg: 2.694 ± 1.233
AspSer: 3.367 ± 1.794
AspThr: 4.04 ± 0.838
AspVal: 1.347 ± 0.594
AspTrp: 0.0 ± 0.0
AspTyr: 2.02 ± 0.756
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.04 ± 1.876
GluCys: 0.0 ± 0.0
GluAsp: 0.673 ± 0.6
GluGlu: 6.061 ± 2.987
GluPhe: 1.347 ± 0.971
GluGly: 3.367 ± 1.152
GluHis: 0.0 ± 0.0
GluIle: 3.367 ± 0.549
GluLys: 3.367 ± 2.255
GluLeu: 5.387 ± 1.29
GluMet: 2.694 ± 1.467
GluAsn: 2.694 ± 1.188
GluPro: 0.673 ± 0.521
GluGln: 4.04 ± 1.093
GluArg: 4.04 ± 2.778
GluSer: 3.367 ± 1.094
GluThr: 4.04 ± 1.346
GluVal: 3.367 ± 1.781
GluTrp: 1.347 ± 0.594
GluTyr: 2.02 ± 0.97
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.367 ± 1.337
PheCys: 0.0 ± 0.0
PheAsp: 4.04 ± 1.774
PheGlu: 2.694 ± 2.113
PhePhe: 3.367 ± 1.526
PheGly: 4.714 ± 1.271
PheHis: 1.347 ± 1.042
PheIle: 2.694 ± 0.984
PheLys: 4.714 ± 1.811
PheLeu: 1.347 ± 0.807
PheMet: 0.673 ± 0.593
PheAsn: 2.02 ± 1.562
PhePro: 2.694 ± 1.815
PheGln: 1.347 ± 1.042
PheArg: 4.04 ± 1.941
PheSer: 1.347 ± 0.807
PheThr: 2.694 ± 0.862
PheVal: 2.694 ± 1.614
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.061 ± 1.881
GlyCys: 0.0 ± 0.0
GlyAsp: 2.02 ± 0.9
GlyGlu: 2.694 ± 1.43
GlyPhe: 1.347 ± 1.042
GlyGly: 4.714 ± 2.951
GlyHis: 1.347 ± 0.616
GlyIle: 3.367 ± 1.152
GlyLys: 2.02 ± 2.662
GlyLeu: 5.387 ± 1.352
GlyMet: 0.0 ± 0.0
GlyAsn: 1.347 ± 1.042
GlyPro: 2.02 ± 1.562
GlyGln: 5.387 ± 1.462
GlyArg: 3.367 ± 1.644
GlySer: 8.081 ± 3.042
GlyThr: 8.754 ± 5.158
GlyVal: 2.02 ± 1.144
GlyTrp: 2.694 ± 1.179
GlyTyr: 1.347 ± 0.81
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.04 ± 2.751
HisCys: 0.0 ± 0.0
HisAsp: 2.02 ± 0.786
HisGlu: 1.347 ± 1.314
HisPhe: 1.347 ± 1.042
HisGly: 2.02 ± 0.792
HisHis: 0.673 ± 0.6
HisIle: 1.347 ± 1.314
HisLys: 0.0 ± 0.0
HisLeu: 2.02 ± 1.071
HisMet: 0.0 ± 0.0
HisAsn: 0.673 ± 0.521
HisPro: 1.347 ± 1.042
HisGln: 0.673 ± 0.661
HisArg: 2.694 ± 1.4
HisSer: 0.673 ± 0.521
HisThr: 0.673 ± 1.353
HisVal: 2.02 ± 1.406
HisTrp: 0.673 ± 0.521
HisTyr: 1.347 ± 1.001
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.061 ± 1.175
IleCys: 0.0 ± 0.0
IleAsp: 4.04 ± 1.217
IleGlu: 2.694 ± 0.707
IlePhe: 4.04 ± 1.25
IleGly: 4.04 ± 1.241
IleHis: 0.673 ± 0.888
IleIle: 2.694 ± 1.575
IleLys: 1.347 ± 0.98
IleLeu: 4.04 ± 1.217
IleMet: 1.347 ± 0.594
IleAsn: 5.387 ± 1.244
IlePro: 0.673 ± 0.6
IleGln: 4.04 ± 1.683
IleArg: 4.714 ± 1.953
IleSer: 4.714 ± 1.395
IleThr: 3.367 ± 1.217
IleVal: 0.673 ± 0.888
IleTrp: 0.673 ± 0.521
IleTyr: 1.347 ± 1.042
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.367 ± 2.444
LysCys: 0.673 ± 0.6
LysAsp: 2.02 ± 1.389
LysGlu: 2.694 ± 1.054
LysPhe: 2.02 ± 2.09
LysGly: 3.367 ± 1.094
LysHis: 2.694 ± 1.656
LysIle: 2.694 ± 1.771
LysLys: 3.367 ± 2.536
LysLeu: 4.04 ± 2.287
LysMet: 2.694 ± 1.14
LysAsn: 3.367 ± 1.332
LysPro: 1.347 ± 1.359
LysGln: 2.694 ± 1.682
LysArg: 2.02 ± 1.1
LysSer: 4.04 ± 1.592
LysThr: 4.04 ± 2.458
LysVal: 2.694 ± 0.88
LysTrp: 0.0 ± 0.0
LysTyr: 2.694 ± 1.233
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.04 ± 1.805
LeuCys: 0.673 ± 0.521
LeuAsp: 6.061 ± 0.559
LeuGlu: 3.367 ± 1.965
LeuPhe: 3.367 ± 1.094
LeuGly: 5.387 ± 1.814
LeuHis: 2.694 ± 1.057
LeuIle: 3.367 ± 1.832
LeuLys: 4.04 ± 1.572
LeuLeu: 5.387 ± 1.448
LeuMet: 1.347 ± 1.125
LeuAsn: 6.061 ± 1.873
LeuPro: 2.694 ± 1.4
LeuGln: 5.387 ± 1.329
LeuArg: 7.407 ± 1.795
LeuSer: 2.694 ± 0.984
LeuThr: 5.387 ± 1.294
LeuVal: 1.347 ± 1.042
LeuTrp: 0.673 ± 0.661
LeuTyr: 2.694 ± 0.724
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.347 ± 0.616
MetCys: 0.0 ± 0.0
MetAsp: 3.367 ± 1.108
MetGlu: 3.367 ± 1.999
MetPhe: 1.347 ± 1.042
MetGly: 2.02 ± 0.9
MetHis: 0.673 ± 0.521
MetIle: 1.347 ± 0.841
MetLys: 2.02 ± 0.786
MetLeu: 1.347 ± 1.359
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 2.02 ± 1.562
MetGln: 2.694 ± 1.112
MetArg: 0.0 ± 0.0
MetSer: 2.02 ± 0.786
MetThr: 1.347 ± 0.594
MetVal: 0.673 ± 0.888
MetTrp: 1.347 ± 1.057
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.714 ± 1.556
AsnCys: 2.02 ± 0.833
AsnAsp: 3.367 ± 1.087
AsnGlu: 2.694 ± 1.097
AsnPhe: 1.347 ± 0.807
AsnGly: 2.694 ± 1.233
AsnHis: 0.0 ± 0.0
AsnIle: 3.367 ± 1.492
AsnLys: 4.714 ± 2.175
AsnLeu: 3.367 ± 1.087
AsnMet: 2.694 ± 0.862
AsnAsn: 2.694 ± 1.771
AsnPro: 5.387 ± 1.736
AsnGln: 3.367 ± 1.485
AsnArg: 2.02 ± 0.61
AsnSer: 3.367 ± 1.152
AsnThr: 2.02 ± 2.0
AsnVal: 3.367 ± 2.188
AsnTrp: 0.673 ± 0.521
AsnTyr: 0.673 ± 0.521
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.754 ± 3.267
ProCys: 0.673 ± 0.6
ProAsp: 3.367 ± 1.478
ProGlu: 3.367 ± 1.036
ProPhe: 2.694 ± 2.083
ProGly: 4.714 ± 1.868
ProHis: 0.673 ± 0.6
ProIle: 2.694 ± 1.43
ProLys: 2.02 ± 0.833
ProLeu: 2.694 ± 1.233
ProMet: 1.347 ± 0.745
ProAsn: 4.04 ± 1.703
ProPro: 2.694 ± 1.371
ProGln: 2.694 ± 1.458
ProArg: 3.367 ± 1.922
ProSer: 4.04 ± 2.709
ProThr: 0.673 ± 0.661
ProVal: 2.02 ± 1.552
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 8.754 ± 1.689
GlnCys: 0.0 ± 0.0
GlnAsp: 0.673 ± 0.888
GlnGlu: 2.02 ± 0.61
GlnPhe: 1.347 ± 0.971
GlnGly: 4.04 ± 1.554
GlnHis: 2.694 ± 1.55
GlnIle: 3.367 ± 1.519
GlnLys: 4.04 ± 1.699
GlnLeu: 4.04 ± 1.266
GlnMet: 3.367 ± 1.629
GlnAsn: 4.04 ± 2.187
GlnPro: 1.347 ± 1.057
GlnGln: 3.367 ± 2.357
GlnArg: 2.694 ± 1.214
GlnSer: 2.02 ± 0.61
GlnThr: 3.367 ± 1.679
GlnVal: 4.714 ± 2.205
GlnTrp: 2.02 ± 1.389
GlnTyr: 0.673 ± 0.521
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.694 ± 1.682
ArgCys: 0.673 ± 0.817
ArgAsp: 2.694 ± 1.233
ArgGlu: 4.04 ± 1.699
ArgPhe: 3.367 ± 1.806
ArgGly: 3.367 ± 1.152
ArgHis: 1.347 ± 1.359
ArgIle: 3.367 ± 1.54
ArgLys: 4.04 ± 2.236
ArgLeu: 6.734 ± 2.411
ArgMet: 1.347 ± 0.616
ArgAsn: 2.02 ± 0.9
ArgPro: 4.714 ± 1.663
ArgGln: 5.387 ± 2.178
ArgArg: 4.04 ± 2.199
ArgSer: 2.02 ± 1.407
ArgThr: 2.694 ± 1.233
ArgVal: 1.347 ± 1.001
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.04 ± 1.849
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 10.101 ± 4.347
SerCys: 0.0 ± 0.0
SerAsp: 2.02 ± 1.801
SerGlu: 4.714 ± 1.587
SerPhe: 4.04 ± 2.355
SerGly: 1.347 ± 1.001
SerHis: 1.347 ± 1.042
SerIle: 5.387 ± 1.807
SerLys: 3.367 ± 1.121
SerLeu: 4.04 ± 1.861
SerMet: 0.0 ± 0.471
SerAsn: 2.02 ± 1.085
SerPro: 4.04 ± 1.217
SerGln: 4.04 ± 1.833
SerArg: 4.04 ± 1.326
SerSer: 2.02 ± 0.756
SerThr: 4.04 ± 1.675
SerVal: 1.347 ± 1.042
SerTrp: 2.02 ± 1.1
SerTyr: 2.694 ± 1.653
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.387 ± 1.819
ThrCys: 0.0 ± 0.0
ThrAsp: 2.02 ± 0.792
ThrGlu: 2.694 ± 1.057
ThrPhe: 2.02 ± 2.227
ThrGly: 4.04 ± 1.25
ThrHis: 1.347 ± 1.633
ThrIle: 4.04 ± 0.801
ThrLys: 3.367 ± 2.702
ThrLeu: 6.061 ± 2.512
ThrMet: 2.02 ± 1.032
ThrAsn: 3.367 ± 1.504
ThrPro: 7.407 ± 1.963
ThrGln: 2.02 ± 1.658
ThrArg: 3.367 ± 1.214
ThrSer: 6.734 ± 1.583
ThrThr: 4.714 ± 2.122
ThrVal: 1.347 ± 0.807
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.347 ± 0.841
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.061 ± 1.373
ValCys: 0.0 ± 0.0
ValAsp: 2.02 ± 1.032
ValGlu: 2.02 ± 1.144
ValPhe: 1.347 ± 0.616
ValGly: 2.694 ± 1.43
ValHis: 1.347 ± 0.807
ValIle: 1.347 ± 0.98
ValLys: 0.673 ± 0.6
ValLeu: 2.694 ± 1.458
ValMet: 1.347 ± 0.81
ValAsn: 2.02 ± 0.786
ValPro: 3.367 ± 1.152
ValGln: 2.694 ± 1.533
ValArg: 1.347 ± 1.057
ValSer: 2.02 ± 1.085
ValThr: 4.714 ± 2.356
ValVal: 2.02 ± 0.97
ValTrp: 0.673 ± 0.817
ValTyr: 1.347 ± 0.98
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 2.02 ± 1.446
TrpGlu: 2.694 ± 0.76
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.673 ± 0.6
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.673 ± 0.521
TrpMet: 1.347 ± 0.841
TrpAsn: 2.02 ± 1.791
TrpPro: 2.02 ± 1.562
TrpGln: 0.673 ± 0.6
TrpArg: 0.673 ± 0.817
TrpSer: 2.02 ± 1.1
TrpThr: 0.673 ± 0.521
TrpVal: 0.673 ± 0.6
TrpTrp: 1.347 ± 0.594
TrpTyr: 1.347 ± 0.594
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.347 ± 1.042
TyrCys: 1.347 ± 1.042
TyrAsp: 2.02 ± 0.792
TyrGlu: 1.347 ± 0.841
TyrPhe: 4.04 ± 1.941
TyrGly: 2.694 ± 1.179
TyrHis: 2.694 ± 1.55
TyrIle: 2.02 ± 0.833
TyrLys: 0.673 ± 0.661
TyrLeu: 2.02 ± 1.801
TyrMet: 0.673 ± 0.521
TyrAsn: 2.694 ± 0.724
TyrPro: 0.0 ± 0.0
TyrGln: 0.0 ± 0.0
TyrArg: 2.694 ± 1.499
TyrSer: 1.347 ± 1.042
TyrThr: 0.673 ± 0.521
TyrVal: 1.347 ± 1.201
TyrTrp: 2.02 ± 1.406
TyrTyr: 1.347 ± 0.81
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1486 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
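
One way such an error could be estimated is sketched below. It assumes that whole proteins are resampled with replacement 100 times and that the error is the standard deviation of the resulting frequency estimates; the page confirms only "bootstrapping (x100) at the protein level", so the choice of estimator, the counting of overlapping pairs within each protein, and the names `pair_frequency` and `bootstrap_error` are assumptions.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Fraction of overlapping adjacent residue pairs equal to `pair`."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the pair frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (protein-level bootstrap)
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, one-letter amino acid codes)
toy = ["MKAAL", "AAG", "GGKA"]
point = pair_frequency(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {point * 1000:.3f} ± {error * 1000:.3f} per mille")
```

With only a few proteins, as in this proteome, resamples can differ strongly from one another, which is consistent with errors that are large relative to the frequencies themselves.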

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski