Amino acid dipeptide frequency for Apis mellifera associated microvirus 61

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
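The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names, the one-letter input format, the choice to count pairs within each protein only, and the mapping of every non-standard letter to X (Xaa) are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard residues as one-letter codes (assumed input alphabet).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected frequency if all 400 dipeptides were equally likely: 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and any non-standard letter is replaced by 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "as expected"
```

With this threshold, LysLys (16.773‰) would count as overrepresented and CysCys (0.799‰) as underrepresented.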

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.39 ± 3.328
AlaCys: 0.0 ± 0.0
AlaAsp: 5.591 ± 1.116
AlaGlu: 2.396 ± 0.8
AlaPhe: 2.396 ± 1.525
AlaGly: 4.792 ± 3.444
AlaHis: 3.195 ± 1.282
AlaIle: 3.994 ± 2.668
AlaLys: 3.195 ± 2.815
AlaLeu: 4.792 ± 1.419
AlaMet: 0.0 ± 0.0
AlaAsn: 3.195 ± 1.43
AlaPro: 4.792 ± 2.327
AlaGln: 7.987 ± 2.943
AlaArg: 1.597 ± 0.978
AlaSer: 3.195 ± 1.868
AlaThr: 3.195 ± 1.346
AlaVal: 3.994 ± 1.224
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.396 ± 1.327
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.799 ± 0.744
CysAsp: 1.597 ± 1.082
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.597 ± 1.488
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.597 ± 1.488
CysLeu: 0.799 ± 0.508
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.799 ± 0.744
CysGln: 0.0 ± 0.0
CysArg: 0.799 ± 0.508
CysSer: 0.0 ± 0.0
CysThr: 0.799 ± 0.744
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.591 ± 1.657
AspCys: 0.0 ± 0.0
AspAsp: 5.591 ± 3.385
AspGlu: 5.591 ± 1.544
AspPhe: 6.39 ± 1.432
AspGly: 2.396 ± 0.837
AspHis: 0.0 ± 0.0
AspIle: 2.396 ± 1.949
AspLys: 2.396 ± 1.825
AspLeu: 5.591 ± 1.657
AspMet: 0.0 ± 0.0
AspAsn: 3.195 ± 1.351
AspPro: 4.792 ± 1.104
AspGln: 0.799 ± 0.932
AspArg: 0.799 ± 0.508
AspSer: 3.994 ± 1.115
AspThr: 6.39 ± 2.478
AspVal: 3.994 ± 1.11
AspTrp: 0.0 ± 0.0
AspTyr: 4.792 ± 1.517
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.396 ± 0.837
GluCys: 0.799 ± 0.744
GluAsp: 3.195 ± 1.366
GluGlu: 0.0 ± 0.0
GluPhe: 3.195 ± 1.245
GluGly: 3.195 ± 1.6
GluHis: 2.396 ± 0.837
GluIle: 1.597 ± 0.675
GluLys: 3.994 ± 2.115
GluLeu: 5.591 ± 1.37
GluMet: 0.0 ± 0.0
GluAsn: 2.396 ± 0.837
GluPro: 4.792 ± 2.935
GluGln: 5.591 ± 1.925
GluArg: 1.597 ± 0.675
GluSer: 3.195 ± 0.922
GluThr: 2.396 ± 1.722
GluVal: 3.994 ± 1.488
GluTrp: 1.597 ± 0.675
GluTyr: 1.597 ± 0.675
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.195 ± 1.282
PheCys: 0.0 ± 0.0
PheAsp: 7.188 ± 5.07
PheGlu: 0.799 ± 0.932
PhePhe: 1.597 ± 0.94
PheGly: 6.39 ± 1.849
PheHis: 0.0 ± 0.0
PheIle: 3.195 ± 2.043
PheLys: 3.195 ± 1.245
PheLeu: 2.396 ± 1.525
PheMet: 0.799 ± 0.744
PheAsn: 3.195 ± 1.346
PhePro: 0.799 ± 0.508
PheGln: 1.597 ± 0.862
PheArg: 2.396 ± 1.525
PheSer: 0.0 ± 0.0
PheThr: 3.195 ± 1.063
PheVal: 2.396 ± 1.056
PheTrp: 0.799 ± 0.508
PheTyr: 1.597 ± 1.017
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.792 ± 4.555
GlyCys: 0.799 ± 0.508
GlyAsp: 4.792 ± 1.543
GlyGlu: 3.195 ± 1.473
GlyPhe: 3.195 ± 1.757
GlyGly: 3.195 ± 0.752
GlyHis: 0.799 ± 0.744
GlyIle: 4.792 ± 1.545
GlyLys: 3.195 ± 1.473
GlyLeu: 7.987 ± 2.118
GlyMet: 2.396 ± 1.059
GlyAsn: 3.994 ± 2.541
GlyPro: 1.597 ± 1.017
GlyGln: 3.195 ± 2.033
GlyArg: 0.799 ± 0.744
GlySer: 9.585 ± 1.722
GlyThr: 3.195 ± 1.43
GlyVal: 5.591 ± 2.348
GlyTrp: 0.799 ± 0.744
GlyTyr: 3.195 ± 1.282
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.597 ± 1.058
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.799 ± 0.744
HisPhe: 0.0 ± 0.0
HisGly: 1.597 ± 1.017
HisHis: 0.0 ± 0.0
HisIle: 1.597 ± 1.828
HisLys: 0.799 ± 1.035
HisLeu: 2.396 ± 0.986
HisMet: 0.799 ± 0.744
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 1.597 ± 1.058
HisArg: 0.799 ± 1.035
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 1.597 ± 1.017
HisTrp: 0.0 ± 0.0
HisTyr: 2.396 ± 1.327
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.792 ± 2.543
IleCys: 0.0 ± 0.0
IleAsp: 0.0 ± 0.0
IleGlu: 3.195 ± 1.245
IlePhe: 1.597 ± 0.675
IleGly: 3.994 ± 1.11
IleHis: 0.0 ± 0.0
IleIle: 1.597 ± 1.058
IleLys: 6.39 ± 2.587
IleLeu: 3.994 ± 1.11
IleMet: 1.597 ± 1.272
IleAsn: 3.994 ± 1.296
IlePro: 2.396 ± 1.166
IleGln: 1.597 ± 1.272
IleArg: 3.195 ± 0.865
IleSer: 5.591 ± 2.297
IleThr: 2.396 ± 1.327
IleVal: 1.597 ± 1.395
IleTrp: 0.799 ± 0.508
IleTyr: 2.396 ± 0.936
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.195 ± 2.075
LysCys: 0.799 ± 0.744
LysAsp: 6.39 ± 4.305
LysGlu: 7.987 ± 3.92
LysPhe: 3.994 ± 1.145
LysGly: 4.792 ± 1.978
LysHis: 0.799 ± 1.035
LysIle: 2.396 ± 1.975
LysLys: 16.773 ± 8.061
LysLeu: 5.591 ± 3.361
LysMet: 0.799 ± 0.773
LysAsn: 6.39 ± 4.601
LysPro: 3.994 ± 2.41
LysGln: 0.799 ± 0.932
LysArg: 4.792 ± 2.026
LysSer: 2.396 ± 1.542
LysThr: 3.994 ± 2.652
LysVal: 0.799 ± 1.035
LysTrp: 0.0 ± 0.0
LysTyr: 3.994 ± 2.799
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.792 ± 2.288
LeuCys: 0.0 ± 0.0
LeuAsp: 5.591 ± 2.852
LeuGlu: 3.994 ± 1.11
LeuPhe: 3.195 ± 1.473
LeuGly: 3.994 ± 1.697
LeuHis: 0.0 ± 0.0
LeuIle: 4.792 ± 1.419
LeuLys: 5.591 ± 1.925
LeuLeu: 2.396 ± 1.066
LeuMet: 2.396 ± 0.855
LeuAsn: 4.792 ± 1.601
LeuPro: 7.987 ± 2.44
LeuGln: 3.994 ± 1.296
LeuArg: 7.188 ± 0.622
LeuSer: 4.792 ± 1.517
LeuThr: 5.591 ± 1.642
LeuVal: 2.396 ± 1.525
LeuTrp: 1.597 ± 1.488
LeuTyr: 3.994 ± 1.77
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.799 ± 1.035
MetCys: 1.597 ± 1.488
MetAsp: 2.396 ± 1.066
MetGlu: 0.799 ± 0.932
MetPhe: 0.799 ± 0.508
MetGly: 0.799 ± 0.932
MetHis: 0.799 ± 0.508
MetIle: 0.799 ± 0.508
MetLys: 0.799 ± 0.744
MetLeu: 2.396 ± 1.166
MetMet: 0.0 ± 0.0
MetAsn: 0.799 ± 0.508
MetPro: 0.799 ± 0.508
MetGln: 1.597 ± 1.828
MetArg: 1.597 ± 1.058
MetSer: 2.396 ± 0.8
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.799 ± 0.508
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.396 ± 1.059
AsnCys: 0.799 ± 0.744
AsnAsp: 1.597 ± 1.082
AsnGlu: 3.195 ± 1.956
AsnPhe: 0.799 ± 0.508
AsnGly: 3.994 ± 1.97
AsnHis: 0.799 ± 0.508
AsnIle: 3.195 ± 0.865
AsnLys: 3.195 ± 0.823
AsnLeu: 4.792 ± 2.288
AsnMet: 1.597 ± 0.675
AsnAsn: 2.396 ± 0.837
AsnPro: 8.786 ± 2.866
AsnGln: 3.195 ± 2.631
AsnArg: 1.597 ± 1.017
AsnSer: 5.591 ± 1.475
AsnThr: 3.195 ± 1.069
AsnVal: 1.597 ± 1.017
AsnTrp: 0.799 ± 0.508
AsnTyr: 0.799 ± 0.508
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.195 ± 1.956
ProCys: 0.799 ± 0.744
ProAsp: 5.591 ± 1.37
ProGlu: 3.994 ± 1.3
ProPhe: 1.597 ± 1.395
ProGly: 3.994 ± 1.551
ProHis: 2.396 ± 1.575
ProIle: 4.792 ± 1.234
ProLys: 5.591 ± 2.4
ProLeu: 6.39 ± 2.069
ProMet: 2.396 ± 1.124
ProAsn: 0.799 ± 0.508
ProPro: 3.195 ± 2.972
ProGln: 5.591 ± 1.05
ProArg: 1.597 ± 0.978
ProSer: 3.195 ± 0.865
ProThr: 1.597 ± 1.017
ProVal: 3.994 ± 2.541
ProTrp: 0.799 ± 0.508
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.597 ± 0.862
GlnCys: 0.0 ± 0.0
GlnAsp: 1.597 ± 1.058
GlnGlu: 2.396 ± 1.166
GlnPhe: 0.799 ± 0.508
GlnGly: 7.188 ± 2.45
GlnHis: 0.799 ± 1.08
GlnIle: 6.39 ± 1.954
GlnLys: 6.39 ± 2.534
GlnLeu: 0.0 ± 0.0
GlnMet: 0.799 ± 0.932
GlnAsn: 4.792 ± 1.387
GlnPro: 2.396 ± 1.542
GlnGln: 4.792 ± 2.587
GlnArg: 1.597 ± 0.862
GlnSer: 4.792 ± 2.587
GlnThr: 3.994 ± 0.751
GlnVal: 2.396 ± 0.837
GlnTrp: 0.799 ± 1.08
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.396 ± 1.066
ArgCys: 0.0 ± 0.0
ArgAsp: 3.195 ± 0.865
ArgGlu: 2.396 ± 1.575
ArgPhe: 3.994 ± 1.11
ArgGly: 3.195 ± 1.757
ArgHis: 0.799 ± 1.08
ArgIle: 1.597 ± 0.978
ArgLys: 4.792 ± 1.907
ArgLeu: 3.195 ± 1.346
ArgMet: 0.799 ± 0.508
ArgAsn: 2.396 ± 1.327
ArgPro: 3.195 ± 1.351
ArgGln: 2.396 ± 0.986
ArgArg: 1.597 ± 1.017
ArgSer: 3.994 ± 2.541
ArgThr: 0.0 ± 0.0
ArgVal: 2.396 ± 0.936
ArgTrp: 0.799 ± 0.508
ArgTyr: 2.396 ± 1.166
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.792 ± 2.132
SerCys: 1.597 ± 0.978
SerAsp: 3.195 ± 2.105
SerGlu: 2.396 ± 1.059
SerPhe: 0.799 ± 0.508
SerGly: 5.591 ± 1.501
SerHis: 0.799 ± 1.035
SerIle: 3.994 ± 1.551
SerLys: 1.597 ± 1.058
SerLeu: 3.994 ± 1.662
SerMet: 1.597 ± 1.017
SerAsn: 7.188 ± 1.936
SerPro: 3.195 ± 0.823
SerGln: 2.396 ± 0.837
SerArg: 3.994 ± 2.773
SerSer: 5.591 ± 3.409
SerThr: 9.585 ± 1.929
SerVal: 6.39 ± 2.467
SerTrp: 0.799 ± 0.744
SerTyr: 0.799 ± 0.508
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.591 ± 3.24
ThrCys: 0.0 ± 0.0
ThrAsp: 1.597 ± 0.675
ThrGlu: 3.994 ± 2.091
ThrPhe: 2.396 ± 0.936
ThrGly: 5.591 ± 2.686
ThrHis: 0.799 ± 0.508
ThrIle: 1.597 ± 1.017
ThrLys: 7.987 ± 5.293
ThrLeu: 7.188 ± 1.747
ThrMet: 0.0 ± 0.0
ThrAsn: 1.597 ± 1.017
ThrPro: 0.799 ± 0.508
ThrGln: 0.799 ± 0.932
ThrArg: 4.792 ± 1.872
ThrSer: 5.591 ± 1.542
ThrThr: 4.792 ± 2.935
ThrVal: 3.994 ± 1.3
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.396 ± 1.327
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.792 ± 1.545
ValCys: 0.0 ± 0.0
ValAsp: 0.799 ± 0.508
ValGlu: 3.195 ± 1.069
ValPhe: 3.994 ± 2.81
ValGly: 1.597 ± 1.395
ValHis: 0.0 ± 0.0
ValIle: 2.396 ± 1.066
ValLys: 2.396 ± 1.575
ValLeu: 4.792 ± 2.288
ValMet: 1.597 ± 0.802
ValAsn: 2.396 ± 0.8
ValPro: 6.39 ± 4.066
ValGln: 2.396 ± 0.936
ValArg: 3.994 ± 1.934
ValSer: 2.396 ± 1.638
ValThr: 2.396 ± 0.936
ValVal: 0.0 ± 0.0
ValTrp: 1.597 ± 1.017
ValTyr: 1.597 ± 0.675
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.799 ± 0.744
TrpCys: 0.0 ± 0.0
TrpAsp: 2.396 ± 0.986
TrpGlu: 1.597 ± 0.675
TrpPhe: 2.396 ± 1.525
TrpGly: 0.0 ± 0.0
TrpHis: 0.799 ± 0.508
TrpIle: 0.0 ± 0.0
TrpLys: 0.799 ± 0.744
TrpLeu: 0.799 ± 0.744
TrpMet: 0.0 ± 0.0
TrpAsn: 0.799 ± 0.508
TrpPro: 0.799 ± 0.508
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.994 ± 1.145
TyrCys: 0.799 ± 0.744
TyrAsp: 2.396 ± 1.525
TyrGlu: 1.597 ± 0.675
TyrPhe: 1.597 ± 1.017
TyrGly: 3.195 ± 1.523
TyrHis: 0.799 ± 0.744
TyrIle: 0.0 ± 0.0
TyrLys: 0.799 ± 1.035
TyrLeu: 3.994 ± 1.97
TyrMet: 1.597 ± 1.058
TyrAsn: 0.0 ± 0.0
TyrPro: 0.0 ± 0.0
TyrGln: 3.195 ± 1.346
TyrArg: 0.799 ± 0.508
TyrSer: 3.994 ± 0.751
TyrThr: 4.792 ± 1.872
TyrVal: 1.597 ± 0.675
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.597 ± 1.488
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1253 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
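A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's own procedure: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is assumed to be the standard deviation of the resampled frequencies.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement,
    so the spread reflects variation between proteins, not between residues.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)
```

With only 5 proteins in this proteome, each resample often repeats or omits whole proteins, which is consistent with the large errors relative to the frequencies in the table.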

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski