Amino acid dipepetide frequency for Microvirus sp.

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.726AlaAla: 5.726 ± 3.426
0.521AlaCys: 0.521 ± 0.711
3.644AlaAsp: 3.644 ± 1.065
3.123AlaGlu: 3.123 ± 1.092
1.562AlaPhe: 1.562 ± 0.761
3.123AlaGly: 3.123 ± 3.034
1.041AlaHis: 1.041 ± 0.724
3.123AlaIle: 3.123 ± 1.193
4.685AlaLys: 4.685 ± 1.64
6.247AlaLeu: 6.247 ± 1.22
0.521AlaMet: 0.521 ± 0.506
6.767AlaAsn: 6.767 ± 1.697
4.164AlaPro: 4.164 ± 2.129
5.206AlaGln: 5.206 ± 3.128
1.562AlaArg: 1.562 ± 0.883
7.288AlaSer: 7.288 ± 2.288
3.644AlaThr: 3.644 ± 1.144
2.603AlaVal: 2.603 ± 0.671
0.0AlaTrp: 0.0 ± 0.0
2.082AlaTyr: 2.082 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
1.041CysAla: 1.041 ± 0.743
0.521CysCys: 0.521 ± 0.487
0.521CysAsp: 0.521 ± 0.362
2.603CysGlu: 2.603 ± 1.369
0.521CysPhe: 0.521 ± 0.362
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.521CysIle: 0.521 ± 0.497
1.041CysLys: 1.041 ± 0.879
2.603CysLeu: 2.603 ± 1.34
0.0CysMet: 0.0 ± 0.0
1.041CysAsn: 1.041 ± 0.724
0.0CysPro: 0.0 ± 0.0
0.521CysGln: 0.521 ± 0.487
1.562CysArg: 1.562 ± 0.82
1.041CysSer: 1.041 ± 0.743
1.562CysThr: 1.562 ± 0.57
1.041CysVal: 1.041 ± 0.879
1.041CysTrp: 1.041 ± 0.405
2.082CysTyr: 2.082 ± 0.876
0.0CysXaa: 0.0 ± 0.0
Asp
4.164AspAla: 4.164 ± 1.026
1.041AspCys: 1.041 ± 0.685
3.123AspAsp: 3.123 ± 2.217
1.041AspGlu: 1.041 ± 0.974
5.206AspPhe: 5.206 ± 1.907
1.562AspGly: 1.562 ± 0.761
0.521AspHis: 0.521 ± 0.487
3.644AspIle: 3.644 ± 1.183
4.164AspLys: 4.164 ± 1.745
4.685AspLeu: 4.685 ± 1.094
2.082AspMet: 2.082 ± 1.447
1.562AspAsn: 1.562 ± 0.761
2.082AspPro: 2.082 ± 0.632
0.521AspGln: 0.521 ± 0.506
3.123AspArg: 3.123 ± 0.655
7.808AspSer: 7.808 ± 1.82
3.123AspThr: 3.123 ± 1.029
3.123AspVal: 3.123 ± 1.092
1.041AspTrp: 1.041 ± 1.011
3.123AspTyr: 3.123 ± 1.075
0.0AspXaa: 0.0 ± 0.0
Glu
3.644GluAla: 3.644 ± 2.055
1.041GluCys: 1.041 ± 0.974
1.562GluAsp: 1.562 ± 1.035
3.123GluGlu: 3.123 ± 1.76
2.603GluPhe: 2.603 ± 0.797
0.521GluGly: 0.521 ± 0.362
0.521GluHis: 0.521 ± 0.487
3.644GluIle: 3.644 ± 2.136
1.562GluLys: 1.562 ± 0.716
4.685GluLeu: 4.685 ± 1.166
1.562GluMet: 1.562 ± 0.707
3.123GluAsn: 3.123 ± 0.649
0.521GluPro: 0.521 ± 0.487
3.123GluGln: 3.123 ± 1.824
1.562GluArg: 1.562 ± 0.881
2.603GluSer: 2.603 ± 0.532
1.041GluThr: 1.041 ± 0.535
1.041GluVal: 1.041 ± 0.634
0.0GluTrp: 0.0 ± 0.0
2.082GluTyr: 2.082 ± 1.949
0.0GluXaa: 0.0 ± 0.0
Phe
3.644PheAla: 3.644 ± 1.05
1.562PheCys: 1.562 ± 0.672
5.206PheAsp: 5.206 ± 0.946
4.685PheGlu: 4.685 ± 1.894
3.644PhePhe: 3.644 ± 2.136
3.123PheGly: 3.123 ± 0.92
0.521PheHis: 0.521 ± 0.711
3.644PheIle: 3.644 ± 1.101
1.562PheLys: 1.562 ± 0.778
5.206PheLeu: 5.206 ± 1.469
0.521PheMet: 0.521 ± 0.585
4.164PheAsn: 4.164 ± 0.558
1.562PhePro: 1.562 ± 1.462
0.521PheGln: 0.521 ± 0.506
3.123PheArg: 3.123 ± 1.68
2.603PheSer: 2.603 ± 1.018
4.164PheThr: 4.164 ± 1.266
3.644PheVal: 3.644 ± 1.477
0.0PheTrp: 0.0 ± 0.0
1.562PheTyr: 1.562 ± 0.566
0.0PheXaa: 0.0 ± 0.0
Gly
2.082GlyAla: 2.082 ± 1.065
0.521GlyCys: 0.521 ± 0.362
1.562GlyAsp: 1.562 ± 0.594
2.603GlyGlu: 2.603 ± 1.234
1.041GlyPhe: 1.041 ± 0.405
2.603GlyGly: 2.603 ± 1.265
0.521GlyHis: 0.521 ± 0.362
2.603GlyIle: 2.603 ± 1.265
2.603GlyLys: 2.603 ± 1.099
5.726GlyLeu: 5.726 ± 2.26
1.562GlyMet: 1.562 ± 0.761
1.562GlyAsn: 1.562 ± 1.086
0.0GlyPro: 0.0 ± 0.0
2.082GlyGln: 2.082 ± 0.81
0.0GlyArg: 0.0 ± 0.0
8.329GlySer: 8.329 ± 2.561
1.562GlyThr: 1.562 ± 0.325
2.082GlyVal: 2.082 ± 0.769
0.521GlyTrp: 0.521 ± 0.487
3.644GlyTyr: 3.644 ± 0.914
0.0GlyXaa: 0.0 ± 0.0
His
1.562HisAla: 1.562 ± 1.086
0.521HisCys: 0.521 ± 0.711
2.082HisAsp: 2.082 ± 1.286
0.0HisGlu: 0.0 ± 0.0
2.082HisPhe: 2.082 ± 1.022
0.521HisGly: 0.521 ± 0.362
0.521HisHis: 0.521 ± 0.585
1.562HisIle: 1.562 ± 0.57
1.562HisLys: 1.562 ± 0.82
2.082HisLeu: 2.082 ± 0.985
0.0HisMet: 0.0 ± 0.0
1.562HisAsn: 1.562 ± 0.594
1.562HisPro: 1.562 ± 0.82
0.0HisGln: 0.0 ± 0.0
1.562HisArg: 1.562 ± 0.82
2.082HisSer: 2.082 ± 1.466
1.041HisThr: 1.041 ± 0.634
1.562HisVal: 1.562 ± 0.988
0.521HisTrp: 0.521 ± 0.487
1.562HisTyr: 1.562 ± 1.462
0.0HisXaa: 0.0 ± 0.0
Ile
5.726IleAla: 5.726 ± 1.886
1.562IleCys: 1.562 ± 0.891
1.562IleAsp: 1.562 ± 1.086
1.041IleGlu: 1.041 ± 1.011
3.123IlePhe: 3.123 ± 0.529
5.726IleGly: 5.726 ± 1.26
1.562IleHis: 1.562 ± 0.566
2.082IleIle: 2.082 ± 1.251
4.164IleLys: 4.164 ± 1.15
3.123IleLeu: 3.123 ± 2.219
2.082IleMet: 2.082 ± 1.214
3.123IleAsn: 3.123 ± 0.655
3.123IlePro: 3.123 ± 1.49
1.562IleGln: 1.562 ± 0.82
1.041IleArg: 1.041 ± 0.658
5.726IleSer: 5.726 ± 2.066
1.562IleThr: 1.562 ± 0.843
2.082IleVal: 2.082 ± 0.85
0.521IleTrp: 0.521 ± 0.487
2.603IleTyr: 2.603 ± 1.263
0.0IleXaa: 0.0 ± 0.0
Lys
2.082LysAla: 2.082 ± 1.107
1.041LysCys: 1.041 ± 0.974
2.082LysAsp: 2.082 ± 1.267
2.082LysGlu: 2.082 ± 0.517
3.644LysPhe: 3.644 ± 1.059
2.082LysGly: 2.082 ± 1.377
1.562LysHis: 1.562 ± 1.035
3.644LysIle: 3.644 ± 1.955
2.082LysLys: 2.082 ± 1.529
6.767LysLeu: 6.767 ± 2.46
1.562LysMet: 1.562 ± 1.274
4.164LysAsn: 4.164 ± 0.819
2.082LysPro: 2.082 ± 0.769
2.082LysGln: 2.082 ± 0.827
3.123LysArg: 3.123 ± 1.68
4.685LysSer: 4.685 ± 0.904
2.603LysThr: 2.603 ± 1.047
4.685LysVal: 4.685 ± 1.949
0.521LysTrp: 0.521 ± 0.497
4.164LysTyr: 4.164 ± 1.274
0.0LysXaa: 0.0 ± 0.0
Leu
5.726LeuAla: 5.726 ± 1.028
3.123LeuCys: 3.123 ± 1.572
5.726LeuAsp: 5.726 ± 1.897
2.603LeuGlu: 2.603 ± 1.788
4.164LeuPhe: 4.164 ± 0.818
3.123LeuGly: 3.123 ± 0.749
5.726LeuHis: 5.726 ± 1.517
4.685LeuIle: 4.685 ± 2.863
8.329LeuLys: 8.329 ± 2.464
15.617LeuLeu: 15.617 ± 5.806
2.082LeuMet: 2.082 ± 1.453
5.726LeuAsn: 5.726 ± 1.453
3.644LeuPro: 3.644 ± 0.998
5.726LeuGln: 5.726 ± 1.503
7.808LeuArg: 7.808 ± 1.788
12.493LeuSer: 12.493 ± 1.357
1.562LeuThr: 1.562 ± 0.883
3.644LeuVal: 3.644 ± 2.461
2.082LeuTrp: 2.082 ± 0.906
4.685LeuTyr: 4.685 ± 1.53
0.0LeuXaa: 0.0 ± 0.0
Met
3.123MetAla: 3.123 ± 1.731
0.521MetCys: 0.521 ± 0.362
0.521MetAsp: 0.521 ± 0.362
0.521MetGlu: 0.521 ± 0.585
1.562MetPhe: 1.562 ± 1.073
0.521MetGly: 0.521 ± 0.362
1.041MetHis: 1.041 ± 0.535
0.521MetIle: 0.521 ± 0.362
0.521MetLys: 0.521 ± 0.585
5.726MetLeu: 5.726 ± 2.832
0.0MetMet: 0.0 ± 0.0
0.521MetAsn: 0.521 ± 0.506
1.041MetPro: 1.041 ± 0.634
1.562MetGln: 1.562 ± 0.976
0.521MetArg: 0.521 ± 0.362
2.603MetSer: 2.603 ± 0.851
0.521MetThr: 0.521 ± 0.362
1.562MetVal: 1.562 ± 0.976
0.0MetTrp: 0.0 ± 0.0
0.521MetTyr: 0.521 ± 0.497
0.0MetXaa: 0.0 ± 0.0
Asn
5.206AsnAla: 5.206 ± 1.911
0.0AsnCys: 0.0 ± 0.0
4.164AsnAsp: 4.164 ± 0.996
2.603AsnGlu: 2.603 ± 0.671
2.603AsnPhe: 2.603 ± 0.95
3.644AsnGly: 3.644 ± 2.113
0.0AsnHis: 0.0 ± 0.0
3.123AsnIle: 3.123 ± 1.131
3.123AsnLys: 3.123 ± 1.234
8.329AsnLeu: 8.329 ± 1.656
3.123AsnMet: 3.123 ± 1.152
3.644AsnAsn: 3.644 ± 1.678
2.603AsnPro: 2.603 ± 0.738
3.123AsnGln: 3.123 ± 1.188
0.521AsnArg: 0.521 ± 0.362
3.644AsnSer: 3.644 ± 0.684
2.082AsnThr: 2.082 ± 1.022
4.685AsnVal: 4.685 ± 0.992
1.041AsnTrp: 1.041 ± 0.405
2.082AsnTyr: 2.082 ± 1.46
0.0AsnXaa: 0.0 ± 0.0
Pro
1.041ProAla: 1.041 ± 0.405
1.562ProCys: 1.562 ± 0.672
4.685ProAsp: 4.685 ± 1.092
1.562ProGlu: 1.562 ± 0.82
2.082ProPhe: 2.082 ± 0.896
1.041ProGly: 1.041 ± 0.535
1.562ProHis: 1.562 ± 1.462
1.562ProIle: 1.562 ± 0.843
1.041ProLys: 1.041 ± 0.528
4.164ProLeu: 4.164 ± 0.834
0.0ProMet: 0.0 ± 0.0
1.041ProAsn: 1.041 ± 0.781
1.041ProPro: 1.041 ± 0.939
1.562ProGln: 1.562 ± 0.761
2.082ProArg: 2.082 ± 0.747
0.521ProSer: 0.521 ± 0.487
5.206ProThr: 5.206 ± 0.752
2.082ProVal: 2.082 ± 0.896
0.521ProTrp: 0.521 ± 0.497
4.164ProTyr: 4.164 ± 0.93
0.0ProXaa: 0.0 ± 0.0
Gln
1.562GlnAla: 1.562 ± 0.976
1.562GlnCys: 1.562 ± 0.594
3.644GlnAsp: 3.644 ± 2.28
2.603GlnGlu: 2.603 ± 1.955
1.562GlnPhe: 1.562 ± 0.747
1.041GlnGly: 1.041 ± 0.535
1.041GlnHis: 1.041 ± 0.634
3.123GlnIle: 3.123 ± 1.188
3.644GlnLys: 3.644 ± 1.232
2.603GlnLeu: 2.603 ± 0.739
2.082GlnMet: 2.082 ± 0.795
1.562GlnAsn: 1.562 ± 0.976
1.041GlnPro: 1.041 ± 0.781
2.082GlnGln: 2.082 ± 1.07
3.644GlnArg: 3.644 ± 1.575
3.123GlnSer: 3.123 ± 1.252
1.562GlnThr: 1.562 ± 0.325
3.123GlnVal: 3.123 ± 0.901
2.082GlnTrp: 2.082 ± 0.517
2.603GlnTyr: 2.603 ± 0.739
0.0GlnXaa: 0.0 ± 0.0
Arg
1.041ArgAla: 1.041 ± 0.528
1.562ArgCys: 1.562 ± 0.566
3.123ArgAsp: 3.123 ± 0.512
1.562ArgGlu: 1.562 ± 0.325
5.726ArgPhe: 5.726 ± 3.284
1.562ArgGly: 1.562 ± 0.881
1.041ArgHis: 1.041 ± 0.405
2.082ArgIle: 2.082 ± 1.377
4.164ArgLys: 4.164 ± 1.275
5.726ArgLeu: 5.726 ± 1.241
1.562ArgMet: 1.562 ± 0.986
2.082ArgAsn: 2.082 ± 1.085
2.082ArgPro: 2.082 ± 1.286
2.603ArgGln: 2.603 ± 0.532
2.082ArgArg: 2.082 ± 1.083
2.603ArgSer: 2.603 ± 0.968
4.164ArgThr: 4.164 ± 0.652
1.562ArgVal: 1.562 ± 1.462
1.041ArgTrp: 1.041 ± 0.781
3.123ArgTyr: 3.123 ± 0.899
0.0ArgXaa: 0.0 ± 0.0
Ser
7.808SerAla: 7.808 ± 1.504
1.041SerCys: 1.041 ± 0.69
7.288SerAsp: 7.288 ± 2.564
3.644SerGlu: 3.644 ± 0.878
4.164SerPhe: 4.164 ± 2.404
3.644SerGly: 3.644 ± 0.915
3.123SerHis: 3.123 ± 1.188
5.206SerIle: 5.206 ± 0.846
3.644SerLys: 3.644 ± 0.908
7.808SerLeu: 7.808 ± 1.356
2.603SerMet: 2.603 ± 0.891
6.767SerAsn: 6.767 ± 1.864
4.685SerPro: 4.685 ± 1.669
5.206SerGln: 5.206 ± 2.155
8.329SerArg: 8.329 ± 3.047
11.452SerSer: 11.452 ± 2.992
3.644SerThr: 3.644 ± 0.878
4.164SerVal: 4.164 ± 1.257
1.041SerTrp: 1.041 ± 0.405
2.603SerTyr: 2.603 ± 0.738
0.0SerXaa: 0.0 ± 0.0
Thr
5.206ThrAla: 5.206 ± 1.588
0.0ThrCys: 0.0 ± 0.0
2.082ThrAsp: 2.082 ± 0.85
1.562ThrGlu: 1.562 ± 0.85
2.082ThrPhe: 2.082 ± 0.793
3.123ThrGly: 3.123 ± 1.075
1.562ThrHis: 1.562 ± 0.594
3.644ThrIle: 3.644 ± 0.786
2.603ThrLys: 2.603 ± 1.29
3.644ThrLeu: 3.644 ± 1.072
1.041ThrMet: 1.041 ± 0.528
3.123ThrAsn: 3.123 ± 0.804
3.123ThrPro: 3.123 ± 0.655
1.562ThrGln: 1.562 ± 1.519
1.562ThrArg: 1.562 ± 1.086
5.206ThrSer: 5.206 ± 0.766
1.041ThrThr: 1.041 ± 0.535
0.0ThrVal: 0.0 ± 0.0
0.0ThrTrp: 0.0 ± 0.0
2.603ThrTyr: 2.603 ± 0.905
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
1.041ValCys: 1.041 ± 0.543
3.123ValAsp: 3.123 ± 0.649
2.082ValGlu: 2.082 ± 0.896
1.041ValPhe: 1.041 ± 0.405
4.164ValGly: 4.164 ± 1.849
1.041ValHis: 1.041 ± 0.405
2.082ValIle: 2.082 ± 0.81
3.123ValLys: 3.123 ± 1.738
5.726ValLeu: 5.726 ± 2.508
0.521ValMet: 0.521 ± 0.497
3.123ValAsn: 3.123 ± 1.578
2.603ValPro: 2.603 ± 0.691
3.123ValGln: 3.123 ± 0.892
2.603ValArg: 2.603 ± 1.388
8.329ValSer: 8.329 ± 3.174
2.603ValThr: 2.603 ± 0.857
4.164ValVal: 4.164 ± 2.471
0.521ValTrp: 0.521 ± 0.497
0.521ValTyr: 0.521 ± 0.711
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.521TrpGlu: 0.521 ± 0.487
1.041TrpPhe: 1.041 ± 0.69
0.0TrpGly: 0.0 ± 0.0
0.521TrpHis: 0.521 ± 0.487
0.0TrpIle: 0.0 ± 0.0
1.041TrpLys: 1.041 ± 0.405
3.644TrpLeu: 3.644 ± 1.921
0.0TrpMet: 0.0 ± 0.0
1.041TrpAsn: 1.041 ± 0.535
0.521TrpPro: 0.521 ± 0.711
0.521TrpGln: 0.521 ± 0.362
1.562TrpArg: 1.562 ± 0.325
0.521TrpSer: 0.521 ± 0.487
0.521TrpThr: 0.521 ± 0.362
1.041TrpVal: 1.041 ± 0.634
0.0TrpTrp: 0.0 ± 0.0
0.521TrpTyr: 0.521 ± 0.487
0.0TrpXaa: 0.0 ± 0.0
Tyr
6.247TyrAla: 6.247 ± 1.861
0.521TyrCys: 0.521 ± 0.487
1.041TyrAsp: 1.041 ± 0.405
0.0TyrGlu: 0.0 ± 0.0
5.206TyrPhe: 5.206 ± 1.839
2.082TyrGly: 2.082 ± 0.896
0.521TyrHis: 0.521 ± 0.487
2.603TyrIle: 2.603 ± 0.797
2.603TyrLys: 2.603 ± 0.532
4.164TyrLeu: 4.164 ± 1.442
0.0TyrMet: 0.0 ± 0.0
3.644TyrAsn: 3.644 ± 0.684
1.041TyrPro: 1.041 ± 0.974
2.603TyrGln: 2.603 ± 0.95
3.123TyrArg: 3.123 ± 0.529
5.206TyrSer: 5.206 ± 1.839
1.562TyrThr: 1.562 ± 0.778
3.644TyrVal: 3.644 ± 1.205
0.521TyrTrp: 0.521 ± 0.487
7.288TyrTyr: 7.288 ± 3.691
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1922 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski