Amino acid dipepetide frequency for Ambe virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.254AlaAla: 4.254 ± 1.894
0.501AlaCys: 0.501 ± 0.319
2.002AlaAsp: 2.002 ± 0.824
3.003AlaGlu: 3.003 ± 0.778
2.252AlaPhe: 2.252 ± 1.335
3.003AlaGly: 3.003 ± 1.487
1.251AlaHis: 1.251 ± 0.832
2.753AlaIle: 2.753 ± 0.312
2.503AlaLys: 2.503 ± 0.371
6.006AlaLeu: 6.006 ± 1.602
1.251AlaMet: 1.251 ± 0.622
2.252AlaAsn: 2.252 ± 0.881
2.002AlaPro: 2.002 ± 0.394
2.002AlaGln: 2.002 ± 0.41
1.752AlaArg: 1.752 ± 0.558
5.005AlaSer: 5.005 ± 1.038
4.004AlaThr: 4.004 ± 1.061
3.504AlaVal: 3.504 ± 1.046
0.25AlaTrp: 0.25 ± 0.491
2.002AlaTyr: 2.002 ± 0.64
0.0AlaXaa: 0.0 ± 0.0
Cys
0.501CysAla: 0.501 ± 0.319
0.501CysCys: 0.501 ± 0.319
0.751CysAsp: 0.751 ± 0.334
2.002CysGlu: 2.002 ± 1.41
1.502CysPhe: 1.502 ± 0.669
1.502CysGly: 1.502 ± 0.437
1.251CysHis: 1.251 ± 0.595
2.002CysIle: 2.002 ± 0.945
2.503CysLys: 2.503 ± 1.212
1.251CysLeu: 1.251 ± 0.595
1.001CysMet: 1.001 ± 0.382
1.251CysAsn: 1.251 ± 0.468
0.751CysPro: 0.751 ± 0.653
1.502CysGln: 1.502 ± 0.802
1.251CysArg: 1.251 ± 0.36
3.003CysSer: 3.003 ± 0.913
1.251CysThr: 1.251 ± 0.394
1.251CysVal: 1.251 ± 0.394
0.0CysTrp: 0.0 ± 0.0
1.001CysTyr: 1.001 ± 0.545
0.0CysXaa: 0.0 ± 0.0
Asp
2.252AspAla: 2.252 ± 1.234
1.502AspCys: 1.502 ± 0.761
3.504AspAsp: 3.504 ± 1.344
4.755AspGlu: 4.755 ± 0.895
2.252AspPhe: 2.252 ± 1.125
3.253AspGly: 3.253 ± 0.788
1.251AspHis: 1.251 ± 0.468
3.253AspIle: 3.253 ± 1.029
3.253AspLys: 3.253 ± 0.27
4.755AspLeu: 4.755 ± 1.379
2.252AspMet: 2.252 ± 1.437
2.753AspAsn: 2.753 ± 0.749
2.002AspPro: 2.002 ± 0.522
1.502AspGln: 1.502 ± 0.653
2.252AspArg: 2.252 ± 0.748
3.504AspSer: 3.504 ± 0.972
1.251AspThr: 1.251 ± 0.622
3.003AspVal: 3.003 ± 0.928
1.251AspTrp: 1.251 ± 0.374
1.752AspTyr: 1.752 ± 1.214
0.0AspXaa: 0.0 ± 0.0
Glu
4.505GluAla: 4.505 ± 1.002
3.003GluCys: 3.003 ± 0.913
4.755GluAsp: 4.755 ± 1.413
6.256GluGlu: 6.256 ± 1.374
4.755GluPhe: 4.755 ± 1.943
4.004GluGly: 4.004 ± 1.195
1.752GluHis: 1.752 ± 1.095
4.755GluIle: 4.755 ± 0.895
4.505GluLys: 4.505 ± 1.104
6.757GluLeu: 6.757 ± 0.964
3.003GluMet: 3.003 ± 0.404
3.754GluAsn: 3.754 ± 1.599
2.002GluPro: 2.002 ± 0.583
1.502GluGln: 1.502 ± 0.313
2.002GluArg: 2.002 ± 0.699
4.254GluSer: 4.254 ± 1.504
3.504GluThr: 3.504 ± 1.213
4.004GluVal: 4.004 ± 0.975
0.501GluTrp: 0.501 ± 0.146
1.502GluTyr: 1.502 ± 0.457
0.0GluXaa: 0.0 ± 0.0
Phe
4.004PheAla: 4.004 ± 1.061
1.752PheCys: 1.752 ± 0.618
3.003PheAsp: 3.003 ± 1.391
3.754PheGlu: 3.754 ± 0.535
3.003PhePhe: 3.003 ± 0.951
1.251PheGly: 1.251 ± 0.372
1.001PheHis: 1.001 ± 0.472
3.003PheIle: 3.003 ± 0.736
4.755PheLys: 4.755 ± 0.285
3.253PheLeu: 3.253 ± 1.111
2.002PheMet: 2.002 ± 0.8
2.753PheAsn: 2.753 ± 0.639
2.002PhePro: 2.002 ± 0.896
0.501PheGln: 0.501 ± 0.319
2.753PheArg: 2.753 ± 0.898
6.757PheSer: 6.757 ± 1.461
2.002PheThr: 2.002 ± 0.967
3.504PheVal: 3.504 ± 1.114
0.751PheTrp: 0.751 ± 0.215
1.001PheTyr: 1.001 ± 1.573
0.0PheXaa: 0.0 ± 0.0
Gly
2.002GlyAla: 2.002 ± 0.698
1.251GlyCys: 1.251 ± 0.36
1.502GlyAsp: 1.502 ± 0.496
2.503GlyGlu: 2.503 ± 0.66
4.505GlyPhe: 4.505 ± 0.535
3.754GlyGly: 3.754 ± 0.578
1.502GlyHis: 1.502 ± 0.653
4.004GlyIle: 4.004 ± 0.513
2.753GlyLys: 2.753 ± 0.809
5.255GlyLeu: 5.255 ± 0.621
1.752GlyMet: 1.752 ± 0.772
2.503GlyAsn: 2.503 ± 1.213
2.252GlyPro: 2.252 ± 0.759
2.503GlyGln: 2.503 ± 0.935
1.502GlyArg: 1.502 ± 0.742
6.006GlySer: 6.006 ± 0.605
3.504GlyThr: 3.504 ± 1.756
4.755GlyVal: 4.755 ± 0.755
1.251GlyTrp: 1.251 ± 0.374
0.501GlyTyr: 0.501 ± 0.448
0.0GlyXaa: 0.0 ± 0.0
His
0.501HisAla: 0.501 ± 0.319
0.25HisCys: 0.25 ± 0.16
1.752HisAsp: 1.752 ± 0.607
1.502HisGlu: 1.502 ± 1.474
1.752HisPhe: 1.752 ± 0.658
1.001HisGly: 1.001 ± 0.349
0.501HisHis: 0.501 ± 0.146
1.502HisIle: 1.502 ± 0.653
1.502HisLys: 1.502 ± 0.457
3.253HisLeu: 3.253 ± 0.722
0.25HisMet: 0.25 ± 0.16
0.501HisAsn: 0.501 ± 0.319
1.502HisPro: 1.502 ± 0.607
1.251HisGln: 1.251 ± 0.33
1.251HisArg: 1.251 ± 0.622
2.753HisSer: 2.753 ± 0.553
0.751HisThr: 0.751 ± 0.334
2.753HisVal: 2.753 ± 0.753
0.25HisTrp: 0.25 ± 0.537
1.502HisTyr: 1.502 ± 0.653
0.0HisXaa: 0.0 ± 0.0
Ile
2.252IleAla: 2.252 ± 0.846
3.253IleCys: 3.253 ± 0.568
3.504IleAsp: 3.504 ± 1.592
5.255IleGlu: 5.255 ± 0.813
1.251IlePhe: 1.251 ± 0.499
4.254IleGly: 4.254 ± 1.802
1.251IleHis: 1.251 ± 0.468
6.006IleIle: 6.006 ± 1.261
5.255IleLys: 5.255 ± 1.093
6.006IleLeu: 6.006 ± 0.801
1.502IleMet: 1.502 ± 0.429
2.503IleAsn: 2.503 ± 0.406
2.252IlePro: 2.252 ± 0.711
3.003IleGln: 3.003 ± 0.512
4.004IleArg: 4.004 ± 0.706
4.004IleSer: 4.004 ± 1.284
4.755IleThr: 4.755 ± 0.477
2.252IleVal: 2.252 ± 1.174
0.25IleTrp: 0.25 ± 0.16
2.002IleTyr: 2.002 ± 0.394
0.0IleXaa: 0.0 ± 0.0
Lys
2.503LysAla: 2.503 ± 0.423
1.502LysCys: 1.502 ± 0.993
3.003LysAsp: 3.003 ± 0.404
4.254LysGlu: 4.254 ± 0.946
3.504LysPhe: 3.504 ± 0.637
3.003LysGly: 3.003 ± 0.736
0.751LysHis: 0.751 ± 0.215
4.254LysIle: 4.254 ± 1.175
5.255LysLys: 5.255 ± 1.502
6.757LysLeu: 6.757 ± 0.738
3.253LysMet: 3.253 ± 0.258
3.253LysAsn: 3.253 ± 0.416
2.753LysPro: 2.753 ± 0.324
1.752LysGln: 1.752 ± 0.391
4.004LysArg: 4.004 ± 1.075
4.755LysSer: 4.755 ± 1.059
3.754LysThr: 3.754 ± 0.532
6.256LysVal: 6.256 ± 1.753
1.251LysTrp: 1.251 ± 0.656
2.252LysTyr: 2.252 ± 0.739
0.0LysXaa: 0.0 ± 0.0
Leu
5.255LeuAla: 5.255 ± 1.157
1.502LeuCys: 1.502 ± 0.429
4.254LeuAsp: 4.254 ± 1.109
5.756LeuGlu: 5.756 ± 0.368
5.255LeuPhe: 5.255 ± 0.983
4.254LeuGly: 4.254 ± 0.91
2.753LeuHis: 2.753 ± 1.474
6.256LeuIle: 6.256 ± 1.374
6.256LeuLys: 6.256 ± 0.807
7.257LeuLeu: 7.257 ± 1.002
3.754LeuMet: 3.754 ± 0.67
2.503LeuAsn: 2.503 ± 0.918
2.503LeuPro: 2.503 ± 0.891
2.753LeuGln: 2.753 ± 1.151
4.755LeuArg: 4.755 ± 0.403
11.762LeuSer: 11.762 ± 1.278
5.005LeuThr: 5.005 ± 1.266
5.255LeuVal: 5.255 ± 1.913
0.0LeuTrp: 0.0 ± 0.0
3.504LeuTyr: 3.504 ± 1.579
0.0LeuXaa: 0.0 ± 0.0
Met
2.002MetAla: 2.002 ± 0.394
0.0MetCys: 0.0 ± 0.0
1.502MetAsp: 1.502 ± 0.313
2.002MetGlu: 2.002 ± 0.537
2.002MetPhe: 2.002 ± 0.967
2.252MetGly: 2.252 ± 1.437
1.001MetHis: 1.001 ± 0.751
2.252MetIle: 2.252 ± 0.905
1.001MetLys: 1.001 ± 0.291
3.253MetLeu: 3.253 ± 1.111
1.251MetMet: 1.251 ± 0.372
1.752MetAsn: 1.752 ± 1.06
0.501MetPro: 0.501 ± 0.511
1.251MetGln: 1.251 ± 0.798
1.502MetArg: 1.502 ± 0.618
3.253MetSer: 3.253 ± 0.177
3.003MetThr: 3.003 ± 0.736
1.502MetVal: 1.502 ± 0.742
0.25MetTrp: 0.25 ± 0.218
0.25MetTyr: 0.25 ± 0.537
0.0MetXaa: 0.0 ± 0.0
Asn
2.753AsnAla: 2.753 ± 1.643
1.001AsnCys: 1.001 ± 0.871
2.503AsnAsp: 2.503 ± 0.66
2.252AsnGlu: 2.252 ± 0.602
3.504AsnPhe: 3.504 ± 0.572
3.003AsnGly: 3.003 ± 0.428
0.501AsnHis: 0.501 ± 0.319
1.752AsnIle: 1.752 ± 0.463
5.005AsnLys: 5.005 ± 2.115
5.255AsnLeu: 5.255 ± 0.616
0.25AsnMet: 0.25 ± 0.16
1.251AsnAsn: 1.251 ± 0.76
3.504AsnPro: 3.504 ± 0.618
0.501AsnGln: 0.501 ± 0.319
2.503AsnArg: 2.503 ± 1.189
4.755AsnSer: 4.755 ± 0.477
2.503AsnThr: 2.503 ± 0.875
1.502AsnVal: 1.502 ± 0.447
0.501AsnTrp: 0.501 ± 0.497
0.501AsnTyr: 0.501 ± 0.146
0.0AsnXaa: 0.0 ± 0.0
Pro
3.003ProAla: 3.003 ± 0.772
0.25ProCys: 0.25 ± 0.218
2.503ProAsp: 2.503 ± 0.312
4.505ProGlu: 4.505 ± 1.987
1.502ProPhe: 1.502 ± 0.669
3.003ProGly: 3.003 ± 0.428
0.751ProHis: 0.751 ± 0.433
1.001ProIle: 1.001 ± 0.751
1.251ProLys: 1.251 ± 0.468
3.003ProLeu: 3.003 ± 0.96
1.752ProMet: 1.752 ± 0.892
2.252ProAsn: 2.252 ± 1.89
0.25ProPro: 0.25 ± 0.218
1.001ProGln: 1.001 ± 0.482
3.253ProArg: 3.253 ± 1.133
3.754ProSer: 3.754 ± 1.584
1.752ProThr: 1.752 ± 0.603
0.751ProVal: 0.751 ± 0.479
0.501ProTrp: 0.501 ± 0.319
1.001ProTyr: 1.001 ± 0.382
0.0ProXaa: 0.0 ± 0.0
Gln
2.252GlnAla: 2.252 ± 1.795
1.251GlnCys: 1.251 ± 0.595
2.252GlnAsp: 2.252 ± 0.528
2.753GlnGlu: 2.753 ± 0.917
0.751GlnPhe: 0.751 ± 0.215
2.503GlnGly: 2.503 ± 0.423
1.502GlnHis: 1.502 ± 0.429
4.004GlnIle: 4.004 ± 1.396
2.753GlnLys: 2.753 ± 0.904
1.251GlnLeu: 1.251 ± 0.468
1.251GlnMet: 1.251 ± 0.622
0.751GlnAsn: 0.751 ± 0.479
1.251GlnPro: 1.251 ± 0.372
1.001GlnGln: 1.001 ± 0.564
0.751GlnArg: 0.751 ± 0.46
2.002GlnSer: 2.002 ± 0.486
1.502GlnThr: 1.502 ± 0.447
2.252GlnVal: 2.252 ± 0.345
0.25GlnTrp: 0.25 ± 0.218
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.753ArgAla: 2.753 ± 0.846
1.251ArgCys: 1.251 ± 0.76
2.252ArgAsp: 2.252 ± 0.83
3.754ArgGlu: 3.754 ± 1.621
3.003ArgPhe: 3.003 ± 1.448
2.252ArgGly: 2.252 ± 0.987
1.001ArgHis: 1.001 ± 0.291
2.252ArgIle: 2.252 ± 0.345
4.254ArgLys: 4.254 ± 1.366
4.505ArgLeu: 4.505 ± 0.702
1.251ArgMet: 1.251 ± 1.37
2.503ArgAsn: 2.503 ± 0.312
2.753ArgPro: 2.753 ± 0.818
1.752ArgGln: 1.752 ± 0.972
1.752ArgArg: 1.752 ± 0.869
2.002ArgSer: 2.002 ± 0.689
3.003ArgThr: 3.003 ± 0.482
5.506ArgVal: 5.506 ± 0.861
0.751ArgTrp: 0.751 ± 0.479
0.751ArgTyr: 0.751 ± 0.507
0.0ArgXaa: 0.0 ± 0.0
Ser
4.505SerAla: 4.505 ± 0.945
2.753SerCys: 2.753 ± 0.891
5.005SerAsp: 5.005 ± 1.775
7.007SerGlu: 7.007 ± 3.177
3.754SerPhe: 3.754 ± 0.707
4.004SerGly: 4.004 ± 0.764
2.252SerHis: 2.252 ± 0.825
5.005SerIle: 5.005 ± 0.931
6.256SerLys: 6.256 ± 1.144
8.509SerLeu: 8.509 ± 0.929
1.502SerMet: 1.502 ± 0.653
4.004SerAsn: 4.004 ± 0.874
4.254SerPro: 4.254 ± 0.767
2.753SerGln: 2.753 ± 0.324
4.254SerArg: 4.254 ± 1.825
8.008SerSer: 8.008 ± 1.616
4.505SerThr: 4.505 ± 1.311
6.757SerVal: 6.757 ± 0.409
2.002SerTrp: 2.002 ± 0.394
2.753SerTyr: 2.753 ± 0.587
0.0SerXaa: 0.0 ± 0.0
Thr
1.502ThrAla: 1.502 ± 0.982
2.503ThrCys: 2.503 ± 0.941
3.003ThrAsp: 3.003 ± 0.428
4.004ThrGlu: 4.004 ± 0.951
3.253ThrPhe: 3.253 ± 0.951
5.255ThrGly: 5.255 ± 0.427
1.752ThrHis: 1.752 ± 0.328
3.253ThrIle: 3.253 ± 1.244
2.753ThrLys: 2.753 ± 0.891
5.756ThrLeu: 5.756 ± 0.97
1.251ThrMet: 1.251 ± 0.33
2.002ThrAsn: 2.002 ± 0.583
1.752ThrPro: 1.752 ± 0.328
1.001ThrGln: 1.001 ± 0.349
4.505ThrArg: 4.505 ± 0.543
4.505ThrSer: 4.505 ± 1.204
3.003ThrThr: 3.003 ± 0.874
2.252ThrVal: 2.252 ± 0.606
0.25ThrTrp: 0.25 ± 0.16
1.502ThrTyr: 1.502 ± 0.429
0.0ThrXaa: 0.0 ± 0.0
Val
3.003ValAla: 3.003 ± 1.074
1.251ValCys: 1.251 ± 0.76
2.252ValAsp: 2.252 ± 0.345
3.754ValGlu: 3.754 ± 0.582
3.504ValPhe: 3.504 ± 0.273
2.002ValGly: 2.002 ± 0.537
3.504ValHis: 3.504 ± 0.852
5.005ValIle: 5.005 ± 1.038
4.004ValLys: 4.004 ± 1.649
4.254ValLeu: 4.254 ± 0.729
2.002ValMet: 2.002 ± 0.689
4.755ValAsn: 4.755 ± 1.726
1.251ValPro: 1.251 ± 0.735
2.503ValGln: 2.503 ± 0.326
4.254ValArg: 4.254 ± 0.845
7.508ValSer: 7.508 ± 1.028
2.753ValThr: 2.753 ± 0.818
2.753ValVal: 2.753 ± 0.315
0.501ValTrp: 0.501 ± 0.319
2.002ValTyr: 2.002 ± 0.699
0.0ValXaa: 0.0 ± 0.0
Trp
0.751TrpAla: 0.751 ± 0.479
0.0TrpCys: 0.0 ± 0.0
1.001TrpAsp: 1.001 ± 0.639
0.25TrpGlu: 0.25 ± 0.16
0.501TrpPhe: 0.501 ± 0.497
0.751TrpGly: 0.751 ± 0.433
0.0TrpHis: 0.0 ± 0.0
1.251TrpIle: 1.251 ± 0.394
0.0TrpLys: 0.0 ± 0.0
1.251TrpLeu: 1.251 ± 0.468
0.0TrpMet: 0.0 ± 0.0
0.751TrpAsn: 0.751 ± 0.334
0.501TrpPro: 0.501 ± 0.497
0.501TrpGln: 0.501 ± 0.448
0.25TrpArg: 0.25 ± 0.16
0.501TrpSer: 0.501 ± 0.319
1.251TrpThr: 1.251 ± 0.374
1.502TrpVal: 1.502 ± 0.496
0.501TrpTrp: 0.501 ± 0.497
0.25TrpTyr: 0.25 ± 0.16
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.001TyrAla: 1.001 ± 0.564
0.751TyrCys: 0.751 ± 0.606
1.251TyrAsp: 1.251 ± 0.394
1.752TyrGlu: 1.752 ± 0.871
1.001TyrPhe: 1.001 ± 0.382
1.251TyrGly: 1.251 ± 0.499
0.751TyrHis: 0.751 ± 0.479
1.251TyrIle: 1.251 ± 0.394
2.252TyrLys: 2.252 ± 0.528
3.003TyrLeu: 3.003 ± 0.197
1.001TyrMet: 1.001 ± 1.85
1.251TyrAsn: 1.251 ± 0.394
1.251TyrPro: 1.251 ± 0.625
1.752TyrGln: 1.752 ± 1.19
0.751TyrArg: 0.751 ± 0.479
1.752TyrSer: 1.752 ± 1.395
2.002TyrThr: 2.002 ± 0.583
1.502TyrVal: 1.502 ± 0.429
0.501TyrTrp: 0.501 ± 0.436
0.501TyrTyr: 0.501 ± 0.497
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3997 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski