Amino acid dipepetide frequency for Hendra virus (isolate Horse/Autralia/Hendra/1994)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.468AlaAla: 3.468 ± 1.478
0.33AlaCys: 0.33 ± 0.215
2.808AlaAsp: 2.808 ± 0.452
3.468AlaGlu: 3.468 ± 0.496
1.486AlaPhe: 1.486 ± 0.384
3.138AlaGly: 3.138 ± 0.768
0.495AlaHis: 0.495 ± 0.323
2.808AlaIle: 2.808 ± 0.733
2.808AlaLys: 2.808 ± 0.794
5.12AlaLeu: 5.12 ± 1.386
0.661AlaMet: 0.661 ± 0.253
1.156AlaAsn: 1.156 ± 0.382
1.652AlaPro: 1.652 ± 0.622
2.147AlaGln: 2.147 ± 0.771
1.652AlaArg: 1.652 ± 0.583
3.799AlaSer: 3.799 ± 0.539
2.477AlaThr: 2.477 ± 1.048
4.294AlaVal: 4.294 ± 1.178
1.486AlaTrp: 1.486 ± 0.678
1.156AlaTyr: 1.156 ± 0.291
0.0AlaXaa: 0.0 ± 0.0
Cys
0.33CysAla: 0.33 ± 0.163
0.826CysCys: 0.826 ± 0.318
0.991CysAsp: 0.991 ± 0.646
0.33CysGlu: 0.33 ± 0.263
0.991CysPhe: 0.991 ± 0.384
0.991CysGly: 0.991 ± 0.383
0.495CysHis: 0.495 ± 0.238
1.486CysIle: 1.486 ± 0.518
1.156CysLys: 1.156 ± 0.541
1.321CysLeu: 1.321 ± 0.521
0.165CysMet: 0.165 ± 0.108
0.661CysAsn: 0.661 ± 0.27
1.321CysPro: 1.321 ± 0.7
0.991CysGln: 0.991 ± 0.455
0.826CysArg: 0.826 ± 0.29
1.817CysSer: 1.817 ± 0.566
0.991CysThr: 0.991 ± 0.5
0.495CysVal: 0.495 ± 0.323
0.33CysTrp: 0.33 ± 0.251
0.661CysTyr: 0.661 ± 0.268
0.0CysXaa: 0.0 ± 0.0
Asp
1.486AspAla: 1.486 ± 0.463
0.661AspCys: 0.661 ± 0.302
4.789AspAsp: 4.789 ± 0.869
4.129AspGlu: 4.129 ± 0.974
3.303AspPhe: 3.303 ± 0.404
3.799AspGly: 3.799 ± 1.223
0.826AspHis: 0.826 ± 0.367
4.459AspIle: 4.459 ± 0.431
4.789AspLys: 4.789 ± 0.485
6.936AspLeu: 6.936 ± 0.826
0.661AspMet: 0.661 ± 0.314
3.799AspAsn: 3.799 ± 0.534
4.294AspPro: 4.294 ± 0.786
2.808AspGln: 2.808 ± 0.398
3.633AspArg: 3.633 ± 0.501
5.945AspSer: 5.945 ± 0.687
3.303AspThr: 3.303 ± 0.842
3.633AspVal: 3.633 ± 0.468
0.33AspTrp: 0.33 ± 0.215
1.982AspTyr: 1.982 ± 0.482
0.0AspXaa: 0.0 ± 0.0
Glu
2.312GluAla: 2.312 ± 0.916
2.808GluCys: 2.808 ± 0.996
5.45GluAsp: 5.45 ± 2.035
6.111GluGlu: 6.111 ± 1.915
2.642GluPhe: 2.642 ± 0.453
3.633GluGly: 3.633 ± 0.822
2.312GluHis: 2.312 ± 0.631
5.285GluIle: 5.285 ± 0.668
2.808GluLys: 2.808 ± 0.609
4.955GluLeu: 4.955 ± 0.845
1.156GluMet: 1.156 ± 0.599
3.303GluAsn: 3.303 ± 0.611
2.477GluPro: 2.477 ± 0.6
1.817GluGln: 1.817 ± 0.369
2.808GluArg: 2.808 ± 0.626
3.633GluSer: 3.633 ± 0.893
3.799GluThr: 3.799 ± 0.842
4.789GluVal: 4.789 ± 0.833
0.495GluTrp: 0.495 ± 0.232
2.312GluTyr: 2.312 ± 0.496
0.0GluXaa: 0.0 ± 0.0
Phe
3.964PheAla: 3.964 ± 1.06
0.495PheCys: 0.495 ± 0.323
1.652PheAsp: 1.652 ± 0.46
0.991PheGlu: 0.991 ± 0.303
1.486PhePhe: 1.486 ± 0.5
0.991PheGly: 0.991 ± 0.364
0.33PheHis: 0.33 ± 0.215
1.652PheIle: 1.652 ± 0.234
1.486PheLys: 1.486 ± 0.668
3.468PheLeu: 3.468 ± 0.837
0.991PheMet: 0.991 ± 0.449
1.817PheAsn: 1.817 ± 0.664
2.477PhePro: 2.477 ± 0.528
0.661PheGln: 0.661 ± 0.3
1.817PheArg: 1.817 ± 0.45
1.982PheSer: 1.982 ± 0.845
0.661PheThr: 0.661 ± 0.238
3.138PheVal: 3.138 ± 0.522
0.33PheTrp: 0.33 ± 0.215
0.661PheTyr: 0.661 ± 0.326
0.0PheXaa: 0.0 ± 0.0
Gly
2.147GlyAla: 2.147 ± 0.548
0.495GlyCys: 0.495 ± 0.323
4.294GlyAsp: 4.294 ± 1.001
2.477GlyGlu: 2.477 ± 0.675
1.652GlyPhe: 1.652 ± 0.543
5.945GlyGly: 5.945 ± 2.079
1.156GlyHis: 1.156 ± 0.581
3.633GlyIle: 3.633 ± 0.923
3.799GlyLys: 3.799 ± 0.779
6.771GlyLeu: 6.771 ± 0.478
1.652GlyMet: 1.652 ± 0.456
3.633GlyAsn: 3.633 ± 0.694
1.982GlyPro: 1.982 ± 0.351
1.817GlyGln: 1.817 ± 0.521
5.12GlyArg: 5.12 ± 1.184
5.78GlySer: 5.78 ± 1.201
3.964GlyThr: 3.964 ± 1.255
4.129GlyVal: 4.129 ± 1.042
0.661GlyTrp: 0.661 ± 0.277
1.486GlyTyr: 1.486 ± 0.499
0.0GlyXaa: 0.0 ± 0.0
His
0.661HisAla: 0.661 ± 0.232
0.495HisCys: 0.495 ± 0.316
2.147HisAsp: 2.147 ± 0.659
1.652HisGlu: 1.652 ± 0.41
0.0HisPhe: 0.0 ± 0.0
1.486HisGly: 1.486 ± 0.691
1.156HisHis: 1.156 ± 0.368
0.991HisIle: 0.991 ± 0.347
0.661HisLys: 0.661 ± 0.43
2.147HisLeu: 2.147 ± 0.697
1.156HisMet: 1.156 ± 0.366
0.33HisAsn: 0.33 ± 0.215
1.652HisPro: 1.652 ± 0.534
0.33HisGln: 0.33 ± 0.215
0.661HisArg: 0.661 ± 0.319
0.991HisSer: 0.991 ± 0.517
0.991HisThr: 0.991 ± 0.481
0.661HisVal: 0.661 ± 0.235
0.0HisTrp: 0.0 ± 0.0
0.661HisTyr: 0.661 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
3.964IleAla: 3.964 ± 1.074
1.156IleCys: 1.156 ± 0.288
6.276IleAsp: 6.276 ± 0.713
3.138IleGlu: 3.138 ± 0.68
1.652IlePhe: 1.652 ± 0.656
5.285IleGly: 5.285 ± 0.659
1.486IleHis: 1.486 ± 0.652
6.606IleIle: 6.606 ± 1.087
6.441IleLys: 6.441 ± 0.901
5.12IleLeu: 5.12 ± 1.006
2.477IleMet: 2.477 ± 0.67
3.964IleAsn: 3.964 ± 0.446
2.477IlePro: 2.477 ± 0.426
4.459IleGln: 4.459 ± 1.327
2.973IleArg: 2.973 ± 0.72
6.276IleSer: 6.276 ± 1.914
4.624IleThr: 4.624 ± 1.071
2.477IleVal: 2.477 ± 0.908
0.661IleTrp: 0.661 ± 0.238
3.303IleTyr: 3.303 ± 0.837
0.0IleXaa: 0.0 ± 0.0
Lys
2.808LysAla: 2.808 ± 0.844
0.991LysCys: 0.991 ± 0.505
3.468LysAsp: 3.468 ± 0.372
3.964LysGlu: 3.964 ± 0.76
2.642LysPhe: 2.642 ± 0.461
2.973LysGly: 2.973 ± 0.69
1.156LysHis: 1.156 ± 0.604
5.78LysIle: 5.78 ± 1.988
3.468LysLys: 3.468 ± 0.597
4.955LysLeu: 4.955 ± 0.573
0.661LysMet: 0.661 ± 0.304
3.799LysAsn: 3.799 ± 0.766
0.826LysPro: 0.826 ± 0.216
1.652LysGln: 1.652 ± 0.528
3.633LysArg: 3.633 ± 0.597
5.45LysSer: 5.45 ± 0.706
5.285LysThr: 5.285 ± 0.836
4.294LysVal: 4.294 ± 1.102
0.165LysTrp: 0.165 ± 0.108
2.973LysTyr: 2.973 ± 0.813
0.0LysXaa: 0.0 ± 0.0
Leu
3.799LeuAla: 3.799 ± 0.795
1.321LeuCys: 1.321 ± 0.469
6.606LeuAsp: 6.606 ± 0.695
6.111LeuGlu: 6.111 ± 0.639
3.138LeuPhe: 3.138 ± 0.885
5.45LeuGly: 5.45 ± 1.039
1.486LeuHis: 1.486 ± 0.638
6.936LeuIle: 6.936 ± 1.342
6.111LeuLys: 6.111 ± 1.307
7.267LeuLeu: 7.267 ± 1.436
2.642LeuMet: 2.642 ± 0.471
5.615LeuAsn: 5.615 ± 1.067
4.789LeuPro: 4.789 ± 0.78
3.303LeuGln: 3.303 ± 0.833
5.615LeuArg: 5.615 ± 0.868
8.918LeuSer: 8.918 ± 1.449
4.129LeuThr: 4.129 ± 1.041
4.789LeuVal: 4.789 ± 0.842
0.495LeuTrp: 0.495 ± 0.247
2.477LeuTyr: 2.477 ± 0.689
0.0LeuXaa: 0.0 ± 0.0
Met
1.321MetAla: 1.321 ± 0.501
0.165MetCys: 0.165 ± 0.108
1.652MetAsp: 1.652 ± 0.554
1.982MetGlu: 1.982 ± 0.826
0.33MetPhe: 0.33 ± 0.263
1.321MetGly: 1.321 ± 0.459
0.165MetHis: 0.165 ± 0.108
3.303MetIle: 3.303 ± 0.794
1.486MetLys: 1.486 ± 0.374
2.808MetLeu: 2.808 ± 0.489
0.991MetMet: 0.991 ± 0.416
2.147MetAsn: 2.147 ± 0.901
1.817MetPro: 1.817 ± 1.044
0.826MetGln: 0.826 ± 0.272
0.826MetArg: 0.826 ± 0.375
1.817MetSer: 1.817 ± 0.706
1.321MetThr: 1.321 ± 0.38
1.486MetVal: 1.486 ± 0.365
0.33MetTrp: 0.33 ± 0.199
1.156MetTyr: 1.156 ± 0.47
0.0MetXaa: 0.0 ± 0.0
Asn
1.982AsnAla: 1.982 ± 0.495
0.991AsnCys: 0.991 ± 0.368
3.964AsnAsp: 3.964 ± 0.819
1.982AsnGlu: 1.982 ± 0.342
0.826AsnPhe: 0.826 ± 0.332
1.982AsnGly: 1.982 ± 0.592
0.661AsnHis: 0.661 ± 0.302
5.12AsnIle: 5.12 ± 1.743
0.991AsnLys: 0.991 ± 0.268
5.285AsnLeu: 5.285 ± 1.28
1.156AsnMet: 1.156 ± 0.521
2.973AsnAsn: 2.973 ± 0.778
2.973AsnPro: 2.973 ± 0.394
4.624AsnGln: 4.624 ± 1.536
2.147AsnArg: 2.147 ± 0.559
3.633AsnSer: 3.633 ± 0.726
3.633AsnThr: 3.633 ± 0.501
3.799AsnVal: 3.799 ± 0.901
0.991AsnTrp: 0.991 ± 0.364
2.477AsnTyr: 2.477 ± 0.614
0.0AsnXaa: 0.0 ± 0.0
Pro
1.817ProAla: 1.817 ± 0.6
0.165ProCys: 0.165 ± 0.108
3.138ProAsp: 3.138 ± 1.085
5.285ProGlu: 5.285 ± 1.294
2.147ProPhe: 2.147 ± 0.413
3.633ProGly: 3.633 ± 0.862
0.33ProHis: 0.33 ± 0.152
3.799ProIle: 3.799 ± 0.442
4.624ProLys: 4.624 ± 1.465
2.973ProLeu: 2.973 ± 0.709
2.147ProMet: 2.147 ± 0.952
2.808ProAsn: 2.808 ± 0.465
3.468ProPro: 3.468 ± 0.791
1.486ProGln: 1.486 ± 0.463
3.138ProArg: 3.138 ± 0.433
3.964ProSer: 3.964 ± 0.819
1.486ProThr: 1.486 ± 0.521
3.303ProVal: 3.303 ± 0.612
0.495ProTrp: 0.495 ± 0.409
1.486ProTyr: 1.486 ± 0.513
0.0ProXaa: 0.0 ± 0.0
Gln
2.642GlnAla: 2.642 ± 0.591
0.661GlnCys: 0.661 ± 0.44
2.147GlnAsp: 2.147 ± 0.452
3.799GlnGlu: 3.799 ± 1.138
1.486GlnPhe: 1.486 ± 0.397
2.147GlnGly: 2.147 ± 0.485
0.165GlnHis: 0.165 ± 0.108
1.321GlnIle: 1.321 ± 0.851
4.294GlnLys: 4.294 ± 0.419
3.138GlnLeu: 3.138 ± 0.852
0.661GlnMet: 0.661 ± 0.536
0.826GlnAsn: 0.826 ± 0.38
2.147GlnPro: 2.147 ± 0.757
2.477GlnGln: 2.477 ± 0.326
1.982GlnArg: 1.982 ± 0.525
4.789GlnSer: 4.789 ± 0.668
1.982GlnThr: 1.982 ± 0.55
1.486GlnVal: 1.486 ± 0.393
0.661GlnTrp: 0.661 ± 0.293
0.33GlnTyr: 0.33 ± 0.222
0.0GlnXaa: 0.0 ± 0.0
Arg
3.964ArgAla: 3.964 ± 1.073
0.661ArgCys: 0.661 ± 0.315
1.982ArgAsp: 1.982 ± 0.854
4.789ArgGlu: 4.789 ± 0.67
0.661ArgPhe: 0.661 ± 0.267
4.129ArgGly: 4.129 ± 1.028
1.156ArgHis: 1.156 ± 0.493
3.303ArgIle: 3.303 ± 0.551
2.642ArgLys: 2.642 ± 0.656
6.771ArgLeu: 6.771 ± 0.72
1.321ArgMet: 1.321 ± 0.464
3.303ArgAsn: 3.303 ± 0.464
2.147ArgPro: 2.147 ± 0.579
1.486ArgGln: 1.486 ± 0.321
3.964ArgArg: 3.964 ± 0.972
4.624ArgSer: 4.624 ± 1.248
3.303ArgThr: 3.303 ± 0.64
2.808ArgVal: 2.808 ± 0.456
0.33ArgTrp: 0.33 ± 0.396
0.991ArgTyr: 0.991 ± 0.379
0.0ArgXaa: 0.0 ± 0.0
Ser
2.808SerAla: 2.808 ± 0.795
1.652SerCys: 1.652 ± 0.425
6.276SerAsp: 6.276 ± 0.581
3.138SerGlu: 3.138 ± 0.561
2.147SerPhe: 2.147 ± 0.534
6.276SerGly: 6.276 ± 0.971
2.477SerHis: 2.477 ± 0.476
6.276SerIle: 6.276 ± 0.745
5.12SerLys: 5.12 ± 0.768
7.267SerLeu: 7.267 ± 1.047
3.468SerMet: 3.468 ± 1.007
4.459SerAsn: 4.459 ± 0.99
4.294SerPro: 4.294 ± 0.559
3.799SerGln: 3.799 ± 0.944
4.129SerArg: 4.129 ± 0.606
7.432SerSer: 7.432 ± 2.169
5.45SerThr: 5.45 ± 0.564
4.955SerVal: 4.955 ± 0.726
0.826SerTrp: 0.826 ± 0.369
2.147SerTyr: 2.147 ± 0.305
0.0SerXaa: 0.0 ± 0.0
Thr
3.303ThrAla: 3.303 ± 1.054
0.991ThrCys: 0.991 ± 0.38
3.799ThrAsp: 3.799 ± 0.617
4.789ThrGlu: 4.789 ± 0.589
1.156ThrPhe: 1.156 ± 0.338
3.633ThrGly: 3.633 ± 0.508
0.991ThrHis: 0.991 ± 0.38
4.789ThrIle: 4.789 ± 0.826
3.633ThrLys: 3.633 ± 0.471
3.303ThrLeu: 3.303 ± 1.035
1.982ThrMet: 1.982 ± 0.436
1.817ThrAsn: 1.817 ± 0.585
3.633ThrPro: 3.633 ± 1.165
1.321ThrGln: 1.321 ± 0.463
4.129ThrArg: 4.129 ± 0.47
5.12ThrSer: 5.12 ± 0.586
3.303ThrThr: 3.303 ± 0.4
2.808ThrVal: 2.808 ± 0.553
0.661ThrTrp: 0.661 ± 0.32
1.486ThrTyr: 1.486 ± 0.344
0.0ThrXaa: 0.0 ± 0.0
Val
1.652ValAla: 1.652 ± 0.499
1.156ValCys: 1.156 ± 0.517
2.312ValAsp: 2.312 ± 0.612
3.633ValGlu: 3.633 ± 0.77
1.817ValPhe: 1.817 ± 0.612
3.138ValGly: 3.138 ± 0.584
0.33ValHis: 0.33 ± 0.215
4.294ValIle: 4.294 ± 0.734
2.642ValLys: 2.642 ± 0.807
7.267ValLeu: 7.267 ± 1.577
1.817ValMet: 1.817 ± 0.527
2.973ValAsn: 2.973 ± 0.468
5.78ValPro: 5.78 ± 1.289
1.982ValGln: 1.982 ± 0.647
3.799ValArg: 3.799 ± 1.074
5.285ValSer: 5.285 ± 0.86
3.138ValThr: 3.138 ± 0.607
2.808ValVal: 2.808 ± 1.222
0.165ValTrp: 0.165 ± 0.171
2.642ValTyr: 2.642 ± 0.58
0.0ValXaa: 0.0 ± 0.0
Trp
1.156TrpAla: 1.156 ± 0.254
0.495TrpCys: 0.495 ± 0.219
0.495TrpAsp: 0.495 ± 0.219
1.156TrpGlu: 1.156 ± 0.339
0.33TrpPhe: 0.33 ± 0.215
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.826TrpIle: 0.826 ± 0.309
0.33TrpLys: 0.33 ± 0.206
0.661TrpLeu: 0.661 ± 0.348
0.165TrpMet: 0.165 ± 0.108
0.33TrpAsn: 0.33 ± 0.206
0.165TrpPro: 0.165 ± 0.108
0.0TrpGln: 0.0 ± 0.0
0.661TrpArg: 0.661 ± 0.234
0.991TrpSer: 0.991 ± 0.24
0.826TrpThr: 0.826 ± 0.331
0.33TrpVal: 0.33 ± 0.251
0.165TrpTrp: 0.165 ± 0.108
0.495TrpTyr: 0.495 ± 0.323
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.826TyrAla: 0.826 ± 0.377
0.661TyrCys: 0.661 ± 0.281
1.321TyrAsp: 1.321 ± 0.391
1.982TyrGlu: 1.982 ± 0.548
1.156TyrPhe: 1.156 ± 0.351
2.642TyrGly: 2.642 ± 0.428
1.652TyrHis: 1.652 ± 0.612
2.147TyrIle: 2.147 ± 0.697
1.652TyrLys: 1.652 ± 0.498
3.468TyrLeu: 3.468 ± 0.95
1.156TyrMet: 1.156 ± 0.627
2.642TyrAsn: 2.642 ± 0.86
1.321TyrPro: 1.321 ± 0.463
1.156TyrGln: 1.156 ± 0.411
0.826TyrArg: 0.826 ± 0.297
2.147TyrSer: 2.147 ± 0.61
1.982TyrThr: 1.982 ± 0.475
2.147TyrVal: 2.147 ± 0.685
0.0TyrTrp: 0.0 ± 0.0
1.982TyrTyr: 1.982 ± 0.729
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (6056 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski