Amino acid dipepetide frequency for Chaco virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.065AlaAla: 1.065 ± 0.641
0.0AlaCys: 0.0 ± 0.0
1.331AlaAsp: 1.331 ± 0.842
2.129AlaGlu: 2.129 ± 0.877
0.532AlaPhe: 0.532 ± 0.636
1.863AlaGly: 1.863 ± 0.512
1.065AlaHis: 1.065 ± 0.627
4.259AlaIle: 4.259 ± 1.528
0.799AlaLys: 0.799 ± 0.402
2.928AlaLeu: 2.928 ± 0.983
0.266AlaMet: 0.266 ± 0.157
2.129AlaAsn: 2.129 ± 0.995
1.597AlaPro: 1.597 ± 1.328
1.331AlaGln: 1.331 ± 0.764
1.065AlaArg: 1.065 ± 0.438
1.863AlaSer: 1.863 ± 0.557
0.799AlaThr: 0.799 ± 0.542
0.799AlaVal: 0.799 ± 0.477
0.532AlaTrp: 0.532 ± 0.32
1.065AlaTyr: 1.065 ± 0.492
0.0AlaXaa: 0.0 ± 0.0
Cys
0.799CysAla: 0.799 ± 0.442
0.0CysCys: 0.0 ± 0.0
1.597CysAsp: 1.597 ± 0.583
1.065CysGlu: 1.065 ± 0.492
1.597CysPhe: 1.597 ± 0.346
0.799CysGly: 0.799 ± 0.402
0.266CysHis: 0.266 ± 0.378
2.928CysIle: 2.928 ± 1.379
1.863CysLys: 1.863 ± 0.631
0.799CysLeu: 0.799 ± 0.539
0.266CysMet: 0.266 ± 0.157
1.065CysAsn: 1.065 ± 0.627
1.065CysPro: 1.065 ± 1.051
2.129CysGln: 2.129 ± 0.554
0.532CysArg: 0.532 ± 0.313
0.799CysSer: 0.799 ± 0.32
1.331CysThr: 1.331 ± 0.507
0.266CysVal: 0.266 ± 0.157
0.799CysTrp: 0.799 ± 0.369
0.799CysTyr: 0.799 ± 0.32
0.0CysXaa: 0.0 ± 0.0
Asp
1.065AspAla: 1.065 ± 0.438
1.065AspCys: 1.065 ± 0.475
4.791AspAsp: 4.791 ± 2.396
4.259AspGlu: 4.259 ± 1.342
4.259AspPhe: 4.259 ± 1.008
1.331AspGly: 1.331 ± 0.592
1.597AspHis: 1.597 ± 0.545
3.993AspIle: 3.993 ± 1.365
4.525AspLys: 4.525 ± 2.108
8.784AspLeu: 8.784 ± 1.891
2.396AspMet: 2.396 ± 1.093
3.46AspAsn: 3.46 ± 0.6
3.194AspPro: 3.194 ± 1.269
1.597AspGln: 1.597 ± 0.639
1.331AspArg: 1.331 ± 0.377
6.122AspSer: 6.122 ± 1.835
3.726AspThr: 3.726 ± 0.867
3.46AspVal: 3.46 ± 2.093
2.129AspTrp: 2.129 ± 0.477
2.396AspTyr: 2.396 ± 0.908
0.0AspXaa: 0.0 ± 0.0
Glu
0.532GluAla: 0.532 ± 0.757
1.331GluCys: 1.331 ± 0.613
5.59GluAsp: 5.59 ± 1.126
5.856GluGlu: 5.856 ± 0.929
1.331GluPhe: 1.331 ± 0.613
3.726GluGly: 3.726 ± 1.108
0.532GluHis: 0.532 ± 0.757
7.453GluIle: 7.453 ± 1.599
4.791GluLys: 4.791 ± 1.04
5.59GluLeu: 5.59 ± 1.057
2.129GluMet: 2.129 ± 0.753
3.194GluAsn: 3.194 ± 1.061
1.597GluPro: 1.597 ± 0.828
0.799GluGln: 0.799 ± 0.35
3.46GluArg: 3.46 ± 0.604
5.057GluSer: 5.057 ± 1.422
2.396GluThr: 2.396 ± 1.332
2.662GluVal: 2.662 ± 0.624
0.799GluTrp: 0.799 ± 0.539
1.863GluTyr: 1.863 ± 1.351
0.0GluXaa: 0.0 ± 0.0
Phe
0.799PheAla: 0.799 ± 0.442
0.532PheCys: 0.532 ± 0.313
2.928PheAsp: 2.928 ± 1.033
3.726PheGlu: 3.726 ± 1.8
2.396PhePhe: 2.396 ± 0.716
2.129PheGly: 2.129 ± 0.79
0.532PheHis: 0.532 ± 0.313
5.59PheIle: 5.59 ± 0.979
3.726PheLys: 3.726 ± 1.231
6.122PheLeu: 6.122 ± 1.469
0.799PheMet: 0.799 ± 0.425
1.863PheAsn: 1.863 ± 0.537
3.46PhePro: 3.46 ± 0.693
1.863PheGln: 1.863 ± 0.667
2.662PheArg: 2.662 ± 1.226
3.726PheSer: 3.726 ± 1.043
2.662PheThr: 2.662 ± 1.486
1.331PheVal: 1.331 ± 0.592
1.065PheTrp: 1.065 ± 0.435
2.129PheTyr: 2.129 ± 0.837
0.0PheXaa: 0.0 ± 0.0
Gly
1.331GlyAla: 1.331 ± 0.575
1.065GlyCys: 1.065 ± 0.394
2.928GlyAsp: 2.928 ± 0.671
3.194GlyGlu: 3.194 ± 0.944
3.993GlyPhe: 3.993 ± 0.851
2.928GlyGly: 2.928 ± 0.827
1.065GlyHis: 1.065 ± 0.571
3.993GlyIle: 3.993 ± 0.799
5.323GlyLys: 5.323 ± 0.896
6.122GlyLeu: 6.122 ± 0.543
1.331GlyMet: 1.331 ± 0.497
3.194GlyAsn: 3.194 ± 1.556
0.266GlyPro: 0.266 ± 0.157
2.662GlyGln: 2.662 ± 1.193
2.928GlyArg: 2.928 ± 0.853
4.791GlySer: 4.791 ± 1.546
4.259GlyThr: 4.259 ± 1.954
2.928GlyVal: 2.928 ± 1.553
1.065GlyTrp: 1.065 ± 0.438
1.597GlyTyr: 1.597 ± 0.644
0.0GlyXaa: 0.0 ± 0.0
His
0.266HisAla: 0.266 ± 0.157
0.0HisCys: 0.0 ± 0.0
1.331HisAsp: 1.331 ± 0.431
1.331HisGlu: 1.331 ± 0.985
1.597HisPhe: 1.597 ± 0.64
0.799HisGly: 0.799 ± 0.736
0.799HisHis: 0.799 ± 0.47
2.129HisIle: 2.129 ± 0.541
2.662HisLys: 2.662 ± 0.624
2.396HisLeu: 2.396 ± 1.088
0.532HisMet: 0.532 ± 0.313
1.065HisAsn: 1.065 ± 0.475
1.597HisPro: 1.597 ± 1.354
0.799HisGln: 0.799 ± 0.423
1.065HisArg: 1.065 ± 0.627
0.799HisSer: 0.799 ± 0.38
1.065HisThr: 1.065 ± 0.816
0.799HisVal: 0.799 ± 0.425
0.266HisTrp: 0.266 ± 0.378
0.532HisTyr: 0.532 ± 0.313
0.0HisXaa: 0.0 ± 0.0
Ile
2.129IleAla: 2.129 ± 0.991
2.662IleCys: 2.662 ± 0.458
5.323IleAsp: 5.323 ± 1.107
5.057IleGlu: 5.057 ± 1.051
3.46IlePhe: 3.46 ± 0.764
5.057IleGly: 5.057 ± 1.334
2.129IleHis: 2.129 ± 1.097
7.719IleIle: 7.719 ± 1.907
7.985IleLys: 7.985 ± 1.34
8.251IleLeu: 8.251 ± 0.854
2.928IleMet: 2.928 ± 1.243
6.92IleAsn: 6.92 ± 1.689
5.057IlePro: 5.057 ± 0.868
2.129IleGln: 2.129 ± 0.479
5.856IleArg: 5.856 ± 0.841
7.719IleSer: 7.719 ± 1.621
4.525IleThr: 4.525 ± 1.471
4.525IleVal: 4.525 ± 1.089
1.597IleTrp: 1.597 ± 0.378
3.993IleTyr: 3.993 ± 1.143
0.0IleXaa: 0.0 ± 0.0
Lys
1.863LysAla: 1.863 ± 0.3
1.065LysCys: 1.065 ± 0.627
5.057LysAsp: 5.057 ± 1.097
4.791LysGlu: 4.791 ± 1.158
3.993LysPhe: 3.993 ± 0.656
6.122LysGly: 6.122 ± 1.584
0.532LysHis: 0.532 ± 0.32
8.517LysIle: 8.517 ± 1.643
3.194LysLys: 3.194 ± 0.824
5.323LysLeu: 5.323 ± 1.384
2.396LysMet: 2.396 ± 0.831
5.856LysAsn: 5.856 ± 1.47
2.396LysPro: 2.396 ± 0.844
2.396LysGln: 2.396 ± 0.965
3.194LysArg: 3.194 ± 1.363
4.791LysSer: 4.791 ± 0.784
4.791LysThr: 4.791 ± 0.724
2.396LysVal: 2.396 ± 0.87
1.597LysTrp: 1.597 ± 0.378
3.46LysTyr: 3.46 ± 1.223
0.0LysXaa: 0.0 ± 0.0
Leu
3.194LeuAla: 3.194 ± 1.028
1.331LeuCys: 1.331 ± 0.507
5.323LeuAsp: 5.323 ± 1.438
5.59LeuGlu: 5.59 ± 0.726
5.323LeuPhe: 5.323 ± 0.944
6.388LeuGly: 6.388 ± 0.671
1.065LeuHis: 1.065 ± 0.487
8.251LeuIle: 8.251 ± 2.599
6.122LeuLys: 6.122 ± 1.687
7.187LeuLeu: 7.187 ± 1.851
3.194LeuMet: 3.194 ± 1.127
5.59LeuAsn: 5.59 ± 0.836
3.194LeuPro: 3.194 ± 0.632
2.396LeuGln: 2.396 ± 1.989
5.057LeuArg: 5.057 ± 1.224
8.784LeuSer: 8.784 ± 2.157
4.525LeuThr: 4.525 ± 0.926
3.46LeuVal: 3.46 ± 0.702
0.799LeuTrp: 0.799 ± 0.677
3.726LeuTyr: 3.726 ± 1.533
0.0LeuXaa: 0.0 ± 0.0
Met
1.597MetAla: 1.597 ± 0.661
0.532MetCys: 0.532 ± 0.32
0.799MetAsp: 0.799 ± 0.762
0.799MetGlu: 0.799 ± 1.032
1.863MetPhe: 1.863 ± 0.622
1.597MetGly: 1.597 ± 0.497
0.0MetHis: 0.0 ± 0.0
3.194MetIle: 3.194 ± 0.956
1.331MetLys: 1.331 ± 0.507
2.129MetLeu: 2.129 ± 0.737
0.532MetMet: 0.532 ± 0.32
1.331MetAsn: 1.331 ± 0.783
0.532MetPro: 0.532 ± 0.313
1.065MetGln: 1.065 ± 0.394
0.532MetArg: 0.532 ± 0.313
3.194MetSer: 3.194 ± 0.74
1.331MetThr: 1.331 ± 0.44
1.065MetVal: 1.065 ± 0.861
0.532MetTrp: 0.532 ± 0.423
1.065MetTyr: 1.065 ± 0.438
0.0MetXaa: 0.0 ± 0.0
Asn
1.863AsnAla: 1.863 ± 0.873
1.597AsnCys: 1.597 ± 0.639
4.259AsnAsp: 4.259 ± 0.809
1.597AsnGlu: 1.597 ± 1.154
3.194AsnPhe: 3.194 ± 1.207
2.396AsnGly: 2.396 ± 1.555
1.065AsnHis: 1.065 ± 0.487
6.654AsnIle: 6.654 ± 1.359
6.388AsnLys: 6.388 ± 1.242
4.791AsnLeu: 4.791 ± 1.503
1.065AsnMet: 1.065 ± 0.612
5.057AsnAsn: 5.057 ± 1.067
3.46AsnPro: 3.46 ± 0.701
1.331AsnGln: 1.331 ± 0.783
1.597AsnArg: 1.597 ± 0.444
3.194AsnSer: 3.194 ± 0.946
2.928AsnThr: 2.928 ± 0.818
2.662AsnVal: 2.662 ± 1.408
0.532AsnTrp: 0.532 ± 0.313
2.129AsnTyr: 2.129 ± 0.554
0.0AsnXaa: 0.0 ± 0.0
Pro
0.266ProAla: 0.266 ± 0.157
0.0ProCys: 0.0 ± 0.0
3.726ProAsp: 3.726 ± 0.527
2.662ProGlu: 2.662 ± 1.651
0.532ProPhe: 0.532 ± 0.636
1.597ProGly: 1.597 ± 1.41
0.0ProHis: 0.0 ± 0.0
5.856ProIle: 5.856 ± 1.456
2.928ProLys: 2.928 ± 1.196
3.194ProLeu: 3.194 ± 0.614
0.532ProMet: 0.532 ± 0.407
1.331ProAsn: 1.331 ± 1.042
2.129ProPro: 2.129 ± 0.489
0.0ProGln: 0.0 ± 0.0
1.597ProArg: 1.597 ± 0.464
3.726ProSer: 3.726 ± 1.594
3.46ProThr: 3.46 ± 0.931
3.726ProVal: 3.726 ± 1.798
0.532ProTrp: 0.532 ± 0.313
2.129ProTyr: 2.129 ± 0.616
0.0ProXaa: 0.0 ± 0.0
Gln
1.065GlnAla: 1.065 ± 0.569
0.266GlnCys: 0.266 ± 0.378
1.863GlnAsp: 1.863 ± 0.996
1.597GlnGlu: 1.597 ± 0.475
2.129GlnPhe: 2.129 ± 0.371
2.396GlnGly: 2.396 ± 0.66
0.799GlnHis: 0.799 ± 0.47
2.396GlnIle: 2.396 ± 0.819
2.129GlnLys: 2.129 ± 0.437
3.46GlnLeu: 3.46 ± 0.841
0.799GlnMet: 0.799 ± 0.668
2.129GlnAsn: 2.129 ± 0.673
0.266GlnPro: 0.266 ± 0.157
0.799GlnGln: 0.799 ± 0.484
0.799GlnArg: 0.799 ± 0.47
2.396GlnSer: 2.396 ± 0.715
0.799GlnThr: 0.799 ± 0.519
0.799GlnVal: 0.799 ± 0.32
1.331GlnTrp: 1.331 ± 1.119
1.065GlnTyr: 1.065 ± 0.687
0.0GlnXaa: 0.0 ± 0.0
Arg
2.129ArgAla: 2.129 ± 0.332
1.863ArgCys: 1.863 ± 0.636
2.129ArgAsp: 2.129 ± 0.616
2.662ArgGlu: 2.662 ± 1.019
2.129ArgPhe: 2.129 ± 0.603
3.194ArgGly: 3.194 ± 0.74
2.129ArgHis: 2.129 ± 0.51
3.993ArgIle: 3.993 ± 0.581
3.726ArgLys: 3.726 ± 1.104
1.863ArgLeu: 1.863 ± 0.554
0.799ArgMet: 0.799 ± 0.377
1.863ArgAsn: 1.863 ± 0.676
1.597ArgPro: 1.597 ± 0.475
1.597ArgGln: 1.597 ± 0.74
2.396ArgArg: 2.396 ± 1.098
2.396ArgSer: 2.396 ± 0.833
2.928ArgThr: 2.928 ± 1.06
3.194ArgVal: 3.194 ± 0.716
1.331ArgTrp: 1.331 ± 1.012
1.863ArgTyr: 1.863 ± 0.688
0.0ArgXaa: 0.0 ± 0.0
Ser
2.662SerAla: 2.662 ± 1.184
3.993SerCys: 3.993 ± 0.889
6.388SerAsp: 6.388 ± 1.689
5.59SerGlu: 5.59 ± 1.2
4.791SerPhe: 4.791 ± 2.354
3.993SerGly: 3.993 ± 1.838
2.396SerHis: 2.396 ± 1.076
5.856SerIle: 5.856 ± 0.851
4.525SerLys: 4.525 ± 1.471
6.388SerLeu: 6.388 ± 1.335
2.129SerMet: 2.129 ± 0.673
3.726SerAsn: 3.726 ± 0.872
2.662SerPro: 2.662 ± 0.78
1.863SerGln: 1.863 ± 0.833
3.993SerArg: 3.993 ± 1.336
8.251SerSer: 8.251 ± 1.138
3.993SerThr: 3.993 ± 1.364
2.928SerVal: 2.928 ± 1.319
1.065SerTrp: 1.065 ± 0.492
3.726SerTyr: 3.726 ± 0.851
0.0SerXaa: 0.0 ± 0.0
Thr
1.331ThrAla: 1.331 ± 0.563
0.799ThrCys: 0.799 ± 0.55
4.259ThrAsp: 4.259 ± 1.305
3.46ThrGlu: 3.46 ± 1.186
2.662ThrPhe: 2.662 ± 1.016
5.057ThrGly: 5.057 ± 0.707
2.396ThrHis: 2.396 ± 0.554
4.259ThrIle: 4.259 ± 1.217
4.259ThrLys: 4.259 ± 0.82
5.856ThrLeu: 5.856 ± 2.343
1.331ThrMet: 1.331 ± 0.783
1.863ThrAsn: 1.863 ± 0.3
1.331ThrPro: 1.331 ± 1.042
1.863ThrGln: 1.863 ± 0.953
1.065ThrArg: 1.065 ± 0.426
3.726ThrSer: 3.726 ± 0.647
2.662ThrThr: 2.662 ± 0.755
4.259ThrVal: 4.259 ± 1.025
0.532ThrTrp: 0.532 ± 0.313
2.396ThrTyr: 2.396 ± 0.719
0.0ThrXaa: 0.0 ± 0.0
Val
1.863ValAla: 1.863 ± 1.053
1.331ValCys: 1.331 ± 0.377
2.662ValAsp: 2.662 ± 0.959
2.129ValGlu: 2.129 ± 0.6
1.065ValPhe: 1.065 ± 0.76
1.065ValGly: 1.065 ± 0.685
1.863ValHis: 1.863 ± 0.537
5.057ValIle: 5.057 ± 1.088
2.662ValLys: 2.662 ± 0.615
3.194ValLeu: 3.194 ± 1.239
0.532ValMet: 0.532 ± 0.313
2.928ValAsn: 2.928 ± 1.196
2.129ValPro: 2.129 ± 0.616
1.597ValGln: 1.597 ± 0.918
4.259ValArg: 4.259 ± 0.969
4.791ValSer: 4.791 ± 0.931
4.525ValThr: 4.525 ± 1.485
1.331ValVal: 1.331 ± 0.307
0.799ValTrp: 0.799 ± 0.666
0.799ValTyr: 0.799 ± 0.931
0.0ValXaa: 0.0 ± 0.0
Trp
0.532TrpAla: 0.532 ± 0.643
0.0TrpCys: 0.0 ± 0.0
0.266TrpAsp: 0.266 ± 0.489
1.597TrpGlu: 1.597 ± 0.653
0.799TrpPhe: 0.799 ± 0.32
1.065TrpGly: 1.065 ± 0.627
0.532TrpHis: 0.532 ± 0.313
2.129TrpIle: 2.129 ± 2.078
1.331TrpLys: 1.331 ± 0.594
2.396TrpLeu: 2.396 ± 0.783
0.266TrpMet: 0.266 ± 0.369
0.799TrpAsn: 0.799 ± 0.47
0.532TrpPro: 0.532 ± 0.313
0.266TrpGln: 0.266 ± 0.157
0.532TrpArg: 0.532 ± 0.313
0.532TrpSer: 0.532 ± 0.313
1.065TrpThr: 1.065 ± 0.627
1.331TrpVal: 1.331 ± 0.594
0.0TrpTrp: 0.0 ± 0.0
1.597TrpTyr: 1.597 ± 0.378
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.065TyrAla: 1.065 ± 0.936
1.331TyrCys: 1.331 ± 0.565
2.928TyrAsp: 2.928 ± 1.15
1.597TyrGlu: 1.597 ± 0.74
2.396TyrPhe: 2.396 ± 0.532
3.46TyrGly: 3.46 ± 0.681
1.065TyrHis: 1.065 ± 0.568
0.799TyrIle: 0.799 ± 0.32
3.46TyrLys: 3.46 ± 1.399
3.726TyrLeu: 3.726 ± 0.905
0.532TyrMet: 0.532 ± 0.313
2.396TyrAsn: 2.396 ± 1.024
1.863TyrPro: 1.863 ± 0.491
1.065TyrGln: 1.065 ± 0.465
1.863TyrArg: 1.863 ± 0.465
4.259TyrSer: 4.259 ± 0.918
1.597TyrThr: 1.597 ± 0.768
2.662TyrVal: 2.662 ± 1.019
0.266TyrTrp: 0.266 ± 0.378
1.065TyrTyr: 1.065 ± 0.438
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (3758 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski