Amino acid dipepetide frequency for Barur virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.262AlaAla: 2.262 ± 0.668
1.414AlaCys: 1.414 ± 0.519
1.98AlaAsp: 1.98 ± 0.644
2.545AlaGlu: 2.545 ± 0.587
1.131AlaPhe: 1.131 ± 0.621
2.262AlaGly: 2.262 ± 0.507
1.414AlaHis: 1.414 ± 0.588
4.242AlaIle: 4.242 ± 0.875
3.111AlaLys: 3.111 ± 1.034
6.222AlaLeu: 6.222 ± 1.079
0.0AlaMet: 0.0 ± 0.0
1.414AlaAsn: 1.414 ± 0.588
1.98AlaPro: 1.98 ± 0.582
2.262AlaGln: 2.262 ± 1.236
3.111AlaArg: 3.111 ± 0.654
4.242AlaSer: 4.242 ± 0.806
1.697AlaThr: 1.697 ± 0.354
3.676AlaVal: 3.676 ± 0.755
0.0AlaTrp: 0.0 ± 0.0
1.131AlaTyr: 1.131 ± 0.373
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.566CysCys: 0.566 ± 0.313
0.848CysAsp: 0.848 ± 1.108
1.697CysGlu: 1.697 ± 0.717
0.566CysPhe: 0.566 ± 0.391
1.131CysGly: 1.131 ± 0.397
1.131CysHis: 1.131 ± 1.258
1.131CysIle: 1.131 ± 0.662
1.414CysLys: 1.414 ± 0.992
1.98CysLeu: 1.98 ± 0.676
0.283CysMet: 0.283 ± 0.466
0.283CysAsn: 0.283 ± 0.165
1.98CysPro: 1.98 ± 0.683
1.131CysGln: 1.131 ± 0.765
0.848CysArg: 0.848 ± 0.379
1.131CysSer: 1.131 ± 0.662
1.697CysThr: 1.697 ± 0.539
1.131CysVal: 1.131 ± 0.397
0.566CysTrp: 0.566 ± 0.331
0.566CysTyr: 0.566 ± 0.73
0.0CysXaa: 0.0 ± 0.0
Asp
1.414AspAla: 1.414 ± 0.405
0.566AspCys: 0.566 ± 0.313
4.242AspAsp: 4.242 ± 0.593
4.525AspGlu: 4.525 ± 1.821
2.545AspPhe: 2.545 ± 0.909
2.828AspGly: 2.828 ± 0.968
1.414AspHis: 1.414 ± 0.486
2.828AspIle: 2.828 ± 1.161
1.131AspLys: 1.131 ± 0.373
8.484AspLeu: 8.484 ± 1.733
1.697AspMet: 1.697 ± 0.757
1.697AspAsn: 1.697 ± 0.66
3.959AspPro: 3.959 ± 1.183
2.828AspGln: 2.828 ± 0.939
3.959AspArg: 3.959 ± 0.638
3.111AspSer: 3.111 ± 1.322
3.676AspThr: 3.676 ± 1.25
4.242AspVal: 4.242 ± 1.157
1.131AspTrp: 1.131 ± 0.373
3.676AspTyr: 3.676 ± 0.955
0.0AspXaa: 0.0 ± 0.0
Glu
2.828GluAla: 2.828 ± 1.276
0.566GluCys: 0.566 ± 0.61
5.09GluAsp: 5.09 ± 0.908
6.505GluGlu: 6.505 ± 2.511
3.959GluPhe: 3.959 ± 0.961
4.525GluGly: 4.525 ± 0.95
1.98GluHis: 1.98 ± 0.899
5.09GluIle: 5.09 ± 0.78
4.242GluLys: 4.242 ± 1.352
6.505GluLeu: 6.505 ± 1.81
1.697GluMet: 1.697 ± 0.604
3.111GluAsn: 3.111 ± 0.357
1.697GluPro: 1.697 ± 0.546
1.697GluGln: 1.697 ± 0.734
2.828GluArg: 2.828 ± 0.642
5.656GluSer: 5.656 ± 1.282
3.676GluThr: 3.676 ± 0.632
5.939GluVal: 5.939 ± 2.623
0.0GluTrp: 0.0 ± 0.0
2.828GluTyr: 2.828 ± 0.992
0.0GluXaa: 0.0 ± 0.0
Phe
1.697PheAla: 1.697 ± 0.757
1.131PheCys: 1.131 ± 0.89
1.98PheAsp: 1.98 ± 0.808
3.394PheGlu: 3.394 ± 0.98
2.545PhePhe: 2.545 ± 0.717
3.959PheGly: 3.959 ± 0.487
1.98PheHis: 1.98 ± 0.644
0.566PheIle: 0.566 ± 0.391
1.98PheLys: 1.98 ± 0.596
4.242PheLeu: 4.242 ± 2.097
0.848PheMet: 0.848 ± 0.649
1.697PheAsn: 1.697 ± 0.354
2.262PhePro: 2.262 ± 0.675
1.414PheGln: 1.414 ± 0.827
2.545PheArg: 2.545 ± 1.065
3.959PheSer: 3.959 ± 0.812
0.848PheThr: 0.848 ± 0.478
2.262PheVal: 2.262 ± 0.836
1.131PheTrp: 1.131 ± 0.626
0.848PheTyr: 0.848 ± 0.379
0.0PheXaa: 0.0 ± 0.0
Gly
1.98GlyAla: 1.98 ± 0.899
0.566GlyCys: 0.566 ± 0.331
3.111GlyAsp: 3.111 ± 0.842
3.959GlyGlu: 3.959 ± 1.438
1.98GlyPhe: 1.98 ± 0.699
3.959GlyGly: 3.959 ± 1.911
0.848GlyHis: 0.848 ± 0.317
3.676GlyIle: 3.676 ± 0.929
2.828GlyLys: 2.828 ± 1.128
8.767GlyLeu: 8.767 ± 1.688
1.697GlyMet: 1.697 ± 0.406
1.131GlyAsn: 1.131 ± 0.397
2.828GlyPro: 2.828 ± 0.364
3.394GlyGln: 3.394 ± 1.332
3.959GlyArg: 3.959 ± 1.425
4.808GlySer: 4.808 ± 0.965
2.262GlyThr: 2.262 ± 0.65
1.414GlyVal: 1.414 ± 0.963
0.566GlyTrp: 0.566 ± 0.331
1.414GlyTyr: 1.414 ± 0.405
0.0GlyXaa: 0.0 ± 0.0
His
1.697HisAla: 1.697 ± 0.472
1.131HisCys: 1.131 ± 0.662
3.111HisAsp: 3.111 ± 1.533
2.545HisGlu: 2.545 ± 1.059
0.566HisPhe: 0.566 ± 0.331
0.283HisGly: 0.283 ± 0.388
1.414HisHis: 1.414 ± 0.524
2.828HisIle: 2.828 ± 0.63
0.848HisLys: 0.848 ± 0.478
3.111HisLeu: 3.111 ± 2.096
1.697HisMet: 1.697 ± 0.546
0.566HisAsn: 0.566 ± 0.331
2.828HisPro: 2.828 ± 0.476
0.566HisGln: 0.566 ± 0.391
2.262HisArg: 2.262 ± 0.966
2.545HisSer: 2.545 ± 1.129
1.98HisThr: 1.98 ± 0.447
2.262HisVal: 2.262 ± 0.675
0.566HisTrp: 0.566 ± 0.313
1.131HisTyr: 1.131 ± 0.662
0.0HisXaa: 0.0 ± 0.0
Ile
2.545IleAla: 2.545 ± 1.015
1.131IleCys: 1.131 ± 0.373
3.959IleAsp: 3.959 ± 1.439
2.828IleGlu: 2.828 ± 0.364
1.414IlePhe: 1.414 ± 0.537
4.525IleGly: 4.525 ± 0.89
1.131IleHis: 1.131 ± 0.626
4.525IleIle: 4.525 ± 1.199
3.676IleLys: 3.676 ± 0.778
6.787IleLeu: 6.787 ± 1.335
0.848IleMet: 0.848 ± 0.536
3.111IleAsn: 3.111 ± 1.092
3.111IlePro: 3.111 ± 0.45
2.262IleGln: 2.262 ± 0.917
3.394IleArg: 3.394 ± 0.796
3.676IleSer: 3.676 ± 0.546
4.525IleThr: 4.525 ± 1.284
3.959IleVal: 3.959 ± 0.928
1.697IleTrp: 1.697 ± 0.546
1.98IleTyr: 1.98 ± 0.808
0.0IleXaa: 0.0 ± 0.0
Lys
3.676LysAla: 3.676 ± 1.105
2.262LysCys: 2.262 ± 0.375
5.373LysAsp: 5.373 ± 1.729
3.394LysGlu: 3.394 ± 0.523
2.545LysPhe: 2.545 ± 0.583
3.676LysGly: 3.676 ± 1.09
2.828LysHis: 2.828 ± 1.242
3.111LysIle: 3.111 ± 1.092
3.676LysLys: 3.676 ± 0.79
4.525LysLeu: 4.525 ± 0.926
1.697LysMet: 1.697 ± 1.607
2.545LysAsn: 2.545 ± 1.26
2.545LysPro: 2.545 ± 1.635
1.414LysGln: 1.414 ± 0.66
3.394LysArg: 3.394 ± 0.464
3.394LysSer: 3.394 ± 0.666
3.394LysThr: 3.394 ± 0.831
3.676LysVal: 3.676 ± 0.462
1.697LysTrp: 1.697 ± 0.634
1.414LysTyr: 1.414 ± 0.827
0.0LysXaa: 0.0 ± 0.0
Leu
4.808LeuAla: 4.808 ± 0.583
3.394LeuCys: 3.394 ± 0.754
6.222LeuAsp: 6.222 ± 0.916
8.767LeuGlu: 8.767 ± 1.744
3.676LeuPhe: 3.676 ± 1.342
5.09LeuGly: 5.09 ± 1.056
3.394LeuHis: 3.394 ± 0.998
7.07LeuIle: 7.07 ± 0.84
7.919LeuLys: 7.919 ± 0.969
7.636LeuLeu: 7.636 ± 3.629
3.111LeuMet: 3.111 ± 1.08
6.222LeuAsn: 6.222 ± 1.632
3.676LeuPro: 3.676 ± 1.029
1.414LeuGln: 1.414 ± 0.869
4.242LeuArg: 4.242 ± 1.465
9.333LeuSer: 9.333 ± 3.19
5.656LeuThr: 5.656 ± 1.493
5.939LeuVal: 5.939 ± 0.823
1.131LeuTrp: 1.131 ± 1.307
2.545LeuTyr: 2.545 ± 1.065
0.0LeuXaa: 0.0 ± 0.0
Met
1.131MetAla: 1.131 ± 0.568
0.848MetCys: 0.848 ± 0.478
1.697MetAsp: 1.697 ± 0.464
2.545MetGlu: 2.545 ± 1.532
1.131MetPhe: 1.131 ± 0.434
1.414MetGly: 1.414 ± 0.324
0.848MetHis: 0.848 ± 0.649
1.414MetIle: 1.414 ± 0.669
0.566MetLys: 0.566 ± 0.331
1.697MetLeu: 1.697 ± 0.734
1.414MetMet: 1.414 ± 0.99
1.414MetAsn: 1.414 ± 0.866
1.414MetPro: 1.414 ± 0.782
0.283MetGln: 0.283 ± 0.165
1.414MetArg: 1.414 ± 0.588
1.697MetSer: 1.697 ± 0.525
1.414MetThr: 1.414 ± 0.827
1.131MetVal: 1.131 ± 0.553
0.848MetTrp: 0.848 ± 0.478
0.566MetTyr: 0.566 ± 0.391
0.0MetXaa: 0.0 ± 0.0
Asn
2.545AsnAla: 2.545 ± 1.134
0.566AsnCys: 0.566 ± 0.775
1.697AsnAsp: 1.697 ± 0.604
1.98AsnGlu: 1.98 ± 0.923
1.414AsnPhe: 1.414 ± 0.528
2.545AsnGly: 2.545 ± 0.797
1.98AsnHis: 1.98 ± 0.447
3.394AsnIle: 3.394 ± 0.887
4.242AsnLys: 4.242 ± 0.899
4.242AsnLeu: 4.242 ± 0.842
0.848AsnMet: 0.848 ± 0.496
1.697AsnAsn: 1.697 ± 0.993
2.828AsnPro: 2.828 ± 1.075
1.414AsnGln: 1.414 ± 0.519
1.697AsnArg: 1.697 ± 0.525
2.545AsnSer: 2.545 ± 0.797
2.545AsnThr: 2.545 ± 0.662
1.414AsnVal: 1.414 ± 0.752
1.131AsnTrp: 1.131 ± 0.397
2.262AsnTyr: 2.262 ± 1.323
0.0AsnXaa: 0.0 ± 0.0
Pro
1.414ProAla: 1.414 ± 0.537
0.566ProCys: 0.566 ± 0.391
3.676ProAsp: 3.676 ± 1.276
2.828ProGlu: 2.828 ± 1.445
1.414ProPhe: 1.414 ± 0.486
2.262ProGly: 2.262 ± 1.607
1.697ProHis: 1.697 ± 0.66
2.262ProIle: 2.262 ± 0.691
1.98ProLys: 1.98 ± 0.808
5.656ProLeu: 5.656 ± 2.272
0.566ProMet: 0.566 ± 0.822
1.414ProAsn: 1.414 ± 0.324
2.828ProPro: 2.828 ± 1.615
1.131ProGln: 1.131 ± 0.621
2.262ProArg: 2.262 ± 0.341
4.525ProSer: 4.525 ± 2.285
4.242ProThr: 4.242 ± 1.852
4.808ProVal: 4.808 ± 0.732
0.283ProTrp: 0.283 ± 0.165
2.545ProTyr: 2.545 ± 1.637
0.0ProXaa: 0.0 ± 0.0
Gln
1.414GlnAla: 1.414 ± 1.132
0.283GlnCys: 0.283 ± 0.165
0.848GlnAsp: 0.848 ± 0.474
1.98GlnGlu: 1.98 ± 1.499
0.283GlnPhe: 0.283 ± 0.165
1.98GlnGly: 1.98 ± 0.567
0.283GlnHis: 0.283 ± 0.165
1.697GlnIle: 1.697 ± 0.634
2.262GlnLys: 2.262 ± 0.574
2.262GlnLeu: 2.262 ± 1.057
1.131GlnMet: 1.131 ± 0.89
1.414GlnAsn: 1.414 ± 0.324
1.98GlnPro: 1.98 ± 0.911
0.566GlnGln: 0.566 ± 0.581
2.545GlnArg: 2.545 ± 0.713
2.545GlnSer: 2.545 ± 0.667
1.414GlnThr: 1.414 ± 0.519
2.262GlnVal: 2.262 ± 1.083
0.283GlnTrp: 0.283 ± 0.165
1.697GlnTyr: 1.697 ± 0.802
0.0GlnXaa: 0.0 ± 0.0
Arg
2.828ArgAla: 2.828 ± 0.728
2.262ArgCys: 2.262 ± 1.006
2.828ArgAsp: 2.828 ± 0.991
4.525ArgGlu: 4.525 ± 1.285
3.676ArgPhe: 3.676 ± 1.115
3.111ArgGly: 3.111 ± 0.889
1.98ArgHis: 1.98 ± 0.913
1.98ArgIle: 1.98 ± 0.597
3.959ArgLys: 3.959 ± 1.39
5.656ArgLeu: 5.656 ± 0.965
1.131ArgMet: 1.131 ± 0.579
2.262ArgAsn: 2.262 ± 0.966
2.262ArgPro: 2.262 ± 0.574
0.848ArgGln: 0.848 ± 0.836
3.111ArgArg: 3.111 ± 0.781
3.394ArgSer: 3.394 ± 0.821
3.676ArgThr: 3.676 ± 1.47
2.545ArgVal: 2.545 ± 0.583
0.566ArgTrp: 0.566 ± 0.775
1.697ArgTyr: 1.697 ± 0.734
0.0ArgXaa: 0.0 ± 0.0
Ser
4.242SerAla: 4.242 ± 1.599
1.414SerCys: 1.414 ± 0.608
4.525SerAsp: 4.525 ± 1.123
4.808SerGlu: 4.808 ± 2.138
3.959SerPhe: 3.959 ± 1.335
2.545SerGly: 2.545 ± 0.807
2.545SerHis: 2.545 ± 1.217
3.676SerIle: 3.676 ± 1.523
4.525SerLys: 4.525 ± 0.773
6.505SerLeu: 6.505 ± 1.536
1.98SerMet: 1.98 ± 1.0
2.828SerAsn: 2.828 ± 1.066
1.98SerPro: 1.98 ± 0.319
2.262SerGln: 2.262 ± 0.818
5.373SerArg: 5.373 ± 1.815
5.09SerSer: 5.09 ± 1.588
4.242SerThr: 4.242 ± 1.122
4.808SerVal: 4.808 ± 0.869
3.111SerTrp: 3.111 ± 0.699
3.676SerTyr: 3.676 ± 0.546
0.0SerXaa: 0.0 ± 0.0
Thr
2.262ThrAla: 2.262 ± 0.966
0.566ThrCys: 0.566 ± 1.193
1.98ThrAsp: 1.98 ± 0.699
3.959ThrGlu: 3.959 ± 1.145
1.98ThrPhe: 1.98 ± 1.137
3.676ThrGly: 3.676 ± 0.856
2.545ThrHis: 2.545 ± 0.717
3.959ThrIle: 3.959 ± 1.289
3.676ThrLys: 3.676 ± 2.063
5.373ThrLeu: 5.373 ± 1.681
1.131ThrMet: 1.131 ± 0.397
3.394ThrAsn: 3.394 ± 0.464
3.394ThrPro: 3.394 ± 1.063
1.697ThrGln: 1.697 ± 0.993
1.414ThrArg: 1.414 ± 0.519
4.808ThrSer: 4.808 ± 0.826
3.959ThrThr: 3.959 ± 0.817
4.242ThrVal: 4.242 ± 1.331
1.414ThrTrp: 1.414 ± 0.524
3.394ThrTyr: 3.394 ± 1.112
0.0ThrXaa: 0.0 ± 0.0
Val
4.808ValAla: 4.808 ± 2.263
0.283ValCys: 0.283 ± 0.165
3.959ValAsp: 3.959 ± 1.438
2.828ValGlu: 2.828 ± 1.041
2.545ValPhe: 2.545 ± 0.667
2.262ValGly: 2.262 ± 0.475
3.394ValHis: 3.394 ± 1.618
4.808ValIle: 4.808 ± 1.99
2.828ValLys: 2.828 ± 0.568
5.373ValLeu: 5.373 ± 1.113
1.131ValMet: 1.131 ± 1.085
3.959ValAsn: 3.959 ± 0.692
2.545ValPro: 2.545 ± 0.963
1.414ValGln: 1.414 ± 0.588
3.394ValArg: 3.394 ± 0.859
3.394ValSer: 3.394 ± 1.379
5.09ValThr: 5.09 ± 2.059
3.394ValVal: 3.394 ± 1.631
1.697ValTrp: 1.697 ± 0.717
2.828ValTyr: 2.828 ± 0.748
0.0ValXaa: 0.0 ± 0.0
Trp
1.414TrpAla: 1.414 ± 0.519
0.283TrpCys: 0.283 ± 0.596
1.131TrpAsp: 1.131 ± 0.626
1.414TrpGlu: 1.414 ± 0.519
1.98TrpPhe: 1.98 ± 0.644
1.414TrpGly: 1.414 ± 0.827
0.283TrpHis: 0.283 ± 0.165
0.848TrpIle: 0.848 ± 0.91
1.131TrpLys: 1.131 ± 0.397
1.697TrpLeu: 1.697 ± 0.354
1.131TrpMet: 1.131 ± 0.777
0.848TrpAsn: 0.848 ± 0.496
0.566TrpPro: 0.566 ± 0.331
0.566TrpGln: 0.566 ± 0.543
0.566TrpArg: 0.566 ± 0.331
1.131TrpSer: 1.131 ± 0.397
0.566TrpThr: 0.566 ± 0.61
1.131TrpVal: 1.131 ± 1.133
0.283TrpTrp: 0.283 ± 0.165
0.283TrpTyr: 0.283 ± 0.388
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.131TyrAla: 1.131 ± 0.373
0.283TyrCys: 0.283 ± 0.388
1.697TyrAsp: 1.697 ± 0.464
3.394TyrGlu: 3.394 ± 0.621
1.98TyrPhe: 1.98 ± 0.613
1.697TyrGly: 1.697 ± 0.993
0.848TyrHis: 0.848 ± 0.96
1.98TyrIle: 1.98 ± 0.98
4.242TyrLys: 4.242 ± 1.016
4.525TyrLeu: 4.525 ± 1.066
0.566TyrMet: 0.566 ± 0.391
1.98TyrAsn: 1.98 ± 0.512
1.98TyrPro: 1.98 ± 0.61
0.566TyrGln: 0.566 ± 0.313
1.98TyrArg: 1.98 ± 0.841
3.111TyrSer: 3.111 ± 1.582
2.262TyrThr: 2.262 ± 1.836
1.697TyrVal: 1.697 ± 0.354
0.566TyrTrp: 0.566 ± 0.313
0.848TyrTyr: 0.848 ± 0.496
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3537 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski