Amino acid dipepetide frequency for Urucuri virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.272AlaAla: 5.272 ± 1.313
0.753AlaCys: 0.753 ± 0.195
3.013AlaAsp: 3.013 ± 2.341
3.766AlaGlu: 3.766 ± 0.39
1.506AlaPhe: 1.506 ± 0.622
3.264AlaGly: 3.264 ± 0.789
2.26AlaHis: 2.26 ± 0.411
3.013AlaIle: 3.013 ± 0.526
2.009AlaLys: 2.009 ± 0.839
3.264AlaLeu: 3.264 ± 1.108
2.511AlaMet: 2.511 ± 0.618
2.26AlaAsn: 2.26 ± 1.05
2.26AlaPro: 2.26 ± 0.572
1.506AlaGln: 1.506 ± 0.309
2.009AlaArg: 2.009 ± 0.43
4.519AlaSer: 4.519 ± 0.483
1.757AlaThr: 1.757 ± 0.921
4.77AlaVal: 4.77 ± 1.766
0.251AlaTrp: 0.251 ± 0.157
1.506AlaTyr: 1.506 ± 0.606
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.255CysAsp: 1.255 ± 0.801
0.502CysGlu: 0.502 ± 0.539
0.251CysPhe: 0.251 ± 0.23
1.255CysGly: 1.255 ± 0.483
0.753CysHis: 0.753 ± 0.689
1.757CysIle: 1.757 ± 0.409
2.26CysLys: 2.26 ± 1.049
4.017CysLeu: 4.017 ± 1.523
0.502CysMet: 0.502 ± 0.539
1.506CysAsn: 1.506 ± 0.7
1.506CysPro: 1.506 ± 0.432
1.506CysGln: 1.506 ± 0.488
2.009CysArg: 2.009 ± 0.895
1.757CysSer: 1.757 ± 0.435
3.013CysThr: 3.013 ± 0.526
0.251CysVal: 0.251 ± 0.23
0.251CysTrp: 0.251 ± 0.23
1.255CysTyr: 1.255 ± 0.305
0.0CysXaa: 0.0 ± 0.0
Asp
4.017AspAla: 4.017 ± 1.852
1.506AspCys: 1.506 ± 0.532
4.519AspAsp: 4.519 ± 0.494
4.519AspGlu: 4.519 ± 1.3
2.511AspPhe: 2.511 ± 0.833
3.264AspGly: 3.264 ± 0.388
1.004AspHis: 1.004 ± 0.288
3.264AspIle: 3.264 ± 1.395
4.519AspLys: 4.519 ± 0.721
5.523AspLeu: 5.523 ± 0.945
1.506AspMet: 1.506 ± 0.78
2.26AspAsn: 2.26 ± 0.551
2.511AspPro: 2.511 ± 0.703
2.511AspGln: 2.511 ± 0.852
3.264AspArg: 3.264 ± 1.633
4.77AspSer: 4.77 ± 0.93
1.506AspThr: 1.506 ± 0.35
2.26AspVal: 2.26 ± 1.086
1.004AspTrp: 1.004 ± 0.324
3.264AspTyr: 3.264 ± 2.35
0.0AspXaa: 0.0 ± 0.0
Glu
3.013GluAla: 3.013 ± 0.898
2.26GluCys: 2.26 ± 1.375
5.523GluAsp: 5.523 ± 1.983
6.277GluGlu: 6.277 ± 1.303
3.766GluPhe: 3.766 ± 1.521
3.766GluGly: 3.766 ± 0.976
1.506GluHis: 1.506 ± 0.622
5.272GluIle: 5.272 ± 0.952
3.766GluLys: 3.766 ± 0.641
5.272GluLeu: 5.272 ± 1.44
1.757GluMet: 1.757 ± 0.735
2.511GluAsn: 2.511 ± 0.945
1.506GluPro: 1.506 ± 0.966
1.506GluGln: 1.506 ± 1.379
4.017GluArg: 4.017 ± 0.751
5.021GluSer: 5.021 ± 1.342
3.515GluThr: 3.515 ± 1.009
4.77GluVal: 4.77 ± 0.743
1.004GluTrp: 1.004 ± 0.538
3.013GluTyr: 3.013 ± 0.864
0.0GluXaa: 0.0 ± 0.0
Phe
3.013PheAla: 3.013 ± 1.228
1.255PheCys: 1.255 ± 0.483
1.506PheAsp: 1.506 ± 0.606
2.009PheGlu: 2.009 ± 0.931
2.26PhePhe: 2.26 ± 0.441
1.255PheGly: 1.255 ± 0.47
0.753PheHis: 0.753 ± 0.544
2.26PheIle: 2.26 ± 1.104
2.762PheLys: 2.762 ± 0.446
4.77PheLeu: 4.77 ± 1.929
2.26PheMet: 2.26 ± 0.468
2.762PheAsn: 2.762 ± 0.512
1.757PhePro: 1.757 ± 0.901
1.757PheGln: 1.757 ± 0.921
2.26PheArg: 2.26 ± 1.209
3.013PheSer: 3.013 ± 0.679
1.757PheThr: 1.757 ± 0.573
2.762PheVal: 2.762 ± 1.091
0.502PheTrp: 0.502 ± 0.144
1.255PheTyr: 1.255 ± 0.674
0.0PheXaa: 0.0 ± 0.0
Gly
4.268GlyAla: 4.268 ± 1.213
1.255GlyCys: 1.255 ± 0.801
1.255GlyAsp: 1.255 ± 0.305
2.26GlyGlu: 2.26 ± 0.762
3.264GlyPhe: 3.264 ± 0.631
5.523GlyGly: 5.523 ± 0.586
1.004GlyHis: 1.004 ± 0.629
3.013GlyIle: 3.013 ± 0.484
3.515GlyLys: 3.515 ± 0.617
2.511GlyLeu: 2.511 ± 0.418
2.26GlyMet: 2.26 ± 0.479
2.26GlyAsn: 2.26 ± 1.207
2.511GlyPro: 2.511 ± 0.852
1.255GlyGln: 1.255 ± 0.483
3.515GlyArg: 3.515 ± 1.124
7.281GlySer: 7.281 ± 1.293
3.515GlyThr: 3.515 ± 0.87
3.013GlyVal: 3.013 ± 0.738
0.753GlyTrp: 0.753 ± 0.577
1.004GlyTyr: 1.004 ± 0.288
0.0GlyXaa: 0.0 ± 0.0
His
0.251HisAla: 0.251 ± 0.157
1.004HisCys: 1.004 ± 0.452
1.255HisAsp: 1.255 ± 0.47
1.757HisGlu: 1.757 ± 0.409
1.255HisPhe: 1.255 ± 0.47
1.506HisGly: 1.506 ± 0.39
1.004HisHis: 1.004 ± 0.288
1.506HisIle: 1.506 ± 0.432
2.009HisLys: 2.009 ± 0.392
0.753HisLeu: 0.753 ± 0.195
0.251HisMet: 0.251 ± 0.23
1.004HisAsn: 1.004 ± 1.079
0.753HisPro: 0.753 ± 0.466
1.255HisGln: 1.255 ± 0.381
1.757HisArg: 1.757 ± 0.435
1.255HisSer: 1.255 ± 0.429
0.753HisThr: 0.753 ± 0.35
1.255HisVal: 1.255 ± 0.47
0.251HisTrp: 0.251 ± 0.157
1.255HisTyr: 1.255 ± 0.426
0.0HisXaa: 0.0 ± 0.0
Ile
2.009IleAla: 2.009 ± 0.338
1.255IleCys: 1.255 ± 0.483
4.017IleAsp: 4.017 ± 0.172
4.017IleGlu: 4.017 ± 0.681
2.009IlePhe: 2.009 ± 0.931
3.766IleGly: 3.766 ± 0.485
2.009IleHis: 2.009 ± 0.392
5.021IleIle: 5.021 ± 0.448
3.264IleLys: 3.264 ± 0.794
6.026IleLeu: 6.026 ± 0.953
0.753IleMet: 0.753 ± 0.446
3.766IleAsn: 3.766 ± 0.99
2.762IlePro: 2.762 ± 0.446
3.264IleGln: 3.264 ± 1.115
4.77IleArg: 4.77 ± 1.269
5.775IleSer: 5.775 ± 1.021
3.515IleThr: 3.515 ± 1.243
5.272IleVal: 5.272 ± 0.333
0.251IleTrp: 0.251 ± 0.157
1.255IleTyr: 1.255 ± 0.483
0.0IleXaa: 0.0 ± 0.0
Lys
4.519LysAla: 4.519 ± 0.565
1.506LysCys: 1.506 ± 0.7
2.762LysAsp: 2.762 ± 0.607
4.268LysGlu: 4.268 ± 1.713
2.762LysPhe: 2.762 ± 1.16
1.757LysGly: 1.757 ± 0.511
0.753LysHis: 0.753 ± 0.195
5.272LysIle: 5.272 ± 1.025
4.77LysLys: 4.77 ± 0.32
6.026LysLeu: 6.026 ± 0.877
3.013LysMet: 3.013 ± 1.16
3.013LysAsn: 3.013 ± 1.052
3.013LysPro: 3.013 ± 0.217
1.757LysGln: 1.757 ± 0.59
2.511LysArg: 2.511 ± 1.242
4.77LysSer: 4.77 ± 2.011
4.77LysThr: 4.77 ± 1.426
5.775LysVal: 5.775 ± 0.54
1.255LysTrp: 1.255 ± 0.429
1.506LysTyr: 1.506 ± 0.309
0.0LysXaa: 0.0 ± 0.0
Leu
4.268LeuAla: 4.268 ± 1.704
1.757LeuCys: 1.757 ± 0.921
4.268LeuAsp: 4.268 ± 1.702
7.03LeuGlu: 7.03 ± 1.243
3.766LeuPhe: 3.766 ± 1.122
5.021LeuGly: 5.021 ± 1.147
1.506LeuHis: 1.506 ± 0.35
4.519LeuIle: 4.519 ± 1.277
6.026LeuLys: 6.026 ± 1.262
5.523LeuLeu: 5.523 ± 1.042
2.009LeuMet: 2.009 ± 0.576
2.511LeuAsn: 2.511 ± 0.61
2.511LeuPro: 2.511 ± 0.335
2.762LeuGln: 2.762 ± 0.311
7.03LeuArg: 7.03 ± 0.918
10.545LeuSer: 10.545 ± 1.984
4.519LeuThr: 4.519 ± 0.986
4.519LeuVal: 4.519 ± 1.384
0.0LeuTrp: 0.0 ± 0.0
2.009LeuTyr: 2.009 ± 0.576
0.0LeuXaa: 0.0 ± 0.0
Met
1.004MetAla: 1.004 ± 0.549
0.753MetCys: 0.753 ± 0.195
2.26MetAsp: 2.26 ± 0.734
2.26MetGlu: 2.26 ± 1.061
1.004MetPhe: 1.004 ± 0.629
1.757MetGly: 1.757 ± 0.435
1.004MetHis: 1.004 ± 0.549
2.511MetIle: 2.511 ± 0.763
2.009MetLys: 2.009 ± 0.648
2.009MetLeu: 2.009 ± 0.491
2.26MetMet: 2.26 ± 0.865
1.757MetAsn: 1.757 ± 0.907
0.0MetPro: 0.0 ± 0.0
1.004MetGln: 1.004 ± 0.288
2.511MetArg: 2.511 ± 0.999
2.511MetSer: 2.511 ± 0.434
2.511MetThr: 2.511 ± 0.966
2.511MetVal: 2.511 ± 0.418
0.251MetTrp: 0.251 ± 0.157
1.506MetTyr: 1.506 ± 0.39
0.0MetXaa: 0.0 ± 0.0
Asn
0.753AsnAla: 0.753 ± 0.669
1.004AsnCys: 1.004 ± 0.814
2.762AsnAsp: 2.762 ± 0.446
3.766AsnGlu: 3.766 ± 1.763
2.26AsnPhe: 2.26 ± 0.734
1.506AsnGly: 1.506 ± 0.532
1.506AsnHis: 1.506 ± 0.532
2.009AsnIle: 2.009 ± 0.392
2.762AsnLys: 2.762 ± 0.831
4.519AsnLeu: 4.519 ± 0.661
1.004AsnMet: 1.004 ± 0.275
1.255AsnAsn: 1.255 ± 0.611
2.762AsnPro: 2.762 ± 0.655
1.004AsnGln: 1.004 ± 0.324
2.762AsnArg: 2.762 ± 0.764
5.775AsnSer: 5.775 ± 0.927
1.757AsnThr: 1.757 ± 1.014
2.762AsnVal: 2.762 ± 0.548
0.0AsnTrp: 0.0 ± 0.0
0.753AsnTyr: 0.753 ± 0.876
0.0AsnXaa: 0.0 ± 0.0
Pro
1.004ProAla: 1.004 ± 0.452
0.753ProCys: 0.753 ± 0.35
1.506ProAsp: 1.506 ± 0.39
3.515ProGlu: 3.515 ± 1.022
2.762ProPhe: 2.762 ± 1.334
3.264ProGly: 3.264 ± 1.012
0.502ProHis: 0.502 ± 0.314
1.506ProIle: 1.506 ± 0.39
1.506ProLys: 1.506 ± 0.39
3.013ProLeu: 3.013 ± 0.738
1.506ProMet: 1.506 ± 0.762
2.009ProAsn: 2.009 ± 0.688
0.502ProPro: 0.502 ± 0.314
1.255ProGln: 1.255 ± 0.429
1.757ProArg: 1.757 ± 1.251
3.515ProSer: 3.515 ± 0.634
2.26ProThr: 2.26 ± 0.493
3.013ProVal: 3.013 ± 0.847
1.506ProTrp: 1.506 ± 0.481
1.506ProTyr: 1.506 ± 0.481
0.0ProXaa: 0.0 ± 0.0
Gln
1.255GlnAla: 1.255 ± 0.943
1.757GlnCys: 1.757 ± 0.7
3.515GlnAsp: 3.515 ± 0.153
1.255GlnGlu: 1.255 ± 0.305
0.502GlnPhe: 0.502 ± 1.158
2.511GlnGly: 2.511 ± 0.833
1.255GlnHis: 1.255 ± 0.47
3.013GlnIle: 3.013 ± 0.781
3.013GlnLys: 3.013 ± 0.738
1.757GlnLeu: 1.757 ± 0.927
0.502GlnMet: 0.502 ± 0.314
1.506GlnAsn: 1.506 ± 0.532
1.255GlnPro: 1.255 ± 0.381
1.004GlnGln: 1.004 ± 0.344
0.753GlnArg: 0.753 ± 0.858
2.26GlnSer: 2.26 ± 0.493
2.762GlnThr: 2.762 ± 0.83
2.762GlnVal: 2.762 ± 0.936
0.251GlnTrp: 0.251 ± 0.444
0.502GlnTyr: 0.502 ± 0.144
0.0GlnXaa: 0.0 ± 0.0
Arg
3.013ArgAla: 3.013 ± 0.807
1.004ArgCys: 1.004 ± 0.574
4.519ArgAsp: 4.519 ± 0.721
3.766ArgGlu: 3.766 ± 0.713
1.004ArgPhe: 1.004 ± 0.538
3.515ArgGly: 3.515 ± 1.086
1.004ArgHis: 1.004 ± 0.747
4.268ArgIle: 4.268 ± 0.326
3.515ArgLys: 3.515 ± 0.701
5.523ArgLeu: 5.523 ± 1.262
1.506ArgMet: 1.506 ± 0.943
2.511ArgAsn: 2.511 ± 1.321
2.762ArgPro: 2.762 ± 0.877
1.757ArgGln: 1.757 ± 0.518
3.013ArgArg: 3.013 ± 0.832
6.277ArgSer: 6.277 ± 0.94
1.255ArgThr: 1.255 ± 1.251
2.762ArgVal: 2.762 ± 1.16
1.255ArgTrp: 1.255 ± 0.426
2.511ArgTyr: 2.511 ± 0.858
0.0ArgXaa: 0.0 ± 0.0
Ser
5.523SerAla: 5.523 ± 1.031
4.017SerCys: 4.017 ± 1.662
5.272SerAsp: 5.272 ± 1.285
7.03SerGlu: 7.03 ± 1.716
5.272SerPhe: 5.272 ± 0.515
4.77SerGly: 4.77 ± 1.725
1.757SerHis: 1.757 ± 0.621
5.523SerIle: 5.523 ± 1.806
5.523SerLys: 5.523 ± 0.391
7.532SerLeu: 7.532 ± 0.084
2.009SerMet: 2.009 ± 0.491
3.013SerAsn: 3.013 ± 0.484
3.264SerPro: 3.264 ± 0.582
3.013SerGln: 3.013 ± 0.758
3.515SerArg: 3.515 ± 0.83
10.294SerSer: 10.294 ± 2.287
5.523SerThr: 5.523 ± 0.877
7.03SerVal: 7.03 ± 0.669
2.26SerTrp: 2.26 ± 0.411
3.013SerTyr: 3.013 ± 0.971
0.0SerXaa: 0.0 ± 0.0
Thr
1.255ThrAla: 1.255 ± 0.381
1.506ThrCys: 1.506 ± 0.606
4.77ThrAsp: 4.77 ± 1.274
3.766ThrGlu: 3.766 ± 0.495
1.255ThrPhe: 1.255 ± 0.611
4.77ThrGly: 4.77 ± 1.354
0.502ThrHis: 0.502 ± 0.144
4.268ThrIle: 4.268 ± 0.685
3.013ThrLys: 3.013 ± 0.738
5.775ThrLeu: 5.775 ± 0.897
1.255ThrMet: 1.255 ± 0.659
2.511ThrAsn: 2.511 ± 0.703
2.26ThrPro: 2.26 ± 0.757
1.757ThrGln: 1.757 ± 0.409
3.264ThrArg: 3.264 ± 0.99
4.77ThrSer: 4.77 ± 1.086
2.762ThrThr: 2.762 ± 0.712
2.762ThrVal: 2.762 ± 1.411
0.251ThrTrp: 0.251 ± 0.579
1.004ThrTyr: 1.004 ± 0.431
0.0ThrXaa: 0.0 ± 0.0
Val
4.268ValAla: 4.268 ± 0.685
1.757ValCys: 1.757 ± 0.675
4.519ValAsp: 4.519 ± 0.814
4.519ValGlu: 4.519 ± 1.32
3.013ValPhe: 3.013 ± 1.104
1.255ValGly: 1.255 ± 0.801
1.506ValHis: 1.506 ± 0.432
4.017ValIle: 4.017 ± 1.152
5.523ValLys: 5.523 ± 0.78
5.021ValLeu: 5.021 ± 0.836
3.766ValMet: 3.766 ± 0.712
2.26ValAsn: 2.26 ± 1.769
2.511ValPro: 2.511 ± 0.655
2.762ValGln: 2.762 ± 1.513
3.766ValArg: 3.766 ± 0.833
6.277ValSer: 6.277 ± 0.847
2.511ValThr: 2.511 ± 0.519
5.272ValVal: 5.272 ± 1.158
1.506ValTrp: 1.506 ± 0.481
0.251ValTyr: 0.251 ± 0.157
0.0ValXaa: 0.0 ± 0.0
Trp
0.753TrpAla: 0.753 ± 0.471
0.251TrpCys: 0.251 ± 0.579
0.251TrpAsp: 0.251 ± 0.157
0.753TrpGlu: 0.753 ± 0.195
0.502TrpPhe: 0.502 ± 0.144
0.753TrpGly: 0.753 ± 0.487
0.0TrpHis: 0.0 ± 0.0
0.502TrpIle: 0.502 ± 0.46
1.004TrpLys: 1.004 ± 0.747
1.255TrpLeu: 1.255 ± 0.305
0.753TrpMet: 0.753 ± 0.35
0.502TrpAsn: 0.502 ± 0.314
0.502TrpPro: 0.502 ± 0.539
0.0TrpGln: 0.0 ± 0.0
0.502TrpArg: 0.502 ± 0.144
1.506TrpSer: 1.506 ± 0.35
2.26TrpThr: 2.26 ± 1.034
0.753TrpVal: 0.753 ± 0.375
0.251TrpTrp: 0.251 ± 0.157
0.753TrpTyr: 0.753 ± 0.471
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.009TyrAla: 2.009 ± 0.576
0.251TyrCys: 0.251 ± 0.23
0.753TyrAsp: 0.753 ± 0.544
2.26TyrGlu: 2.26 ± 0.586
1.004TyrPhe: 1.004 ± 0.629
0.753TyrGly: 0.753 ± 0.195
0.502TyrHis: 0.502 ± 0.314
2.26TyrIle: 2.26 ± 0.29
3.013TyrLys: 3.013 ± 0.349
2.009TyrLeu: 2.009 ± 1.095
1.757TyrMet: 1.757 ± 0.776
1.506TyrAsn: 1.506 ± 0.481
1.255TyrPro: 1.255 ± 0.943
0.753TyrGln: 0.753 ± 0.876
1.757TyrArg: 1.757 ± 0.337
3.264TyrSer: 3.264 ± 1.307
1.255TyrThr: 1.255 ± 0.483
2.009TyrVal: 2.009 ± 0.969
0.753TyrTrp: 0.753 ± 0.195
0.251TyrTyr: 0.251 ± 0.579
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3984 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski