Amino acid dipepetide frequency for Farmington virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.9AlaAla: 8.9 ± 3.732
2.157AlaCys: 2.157 ± 0.613
4.045AlaAsp: 4.045 ± 0.771
3.776AlaGlu: 3.776 ± 0.918
4.045AlaPhe: 4.045 ± 0.405
1.079AlaGly: 1.079 ± 0.41
1.348AlaHis: 1.348 ± 0.471
7.012AlaIle: 7.012 ± 0.788
2.427AlaLys: 2.427 ± 1.491
6.203AlaLeu: 6.203 ± 1.4
1.888AlaMet: 1.888 ± 0.769
1.618AlaAsn: 1.618 ± 0.496
2.697AlaPro: 2.697 ± 0.818
2.967AlaGln: 2.967 ± 1.563
4.585AlaArg: 4.585 ± 0.539
5.933AlaSer: 5.933 ± 1.421
3.506AlaThr: 3.506 ± 1.578
7.282AlaVal: 7.282 ± 1.052
1.618AlaTrp: 1.618 ± 0.727
2.427AlaTyr: 2.427 ± 0.764
0.0AlaXaa: 0.0 ± 0.0
Cys
1.348CysAla: 1.348 ± 0.539
0.809CysCys: 0.809 ± 0.338
1.079CysAsp: 1.079 ± 0.583
0.539CysGlu: 0.539 ± 0.354
0.27CysPhe: 0.27 ± 0.158
0.539CysGly: 0.539 ± 0.316
0.539CysHis: 0.539 ± 0.244
0.809CysIle: 0.809 ± 0.252
0.539CysLys: 0.539 ± 0.529
2.427CysLeu: 2.427 ± 0.83
0.809CysMet: 0.809 ± 0.338
0.809CysAsn: 0.809 ± 0.448
0.27CysPro: 0.27 ± 0.325
0.0CysGln: 0.0 ± 0.0
1.618CysArg: 1.618 ± 0.774
1.348CysSer: 1.348 ± 0.22
1.079CysThr: 1.079 ± 0.872
1.079CysVal: 1.079 ± 0.442
0.27CysTrp: 0.27 ± 0.158
0.27CysTyr: 0.27 ± 0.158
0.0CysXaa: 0.0 ± 0.0
Asp
3.776AspAla: 3.776 ± 1.176
0.27AspCys: 0.27 ± 0.325
2.157AspAsp: 2.157 ± 1.042
3.776AspGlu: 3.776 ± 1.192
1.888AspPhe: 1.888 ± 1.368
3.236AspGly: 3.236 ± 0.808
1.348AspHis: 1.348 ± 0.552
4.315AspIle: 4.315 ± 0.392
2.967AspLys: 2.967 ± 0.921
8.9AspLeu: 8.9 ± 1.744
0.809AspMet: 0.809 ± 0.252
2.157AspAsn: 2.157 ± 0.976
4.854AspPro: 4.854 ± 1.345
0.539AspGln: 0.539 ± 0.316
2.427AspArg: 2.427 ± 1.067
2.157AspSer: 2.157 ± 0.955
4.045AspThr: 4.045 ± 0.856
3.506AspVal: 3.506 ± 0.351
0.539AspTrp: 0.539 ± 0.541
1.079AspTyr: 1.079 ± 0.444
0.0AspXaa: 0.0 ± 0.0
Glu
4.585GluAla: 4.585 ± 1.236
1.348GluCys: 1.348 ± 0.791
4.045GluAsp: 4.045 ± 1.044
3.506GluGlu: 3.506 ± 0.92
2.697GluPhe: 2.697 ± 0.596
2.967GluGly: 2.967 ± 1.082
0.809GluHis: 0.809 ± 0.528
2.967GluIle: 2.967 ± 0.597
1.079GluLys: 1.079 ± 0.328
7.821GluLeu: 7.821 ± 1.699
2.157GluMet: 2.157 ± 0.762
2.157GluAsn: 2.157 ± 0.441
4.045GluPro: 4.045 ± 0.38
1.079GluGln: 1.079 ± 0.556
4.585GluArg: 4.585 ± 0.678
3.776GluSer: 3.776 ± 1.716
4.045GluThr: 4.045 ± 1.285
4.315GluVal: 4.315 ± 0.818
0.809GluTrp: 0.809 ± 0.474
1.079GluTyr: 1.079 ± 0.255
0.0GluXaa: 0.0 ± 0.0
Phe
2.967PheAla: 2.967 ± 0.876
0.0PheCys: 0.0 ± 0.0
1.618PheAsp: 1.618 ± 0.613
2.967PheGlu: 2.967 ± 0.751
1.348PhePhe: 1.348 ± 0.477
2.967PheGly: 2.967 ± 0.476
0.27PheHis: 0.27 ± 0.158
0.809PheIle: 0.809 ± 0.368
2.967PheLys: 2.967 ± 0.92
2.967PheLeu: 2.967 ± 1.162
0.809PheMet: 0.809 ± 0.252
1.079PheAsn: 1.079 ± 0.343
1.888PhePro: 1.888 ± 1.107
1.618PheGln: 1.618 ± 0.723
2.697PheArg: 2.697 ± 1.161
3.506PheSer: 3.506 ± 1.429
1.348PheThr: 1.348 ± 0.522
2.697PheVal: 2.697 ± 1.021
0.27PheTrp: 0.27 ± 0.158
1.618PheTyr: 1.618 ± 0.996
0.0PheXaa: 0.0 ± 0.0
Gly
3.506GlyAla: 3.506 ± 0.708
0.539GlyCys: 0.539 ± 0.316
4.045GlyAsp: 4.045 ± 0.757
1.888GlyGlu: 1.888 ± 0.738
2.697GlyPhe: 2.697 ± 0.617
4.854GlyGly: 4.854 ± 1.289
1.348GlyHis: 1.348 ± 0.47
2.967GlyIle: 2.967 ± 0.449
1.618GlyLys: 1.618 ± 0.505
5.663GlyLeu: 5.663 ± 1.581
1.888GlyMet: 1.888 ± 0.318
1.888GlyAsn: 1.888 ± 0.461
2.697GlyPro: 2.697 ± 0.738
1.079GlyGln: 1.079 ± 0.255
4.315GlyArg: 4.315 ± 1.158
3.506GlySer: 3.506 ± 0.734
2.697GlyThr: 2.697 ± 0.644
2.427GlyVal: 2.427 ± 0.613
1.348GlyTrp: 1.348 ± 0.421
4.045GlyTyr: 4.045 ± 0.75
0.0GlyXaa: 0.0 ± 0.0
His
1.079HisAla: 1.079 ± 0.488
0.27HisCys: 0.27 ± 0.158
0.809HisAsp: 0.809 ± 0.802
1.348HisGlu: 1.348 ± 0.791
1.079HisPhe: 1.079 ± 0.343
1.888HisGly: 1.888 ± 0.542
1.348HisHis: 1.348 ± 0.791
0.809HisIle: 0.809 ± 0.474
0.809HisLys: 0.809 ± 0.368
2.157HisLeu: 2.157 ± 0.551
0.27HisMet: 0.27 ± 0.325
0.27HisAsn: 0.27 ± 0.325
2.697HisPro: 2.697 ± 0.455
0.539HisGln: 0.539 ± 0.354
2.157HisArg: 2.157 ± 0.546
2.697HisSer: 2.697 ± 1.39
0.809HisThr: 0.809 ± 0.252
1.348HisVal: 1.348 ± 0.791
0.0HisTrp: 0.0 ± 0.0
1.079HisTyr: 1.079 ± 0.632
0.0HisXaa: 0.0 ± 0.0
Ile
4.585IleAla: 4.585 ± 1.839
0.539IleCys: 0.539 ± 0.244
4.315IleAsp: 4.315 ± 0.891
2.427IleGlu: 2.427 ± 0.434
1.618IlePhe: 1.618 ± 0.556
4.854IleGly: 4.854 ± 0.347
1.079IleHis: 1.079 ± 0.343
3.236IleIle: 3.236 ± 0.49
1.618IleLys: 1.618 ± 0.383
5.663IleLeu: 5.663 ± 1.216
1.348IleMet: 1.348 ± 0.552
3.506IleAsn: 3.506 ± 0.683
4.585IlePro: 4.585 ± 1.559
1.079IleGln: 1.079 ± 0.719
5.394IleArg: 5.394 ± 1.088
3.236IleSer: 3.236 ± 0.902
4.585IleThr: 4.585 ± 0.917
3.506IleVal: 3.506 ± 1.522
1.079IleTrp: 1.079 ± 0.399
1.618IleTyr: 1.618 ± 0.613
0.0IleXaa: 0.0 ± 0.0
Lys
4.045LysAla: 4.045 ± 1.9
1.079LysCys: 1.079 ± 0.442
1.888LysAsp: 1.888 ± 0.637
2.157LysGlu: 2.157 ± 2.051
1.348LysPhe: 1.348 ± 0.471
3.236LysGly: 3.236 ± 0.49
1.888LysHis: 1.888 ± 0.42
2.697LysIle: 2.697 ± 0.459
2.967LysLys: 2.967 ± 0.559
5.124LysLeu: 5.124 ± 1.99
1.618LysMet: 1.618 ± 0.368
1.079LysAsn: 1.079 ± 0.343
2.697LysPro: 2.697 ± 0.929
0.809LysGln: 0.809 ± 0.252
3.506LysArg: 3.506 ± 1.427
2.427LysSer: 2.427 ± 0.526
2.427LysThr: 2.427 ± 0.828
2.427LysVal: 2.427 ± 0.743
0.809LysTrp: 0.809 ± 0.363
1.888LysTyr: 1.888 ± 0.762
0.0LysXaa: 0.0 ± 0.0
Leu
6.472LeuAla: 6.472 ± 0.75
1.348LeuCys: 1.348 ± 0.774
4.854LeuAsp: 4.854 ± 0.727
6.203LeuGlu: 6.203 ± 2.718
4.585LeuPhe: 4.585 ± 1.141
6.203LeuGly: 6.203 ± 1.673
4.045LeuHis: 4.045 ± 0.543
6.203LeuIle: 6.203 ± 1.086
4.854LeuLys: 4.854 ± 0.57
7.551LeuLeu: 7.551 ± 1.409
3.506LeuMet: 3.506 ± 1.445
4.315LeuAsn: 4.315 ± 0.874
5.663LeuPro: 5.663 ± 0.574
2.427LeuGln: 2.427 ± 0.824
6.472LeuArg: 6.472 ± 1.153
11.866LeuSer: 11.866 ± 1.149
6.742LeuThr: 6.742 ± 1.754
5.933LeuVal: 5.933 ± 1.504
1.618LeuTrp: 1.618 ± 0.54
2.157LeuTyr: 2.157 ± 0.576
0.0LeuXaa: 0.0 ± 0.0
Met
2.427MetAla: 2.427 ± 0.824
0.0MetCys: 0.0 ± 0.0
0.809MetAsp: 0.809 ± 0.41
1.618MetGlu: 1.618 ± 1.287
1.079MetPhe: 1.079 ± 0.41
0.539MetGly: 0.539 ± 0.316
0.0MetHis: 0.0 ± 0.0
2.427MetIle: 2.427 ± 0.743
1.348MetLys: 1.348 ± 0.471
2.697MetLeu: 2.697 ± 0.612
1.618MetMet: 1.618 ± 0.516
0.809MetAsn: 0.809 ± 0.338
0.809MetPro: 0.809 ± 0.338
0.809MetGln: 0.809 ± 0.338
2.157MetArg: 2.157 ± 0.465
2.427MetSer: 2.427 ± 0.743
1.618MetThr: 1.618 ± 0.682
2.157MetVal: 2.157 ± 1.265
0.539MetTrp: 0.539 ± 0.529
0.27MetTyr: 0.27 ± 0.325
0.0MetXaa: 0.0 ± 0.0
Asn
2.427AsnAla: 2.427 ± 0.908
0.27AsnCys: 0.27 ± 0.158
1.348AsnAsp: 1.348 ± 0.476
2.697AsnGlu: 2.697 ± 0.596
0.27AsnPhe: 0.27 ± 0.325
2.697AsnGly: 2.697 ± 1.228
1.079AsnHis: 1.079 ± 0.343
2.157AsnIle: 2.157 ± 0.337
1.348AsnLys: 1.348 ± 0.31
4.315AsnLeu: 4.315 ± 0.747
0.809AsnMet: 0.809 ± 0.338
2.157AsnAsn: 2.157 ± 0.719
2.427AsnPro: 2.427 ± 0.757
1.888AsnGln: 1.888 ± 0.821
1.079AsnArg: 1.079 ± 0.872
2.427AsnSer: 2.427 ± 0.449
2.697AsnThr: 2.697 ± 0.612
1.618AsnVal: 1.618 ± 0.54
0.539AsnTrp: 0.539 ± 0.316
1.618AsnTyr: 1.618 ± 0.505
0.0AsnXaa: 0.0 ± 0.0
Pro
5.124ProAla: 5.124 ± 0.758
1.618ProCys: 1.618 ± 1.001
3.506ProAsp: 3.506 ± 0.572
2.427ProGlu: 2.427 ± 0.832
1.888ProPhe: 1.888 ± 1.107
1.618ProGly: 1.618 ± 0.732
1.079ProHis: 1.079 ± 0.399
3.236ProIle: 3.236 ± 0.746
2.697ProLys: 2.697 ± 1.066
5.933ProLeu: 5.933 ± 0.889
1.079ProMet: 1.079 ± 0.832
2.697ProAsn: 2.697 ± 0.437
4.045ProPro: 4.045 ± 1.521
3.506ProGln: 3.506 ± 0.735
3.506ProArg: 3.506 ± 0.366
5.394ProSer: 5.394 ± 0.755
4.045ProThr: 4.045 ± 2.245
4.854ProVal: 4.854 ± 0.458
0.27ProTrp: 0.27 ± 0.158
2.967ProTyr: 2.967 ± 0.561
0.0ProXaa: 0.0 ± 0.0
Gln
1.348GlnAla: 1.348 ± 1.039
0.539GlnCys: 0.539 ± 0.613
1.888GlnAsp: 1.888 ± 0.251
2.427GlnGlu: 2.427 ± 0.572
0.539GlnPhe: 0.539 ± 0.316
1.079GlnGly: 1.079 ± 0.488
0.809GlnHis: 0.809 ± 0.338
1.618GlnIle: 1.618 ± 0.505
1.618GlnLys: 1.618 ± 0.373
2.697GlnLeu: 2.697 ± 0.537
0.27GlnMet: 0.27 ± 0.39
1.348GlnAsn: 1.348 ± 0.616
0.27GlnPro: 0.27 ± 0.158
1.348GlnGln: 1.348 ± 0.522
1.079GlnArg: 1.079 ± 0.328
2.967GlnSer: 2.967 ± 0.885
1.348GlnThr: 1.348 ± 0.455
2.967GlnVal: 2.967 ± 0.348
0.27GlnTrp: 0.27 ± 0.158
2.157GlnTyr: 2.157 ± 0.759
0.0GlnXaa: 0.0 ± 0.0
Arg
6.742ArgAla: 6.742 ± 1.541
1.348ArgCys: 1.348 ± 0.47
2.967ArgAsp: 2.967 ± 1.004
4.585ArgGlu: 4.585 ± 1.234
1.618ArgPhe: 1.618 ± 0.741
4.315ArgGly: 4.315 ± 1.24
1.618ArgHis: 1.618 ± 0.653
3.776ArgIle: 3.776 ± 0.579
2.157ArgLys: 2.157 ± 0.913
7.012ArgLeu: 7.012 ± 1.891
1.348ArgMet: 1.348 ± 0.791
2.427ArgAsn: 2.427 ± 0.327
4.585ArgPro: 4.585 ± 2.021
2.157ArgGln: 2.157 ± 0.302
4.854ArgArg: 4.854 ± 1.693
2.697ArgSer: 2.697 ± 0.537
5.394ArgThr: 5.394 ± 1.038
4.585ArgVal: 4.585 ± 1.207
1.888ArgTrp: 1.888 ± 0.42
1.079ArgTyr: 1.079 ± 0.442
0.0ArgXaa: 0.0 ± 0.0
Ser
5.394SerAla: 5.394 ± 0.506
2.157SerCys: 2.157 ± 1.34
5.124SerAsp: 5.124 ± 1.718
6.203SerGlu: 6.203 ± 0.649
3.506SerPhe: 3.506 ± 1.294
3.236SerGly: 3.236 ± 1.199
1.888SerHis: 1.888 ± 0.81
4.045SerIle: 4.045 ± 1.136
4.045SerLys: 4.045 ± 1.099
9.709SerLeu: 9.709 ± 2.531
1.348SerMet: 1.348 ± 0.471
1.348SerAsn: 1.348 ± 0.471
5.124SerPro: 5.124 ± 1.209
0.809SerGln: 0.809 ± 0.338
4.585SerArg: 4.585 ± 1.21
5.394SerSer: 5.394 ± 0.624
3.776SerThr: 3.776 ± 1.042
4.315SerVal: 4.315 ± 0.798
1.888SerTrp: 1.888 ± 0.729
1.348SerTyr: 1.348 ± 0.476
0.0SerXaa: 0.0 ± 0.0
Thr
4.315ThrAla: 4.315 ± 0.915
0.809ThrCys: 0.809 ± 0.41
3.236ThrAsp: 3.236 ± 0.571
4.315ThrGlu: 4.315 ± 1.423
1.888ThrPhe: 1.888 ± 0.581
3.506ThrGly: 3.506 ± 0.597
1.618ThrHis: 1.618 ± 0.774
4.045ThrIle: 4.045 ± 0.494
2.697ThrLys: 2.697 ± 1.074
6.203ThrLeu: 6.203 ± 1.255
1.618ThrMet: 1.618 ± 0.736
1.888ThrAsn: 1.888 ± 0.762
4.585ThrPro: 4.585 ± 0.76
1.888ThrGln: 1.888 ± 0.542
4.585ThrArg: 4.585 ± 0.482
4.585ThrSer: 4.585 ± 0.918
4.585ThrThr: 4.585 ± 0.669
2.967ThrVal: 2.967 ± 0.714
1.618ThrTrp: 1.618 ± 0.308
1.348ThrTyr: 1.348 ± 0.522
0.0ThrXaa: 0.0 ± 0.0
Val
2.967ValAla: 2.967 ± 0.484
0.539ValCys: 0.539 ± 0.244
3.776ValAsp: 3.776 ± 0.549
4.854ValGlu: 4.854 ± 1.229
1.888ValPhe: 1.888 ± 0.406
2.427ValGly: 2.427 ± 0.434
1.079ValHis: 1.079 ± 0.343
3.506ValIle: 3.506 ± 1.072
6.742ValLys: 6.742 ± 1.612
5.124ValLeu: 5.124 ± 0.91
1.348ValMet: 1.348 ± 0.904
2.697ValAsn: 2.697 ± 0.819
4.854ValPro: 4.854 ± 1.176
1.618ValGln: 1.618 ± 1.003
5.394ValArg: 5.394 ± 0.327
4.854ValSer: 4.854 ± 1.022
4.045ValThr: 4.045 ± 0.78
3.776ValVal: 3.776 ± 0.257
1.348ValTrp: 1.348 ± 0.522
1.888ValTyr: 1.888 ± 0.318
0.0ValXaa: 0.0 ± 0.0
Trp
1.618TrpAla: 1.618 ± 0.54
0.0TrpCys: 0.0 ± 0.0
1.888TrpAsp: 1.888 ± 0.42
0.539TrpGlu: 0.539 ± 0.541
0.539TrpPhe: 0.539 ± 0.316
1.618TrpGly: 1.618 ± 0.682
0.27TrpHis: 0.27 ± 0.158
0.0TrpIle: 0.0 ± 0.0
0.539TrpLys: 0.539 ± 0.354
1.079TrpLeu: 1.079 ± 0.863
0.809TrpMet: 0.809 ± 0.371
1.079TrpAsn: 1.079 ± 0.863
0.539TrpPro: 0.539 ± 0.316
0.539TrpGln: 0.539 ± 0.316
1.079TrpArg: 1.079 ± 0.872
1.618TrpSer: 1.618 ± 0.373
1.348TrpThr: 1.348 ± 0.41
0.809TrpVal: 0.809 ± 0.743
0.27TrpTrp: 0.27 ± 0.158
0.539TrpTyr: 0.539 ± 0.244
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.888TyrAla: 1.888 ± 0.391
0.809TyrCys: 0.809 ± 0.363
1.348TyrAsp: 1.348 ± 0.47
1.618TyrGlu: 1.618 ± 0.373
1.618TyrPhe: 1.618 ± 0.741
2.157TyrGly: 2.157 ± 0.546
0.0TyrHis: 0.0 ± 0.0
3.236TyrIle: 3.236 ± 0.746
1.079TyrLys: 1.079 ± 0.442
3.776TyrLeu: 3.776 ± 1.412
0.539TyrMet: 0.539 ± 0.314
0.539TyrAsn: 0.539 ± 0.316
2.157TyrPro: 2.157 ± 0.756
1.888TyrGln: 1.888 ± 0.762
1.079TyrArg: 1.079 ± 0.664
2.427TyrSer: 2.427 ± 0.449
2.157TyrThr: 2.157 ± 0.456
2.157TyrVal: 2.157 ± 0.913
0.0TyrTrp: 0.0 ± 0.0
0.809TyrTyr: 0.809 ± 0.338
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3709 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski