Amino acid dipepetide frequency for Avian metaavulavirus 20

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.936AlaAla: 6.936 ± 2.343
0.991AlaCys: 0.991 ± 0.435
2.378AlaAsp: 2.378 ± 0.897
5.153AlaGlu: 5.153 ± 1.713
1.585AlaPhe: 1.585 ± 0.589
3.369AlaGly: 3.369 ± 0.966
0.991AlaHis: 0.991 ± 0.281
5.549AlaIle: 5.549 ± 1.136
2.576AlaLys: 2.576 ± 0.595
7.531AlaLeu: 7.531 ± 1.103
1.784AlaMet: 1.784 ± 0.583
2.378AlaAsn: 2.378 ± 0.775
0.991AlaPro: 0.991 ± 0.636
2.774AlaGln: 2.774 ± 0.861
2.774AlaArg: 2.774 ± 0.787
6.342AlaSer: 6.342 ± 1.75
4.558AlaThr: 4.558 ± 0.988
2.973AlaVal: 2.973 ± 1.004
0.198AlaTrp: 0.198 ± 0.226
1.585AlaTyr: 1.585 ± 0.551
0.0AlaXaa: 0.0 ± 0.0
Cys
1.982CysAla: 1.982 ± 0.856
0.595CysCys: 0.595 ± 0.286
0.991CysAsp: 0.991 ± 0.31
0.793CysGlu: 0.793 ± 0.251
0.198CysPhe: 0.198 ± 0.237
0.396CysGly: 0.396 ± 0.36
0.198CysHis: 0.198 ± 0.123
1.387CysIle: 1.387 ± 0.699
1.982CysLys: 1.982 ± 0.624
1.982CysLeu: 1.982 ± 0.621
0.396CysMet: 0.396 ± 0.334
1.982CysAsn: 1.982 ± 0.663
1.387CysPro: 1.387 ± 0.553
0.793CysGln: 0.793 ± 0.344
1.585CysArg: 1.585 ± 0.597
2.18CysSer: 2.18 ± 0.919
0.396CysThr: 0.396 ± 0.204
0.991CysVal: 0.991 ± 0.278
0.396CysTrp: 0.396 ± 0.388
0.595CysTyr: 0.595 ± 0.265
0.0CysXaa: 0.0 ± 0.0
Asp
2.973AspAla: 2.973 ± 0.504
0.396AspCys: 0.396 ± 0.208
4.954AspAsp: 4.954 ± 1.125
1.189AspGlu: 1.189 ± 0.409
2.378AspPhe: 2.378 ± 0.46
1.387AspGly: 1.387 ± 0.396
0.793AspHis: 0.793 ± 0.307
5.351AspIle: 5.351 ± 0.901
3.765AspLys: 3.765 ± 0.908
3.765AspLeu: 3.765 ± 0.636
0.793AspMet: 0.793 ± 0.387
3.171AspAsn: 3.171 ± 0.655
2.378AspPro: 2.378 ± 0.813
1.784AspGln: 1.784 ± 0.474
1.189AspArg: 1.189 ± 0.312
4.36AspSer: 4.36 ± 0.907
4.756AspThr: 4.756 ± 0.834
0.793AspVal: 0.793 ± 0.33
0.595AspTrp: 0.595 ± 0.415
1.585AspTyr: 1.585 ± 0.247
0.0AspXaa: 0.0 ± 0.0
Glu
1.982GluAla: 1.982 ± 0.759
1.387GluCys: 1.387 ± 0.535
2.378GluAsp: 2.378 ± 1.031
3.369GluGlu: 3.369 ± 0.544
3.765GluPhe: 3.765 ± 1.05
3.171GluGly: 3.171 ± 0.675
0.595GluHis: 0.595 ± 0.239
3.964GluIle: 3.964 ± 0.831
3.765GluLys: 3.765 ± 0.688
7.134GluLeu: 7.134 ± 1.452
1.189GluMet: 1.189 ± 0.284
1.784GluAsn: 1.784 ± 0.563
1.189GluPro: 1.189 ± 0.409
1.982GluGln: 1.982 ± 0.294
1.387GluArg: 1.387 ± 0.418
6.342GluSer: 6.342 ± 1.422
2.973GluThr: 2.973 ± 0.964
1.982GluVal: 1.982 ± 0.368
0.595GluTrp: 0.595 ± 0.257
1.585GluTyr: 1.585 ± 0.316
0.0GluXaa: 0.0 ± 0.0
Phe
0.793PheAla: 0.793 ± 0.33
0.991PheCys: 0.991 ± 0.294
1.189PheAsp: 1.189 ± 0.534
0.991PheGlu: 0.991 ± 0.614
2.774PhePhe: 2.774 ± 0.903
2.973PheGly: 2.973 ± 0.716
0.396PheHis: 0.396 ± 0.37
1.982PheIle: 1.982 ± 0.741
2.576PheLys: 2.576 ± 1.234
3.765PheLeu: 3.765 ± 0.872
1.189PheMet: 1.189 ± 0.44
2.18PheAsn: 2.18 ± 0.721
1.189PhePro: 1.189 ± 0.47
0.595PheGln: 0.595 ± 0.241
1.784PheArg: 1.784 ± 0.346
4.36PheSer: 4.36 ± 0.367
1.784PheThr: 1.784 ± 0.741
2.973PheVal: 2.973 ± 0.782
0.595PheTrp: 0.595 ± 0.308
0.396PheTyr: 0.396 ± 0.204
0.0PheXaa: 0.0 ± 0.0
Gly
2.18GlyAla: 2.18 ± 0.733
0.793GlyCys: 0.793 ± 0.658
2.774GlyAsp: 2.774 ± 0.874
5.351GlyGlu: 5.351 ± 0.94
1.387GlyPhe: 1.387 ± 0.766
2.973GlyGly: 2.973 ± 0.376
0.991GlyHis: 0.991 ± 0.364
3.369GlyIle: 3.369 ± 0.652
2.576GlyLys: 2.576 ± 1.35
5.747GlyLeu: 5.747 ± 0.86
1.189GlyMet: 1.189 ± 0.444
2.576GlyAsn: 2.576 ± 0.783
2.378GlyPro: 2.378 ± 0.456
2.378GlyGln: 2.378 ± 0.361
2.378GlyArg: 2.378 ± 0.366
3.964GlySer: 3.964 ± 0.658
2.576GlyThr: 2.576 ± 0.991
5.747GlyVal: 5.747 ± 1.425
0.198GlyTrp: 0.198 ± 0.226
1.784GlyTyr: 1.784 ± 0.373
0.0GlyXaa: 0.0 ± 0.0
His
1.387HisAla: 1.387 ± 0.423
0.595HisCys: 0.595 ± 0.304
0.793HisAsp: 0.793 ± 0.358
0.793HisGlu: 0.793 ± 0.491
0.396HisPhe: 0.396 ± 0.194
0.595HisGly: 0.595 ± 0.239
0.396HisHis: 0.396 ± 0.246
1.189HisIle: 1.189 ± 0.534
0.595HisLys: 0.595 ± 0.368
2.576HisLeu: 2.576 ± 0.949
0.198HisMet: 0.198 ± 0.123
1.387HisAsn: 1.387 ± 0.629
1.982HisPro: 1.982 ± 0.375
1.189HisGln: 1.189 ± 0.39
0.595HisArg: 0.595 ± 0.522
3.369HisSer: 3.369 ± 0.499
1.585HisThr: 1.585 ± 0.478
0.793HisVal: 0.793 ± 0.277
0.198HisTrp: 0.198 ± 0.237
0.595HisTyr: 0.595 ± 0.241
0.0HisXaa: 0.0 ± 0.0
Ile
5.351IleAla: 5.351 ± 0.922
1.189IleCys: 1.189 ± 0.564
3.765IleAsp: 3.765 ± 1.253
3.171IleGlu: 3.171 ± 0.558
2.378IlePhe: 2.378 ± 0.899
4.162IleGly: 4.162 ± 1.175
2.378IleHis: 2.378 ± 0.812
4.954IleIle: 4.954 ± 1.279
6.143IleLys: 6.143 ± 1.36
7.729IleLeu: 7.729 ± 1.753
1.387IleMet: 1.387 ± 0.453
2.774IleAsn: 2.774 ± 0.802
2.973IlePro: 2.973 ± 0.715
3.964IleGln: 3.964 ± 0.673
2.576IleArg: 2.576 ± 0.51
5.351IleSer: 5.351 ± 1.12
6.143IleThr: 6.143 ± 1.296
4.954IleVal: 4.954 ± 1.121
0.595IleTrp: 0.595 ± 0.257
0.991IleTyr: 0.991 ± 0.453
0.0IleXaa: 0.0 ± 0.0
Lys
2.774LysAla: 2.774 ± 0.64
1.585LysCys: 1.585 ± 0.294
2.378LysAsp: 2.378 ± 0.756
2.576LysGlu: 2.576 ± 0.679
1.784LysPhe: 1.784 ± 0.693
2.378LysGly: 2.378 ± 0.8
0.595LysHis: 0.595 ± 0.368
4.162LysIle: 4.162 ± 1.049
2.576LysLys: 2.576 ± 0.856
5.945LysLeu: 5.945 ± 0.946
1.982LysMet: 1.982 ± 0.729
3.369LysAsn: 3.369 ± 0.625
1.784LysPro: 1.784 ± 0.737
1.982LysGln: 1.982 ± 0.431
2.973LysArg: 2.973 ± 0.764
7.531LysSer: 7.531 ± 1.79
2.774LysThr: 2.774 ± 0.403
3.171LysVal: 3.171 ± 0.73
0.198LysTrp: 0.198 ± 0.123
1.585LysTyr: 1.585 ± 0.47
0.0LysXaa: 0.0 ± 0.0
Leu
6.936LeuAla: 6.936 ± 1.018
2.378LeuCys: 2.378 ± 0.548
6.54LeuAsp: 6.54 ± 0.861
4.954LeuGlu: 4.954 ± 0.887
4.756LeuPhe: 4.756 ± 1.433
5.747LeuGly: 5.747 ± 0.693
2.973LeuHis: 2.973 ± 1.218
6.342LeuIle: 6.342 ± 1.393
5.945LeuLys: 5.945 ± 1.089
9.909LeuLeu: 9.909 ± 2.317
2.18LeuMet: 2.18 ± 0.562
5.153LeuAsn: 5.153 ± 1.527
2.973LeuPro: 2.973 ± 0.615
3.567LeuGln: 3.567 ± 0.453
4.162LeuArg: 4.162 ± 1.246
11.891LeuSer: 11.891 ± 2.185
7.729LeuThr: 7.729 ± 1.606
4.954LeuVal: 4.954 ± 0.649
1.784LeuTrp: 1.784 ± 0.418
3.765LeuTyr: 3.765 ± 0.94
0.0LeuXaa: 0.0 ± 0.0
Met
1.784MetAla: 1.784 ± 0.585
0.396MetCys: 0.396 ± 0.208
1.189MetAsp: 1.189 ± 0.391
1.784MetGlu: 1.784 ± 0.886
0.396MetPhe: 0.396 ± 0.37
1.784MetGly: 1.784 ± 0.935
0.595MetHis: 0.595 ± 0.305
1.189MetIle: 1.189 ± 0.737
0.991MetLys: 0.991 ± 0.405
1.982MetLeu: 1.982 ± 0.397
0.595MetMet: 0.595 ± 0.265
0.595MetAsn: 0.595 ± 0.281
0.793MetPro: 0.793 ± 0.33
0.595MetGln: 0.595 ± 0.368
1.387MetArg: 1.387 ± 0.445
1.784MetSer: 1.784 ± 0.313
2.18MetThr: 2.18 ± 0.756
2.18MetVal: 2.18 ± 0.606
0.396MetTrp: 0.396 ± 0.256
0.991MetTyr: 0.991 ± 0.428
0.0MetXaa: 0.0 ± 0.0
Asn
2.18AsnAla: 2.18 ± 0.563
0.595AsnCys: 0.595 ± 0.51
3.171AsnAsp: 3.171 ± 0.815
2.973AsnGlu: 2.973 ± 1.02
0.991AsnPhe: 0.991 ± 0.323
1.982AsnGly: 1.982 ± 0.429
1.387AsnHis: 1.387 ± 0.68
1.982AsnIle: 1.982 ± 0.809
1.387AsnLys: 1.387 ± 0.502
5.549AsnLeu: 5.549 ± 1.044
1.585AsnMet: 1.585 ± 0.547
2.378AsnAsn: 2.378 ± 0.817
3.567AsnPro: 3.567 ± 0.957
4.36AsnGln: 4.36 ± 1.268
1.982AsnArg: 1.982 ± 0.814
4.756AsnSer: 4.756 ± 0.517
2.576AsnThr: 2.576 ± 1.168
2.973AsnVal: 2.973 ± 0.591
0.991AsnTrp: 0.991 ± 0.423
1.387AsnTyr: 1.387 ± 0.428
0.0AsnXaa: 0.0 ± 0.0
Pro
1.784ProAla: 1.784 ± 0.57
0.396ProCys: 0.396 ± 0.208
1.189ProAsp: 1.189 ± 0.395
1.982ProGlu: 1.982 ± 0.731
0.991ProPhe: 0.991 ± 0.281
2.576ProGly: 2.576 ± 1.038
0.396ProHis: 0.396 ± 0.246
3.765ProIle: 3.765 ± 0.758
2.774ProLys: 2.774 ± 0.57
3.171ProLeu: 3.171 ± 0.514
0.991ProMet: 0.991 ± 0.29
1.585ProAsn: 1.585 ± 0.532
2.378ProPro: 2.378 ± 0.977
3.171ProGln: 3.171 ± 1.243
2.774ProArg: 2.774 ± 0.642
3.567ProSer: 3.567 ± 1.657
2.18ProThr: 2.18 ± 0.536
2.378ProVal: 2.378 ± 0.601
0.0ProTrp: 0.0 ± 0.0
1.387ProTyr: 1.387 ± 0.495
0.0ProXaa: 0.0 ± 0.0
Gln
3.369GlnAla: 3.369 ± 1.041
0.793GlnCys: 0.793 ± 0.331
1.784GlnAsp: 1.784 ± 0.464
2.576GlnGlu: 2.576 ± 0.775
1.982GlnPhe: 1.982 ± 0.496
2.973GlnGly: 2.973 ± 0.61
1.189GlnHis: 1.189 ± 0.613
4.756GlnIle: 4.756 ± 1.357
1.784GlnLys: 1.784 ± 0.693
5.351GlnLeu: 5.351 ± 1.069
1.189GlnMet: 1.189 ± 0.439
0.991GlnAsn: 0.991 ± 0.423
2.774GlnPro: 2.774 ± 0.982
2.973GlnGln: 2.973 ± 1.298
1.189GlnArg: 1.189 ± 0.402
5.549GlnSer: 5.549 ± 2.078
3.567GlnThr: 3.567 ± 1.303
3.567GlnVal: 3.567 ± 0.746
0.396GlnTrp: 0.396 ± 0.246
1.585GlnTyr: 1.585 ± 0.897
0.0GlnXaa: 0.0 ± 0.0
Arg
2.378ArgAla: 2.378 ± 1.238
0.396ArgCys: 0.396 ± 0.262
1.784ArgAsp: 1.784 ± 0.49
2.774ArgGlu: 2.774 ± 0.846
1.982ArgPhe: 1.982 ± 0.812
2.378ArgGly: 2.378 ± 0.709
0.991ArgHis: 0.991 ± 0.435
2.774ArgIle: 2.774 ± 0.71
2.576ArgLys: 2.576 ± 0.78
5.747ArgLeu: 5.747 ± 0.584
0.595ArgMet: 0.595 ± 0.286
1.189ArgAsn: 1.189 ± 0.386
1.387ArgPro: 1.387 ± 0.399
1.585ArgGln: 1.585 ± 0.447
1.585ArgArg: 1.585 ± 0.617
3.567ArgSer: 3.567 ± 0.569
2.576ArgThr: 2.576 ± 0.561
3.567ArgVal: 3.567 ± 0.919
0.991ArgTrp: 0.991 ± 0.419
0.991ArgTyr: 0.991 ± 0.294
0.0ArgXaa: 0.0 ± 0.0
Ser
5.945SerAla: 5.945 ± 1.586
1.982SerCys: 1.982 ± 0.461
5.945SerAsp: 5.945 ± 1.039
4.954SerGlu: 4.954 ± 1.247
2.973SerPhe: 2.973 ± 0.781
6.54SerGly: 6.54 ± 1.249
2.774SerHis: 2.774 ± 0.642
7.134SerIle: 7.134 ± 0.7
3.964SerLys: 3.964 ± 1.359
10.107SerLeu: 10.107 ± 1.539
1.982SerMet: 1.982 ± 0.761
8.522SerAsn: 8.522 ± 1.398
3.171SerPro: 3.171 ± 1.107
7.927SerGln: 7.927 ± 1.586
3.369SerArg: 3.369 ± 0.588
11.494SerSer: 11.494 ± 2.082
5.351SerThr: 5.351 ± 0.551
6.143SerVal: 6.143 ± 0.554
1.387SerTrp: 1.387 ± 0.502
2.378SerTyr: 2.378 ± 0.806
0.0SerXaa: 0.0 ± 0.0
Thr
4.756ThrAla: 4.756 ± 0.88
2.378ThrCys: 2.378 ± 0.336
1.189ThrAsp: 1.189 ± 0.433
4.162ThrGlu: 4.162 ± 1.112
1.387ThrPhe: 1.387 ± 0.441
2.973ThrGly: 2.973 ± 0.343
1.387ThrHis: 1.387 ± 0.403
5.945ThrIle: 5.945 ± 1.402
2.576ThrLys: 2.576 ± 0.912
7.134ThrLeu: 7.134 ± 0.912
0.793ThrMet: 0.793 ± 0.321
2.576ThrAsn: 2.576 ± 1.081
2.576ThrPro: 2.576 ± 0.953
3.171ThrGln: 3.171 ± 0.806
3.765ThrArg: 3.765 ± 0.744
5.945ThrSer: 5.945 ± 0.955
6.738ThrThr: 6.738 ± 1.589
5.351ThrVal: 5.351 ± 1.147
0.793ThrTrp: 0.793 ± 0.321
0.793ThrTyr: 0.793 ± 0.298
0.0ThrXaa: 0.0 ± 0.0
Val
5.549ValAla: 5.549 ± 1.778
1.982ValCys: 1.982 ± 0.554
2.378ValAsp: 2.378 ± 0.931
1.784ValGlu: 1.784 ± 0.501
2.378ValPhe: 2.378 ± 0.55
3.171ValGly: 3.171 ± 0.937
1.784ValHis: 1.784 ± 0.511
4.558ValIle: 4.558 ± 0.909
3.765ValLys: 3.765 ± 0.743
5.351ValLeu: 5.351 ± 1.033
1.982ValMet: 1.982 ± 0.496
1.784ValAsn: 1.784 ± 0.697
1.982ValPro: 1.982 ± 0.416
3.369ValGln: 3.369 ± 1.042
2.973ValArg: 2.973 ± 0.477
7.134ValSer: 7.134 ± 1.07
3.369ValThr: 3.369 ± 0.622
2.378ValVal: 2.378 ± 0.624
0.793ValTrp: 0.793 ± 0.33
1.585ValTyr: 1.585 ± 0.572
0.0ValXaa: 0.0 ± 0.0
Trp
0.396TrpAla: 0.396 ± 0.204
0.396TrpCys: 0.396 ± 0.362
0.396TrpAsp: 0.396 ± 0.246
0.793TrpGlu: 0.793 ± 0.424
0.396TrpPhe: 0.396 ± 0.356
0.396TrpGly: 0.396 ± 0.256
0.0TrpHis: 0.0 ± 0.0
1.189TrpIle: 1.189 ± 0.409
0.595TrpLys: 0.595 ± 0.257
0.793TrpLeu: 0.793 ± 0.472
0.595TrpMet: 0.595 ± 0.244
0.793TrpAsn: 0.793 ± 0.263
0.595TrpPro: 0.595 ± 0.425
0.595TrpGln: 0.595 ± 0.317
0.595TrpArg: 0.595 ± 0.239
1.189TrpSer: 1.189 ± 0.558
0.595TrpThr: 0.595 ± 0.239
0.396TrpVal: 0.396 ± 0.327
0.198TrpTrp: 0.198 ± 0.237
0.396TrpTyr: 0.396 ± 0.246
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.982TyrAla: 1.982 ± 0.602
0.991TyrCys: 0.991 ± 0.327
1.189TyrAsp: 1.189 ± 0.414
0.396TyrGlu: 0.396 ± 0.208
0.595TyrPhe: 0.595 ± 0.3
1.387TyrGly: 1.387 ± 0.396
0.396TyrHis: 0.396 ± 0.246
1.387TyrIle: 1.387 ± 0.553
1.189TyrLys: 1.189 ± 0.404
3.369TyrLeu: 3.369 ± 0.866
0.793TyrMet: 0.793 ± 0.241
1.585TyrAsn: 1.585 ± 0.353
0.991TyrPro: 0.991 ± 0.502
1.784TyrGln: 1.784 ± 0.444
0.793TyrArg: 0.793 ± 0.371
3.369TyrSer: 3.369 ± 0.609
1.982TyrThr: 1.982 ± 0.461
1.784TyrVal: 1.784 ± 0.351
0.0TyrTrp: 0.0 ± 0.0
1.189TyrTyr: 1.189 ± 0.395
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (5047 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski