Neural implicit camera and geometry representations for multiview 3D reconstruction without camera parameters / by Mohamed Shawky Zaky AbdelAal Sabae ; Supervision Prof. Hoda Anis Baraka

By:

Mohamed Shawky Zaky AbdelAal Sabae [preparation.]

Contributor(s):

Hoda Anis Baraka [thesis advisor.]

Material type:

TextLanguage: English Summary language: English, Arabic Producer: 2025Description: 63 pages : illustrations ; 30 cm. + CDContent type:

text

Media type:

Unmediated

Carrier type:

volume

Other title:

التمثيلات الضمنية العصبية للكاميرا و الهندسة لإعادة البناء ثلاثي الأبعاد متعدد الرؤى بدون معلومات الكاميرا [Added title page title]

Subject(s):

DDC classification:

621.39

Available additional physical forms:

Issues also as CD.

Dissertation note: Thesis (M.Sc)-Cairo University, 2025. Summary: 3D reconstruction from multi-view RGB images is a fundamental problem in Computer Vision and Computer Graphics. Recently, neural surface reconstruc- tion methods have demonstrated their ability to reconstruct high-quality 3D geometry for complex scenes, but they typically require highly accurate multi- view camera parameters, limiting their real-world applicability. In this work, we present Bundle Adjusting-NeuS (BA-NeuS), a novel end-to-end optimization ap- proach built on top of neural surface reconstruction that can recover high-quality 3D geometry without the need for accurate camera parameters. We implicitly represent the camera parameters as a Multi-layer Perceptron (MLP) to stabilize the optimization process. Moreover, we introduce two additional loss functions, the point cloud alignment loss and the monocular depth loss, to further constrain the learned camera parameters and geometry. Our experiments on both object- level and scene-level datasets show that the proposed approach can reconstruct accurate scene surfaces while estimating relatively accurate camera parameters. Furthermore, the proposed approach significantly outperforms other baselines that optimize camera parameters, achieving a mean Chamfer distance of 0.86 and a mean absolute trajectory error of 3.28cm on the DTU dataset benchmark. Summary: إعادة بناء المشاهد ثلاثية الأبعاد من صور RGB متعددة الزوايا هى مشكلة أساسية في رؤية الحاسوب ورسومات الحاسوب. مؤخرًا، أظهرت طرق إعادة بناء الأسطح العصبية قدرتها على إعادة بناء هندسة ثلاثية الأبعاد عالية الجودة للمشاهد المعقدة، لكنها تتطلب عادةً معلومات كاميرا دقيقة للغاية، مما يحد من تطبيقاتها في العالم الحقيقي. في هذه الرسالة، نقدم BA-NeuS، وهو نهج تحسين شامل جديد يعتمد على إعادة بناء الأسطح العصبية، يمكنه استعادة هندسة ثلاثية الأبعاد عالية الجودة دون الحاجة إلى معلومات كاميرا دقيقة. نحن نمثل معلومات الكاميرا ضمنيًا كشبكة عصبية متعددة الطبقات (MLP) لتحقيق استقرار عملية التحسين. علاوة على ذلك، نقدم دالتين إضافيتين، وهما خسارة محاذاة سحابة النقاط وخسارة العمق الأحادي، لتقييد معلومات الكاميرا والهندسة المستفادة بشكل أكبر. تظهر تجاربنا على مجموعات البيانات على مستوى الكائن والمشهد أن النهج المقترح يمكنه إعادة بناء أسطح المشاهد بدقة بينما يقدر معلومات الكاميرا بدقة نسبية. علاوة على ذلك، يتفوق النهج المقترح بشكل كبير على الأساليب الأخرى التي تحسن معلومات الكاميرا، محققًا متوسط مسافة شامفر 0.86 ومتوسط خطأ المسار المطلق قدره 3.28 سم على معيار مجموعة بيانات DTU.

Tags from this library: No tags from this library for this title. Log in to add tags.

Average rating: 0.0 (0 votes)

Holdings
Item type	Current library	Home library	Call number	Status	Barcode
Thesis	قاعة الرسائل الجامعية - الدور الاول	المكتبة المركزبة الجديدة - جامعة القاهرة	Cai01.13.06.M.Sc.2025.Mo.N (Browse shelf(Opens below))	Not for loan	01010110092092000

Browsing المكتبة المركزبة الجديدة - جامعة القاهرة shelves Close shelf browser (Hides shelf browser)

Previous	No cover image available	No cover image available	No cover image available	No cover image available	No cover image available	No cover image available	No cover image available	Next
Previous	Cai01.13.06.M.Sc.2023.Ya.N A novel hybrid deep learning model for object detection on remote sensing images	Cai01.13.06.M.Sc.2024.Ah.T Transformer-based model for computer code generation to assist programmerse /	Cai01.13.06.M.Sc.2024.Wa.D Detecting And Masking Misogyny In Arabic Text Speech Using Deep Learning Techniques /	Cai01.13.06.M.Sc.2025.Mo.N Neural implicit camera and geometry representations for multiview 3D reconstruction without camera parameters /	Cai01.13.06.Ph.D.2003.Sh.N A new scalable quality guaranteed model for the real - time multimedia communications across the internet /	Cai01.13.06.Ph.D.2003.Sh.N A new scalable quality guaranteed model for the real - time multimedia communications across the internet /	Cai01.13.06.Ph.D.2005.Sa.D. Design and analysis of optical hierarchical interconnection networks for massively parallel systems /	Next

Thesis (M.Sc)-Cairo University, 2025.

Bibliography: pages 59-63.

3D reconstruction from multi-view RGB images is a fundamental problem in Computer Vision and Computer Graphics. Recently, neural surface reconstruc- tion methods have demonstrated their ability to reconstruct high-quality 3D geometry for complex scenes, but they typically require highly accurate multi- view camera parameters, limiting their real-world applicability. In this work, we present Bundle Adjusting-NeuS (BA-NeuS), a novel end-to-end optimization ap- proach built on top of neural surface reconstruction that can recover high-quality 3D geometry without the need for accurate camera parameters. We implicitly represent the camera parameters as a Multi-layer Perceptron (MLP) to stabilize the optimization process. Moreover, we introduce two additional loss functions, the point cloud alignment loss and the monocular depth loss, to further constrain the learned camera parameters and geometry. Our experiments on both object- level and scene-level datasets show that the proposed approach can reconstruct accurate scene surfaces while estimating relatively accurate camera parameters. Furthermore, the proposed approach significantly outperforms other baselines that optimize camera parameters, achieving a mean Chamfer distance of 0.86 and a mean absolute trajectory error of 3.28cm on the DTU dataset benchmark.

إعادة بناء المشاهد ثلاثية الأبعاد من صور RGB متعددة الزوايا هى مشكلة أساسية في رؤية الحاسوب ورسومات الحاسوب. مؤخرًا، أظهرت طرق إعادة بناء الأسطح العصبية قدرتها على إعادة بناء هندسة ثلاثية الأبعاد عالية الجودة للمشاهد المعقدة، لكنها تتطلب عادةً معلومات كاميرا دقيقة للغاية، مما يحد من تطبيقاتها في العالم الحقيقي. في هذه الرسالة، نقدم BA-NeuS، وهو نهج تحسين شامل جديد يعتمد على إعادة بناء الأسطح العصبية، يمكنه استعادة هندسة ثلاثية الأبعاد عالية الجودة دون الحاجة إلى معلومات كاميرا دقيقة. نحن نمثل معلومات الكاميرا ضمنيًا كشبكة عصبية متعددة الطبقات (MLP) لتحقيق استقرار عملية التحسين. علاوة على ذلك، نقدم دالتين إضافيتين، وهما خسارة محاذاة سحابة النقاط وخسارة العمق الأحادي، لتقييد معلومات الكاميرا والهندسة المستفادة بشكل أكبر. تظهر تجاربنا على مجموعات البيانات على مستوى الكائن والمشهد أن النهج المقترح يمكنه إعادة بناء أسطح المشاهد بدقة بينما يقدر معلومات الكاميرا بدقة نسبية. علاوة على ذلك، يتفوق النهج المقترح بشكل كبير على الأساليب الأخرى التي تحسن معلومات الكاميرا، محققًا متوسط مسافة شامفر 0.86 ومتوسط خطأ المسار المطلق قدره 3.28 سم على معيار مجموعة بيانات DTU.

Issues also as CD.

Text in English and abstract in Arabic & English.

There are no comments on this title.

to post a comment.

جامعة القاهرة

المكتبة المركزية الجديدة

مكتبة جامعة القاهرة الأهلية

Neural implicit camera and geometry representations for multiview 3D reconstruction without camera parameters / by Mohamed Shawky Zaky AbdelAal Sabae ; Supervision Prof. Hoda Anis Baraka

Browsing المكتبة المركزبة الجديدة - جامعة القاهرة shelves Close shelf browser (Hides shelf browser)